Data Engineer (Chevy Chase, MD)

  • National Geospatial-Intelligence Agency (NGA)
  • 5425 Wisconsin Ave Suite 301, Chevy Chase, MD 20815, USA
  • Nov 09, 2020
[Information Technology]

Job Description

Job Summary

ASSIGNMENT DESCRIPTION: Data Engineers develop, construct, test, and maintain architectures such as databases and large-scale data processing systems. They clean, prepare, and optimize data for consumption through the design and construction of massive reservoirs for big data. They solve problems associated with database access and integration and unstructured data sets to provide clean, usable data for customers and IT counterparts. These engineers serve as integrators between data architects, data scientists, and other data consumers. They apply knowledge of scripting tools, programming languages, standards, and software packages to build the data pipelines that enable faster, better, data-informed decision-making within the Agency.

Additional Job Information

ADDITIONAL INFORMATION: The National Geospatial-Intelligence Agency (NGA) serves as the world leader in providing timely, relevant, accurate, and actionable geospatial intelligence (GEOINT). NGA's civilian, military, and contract personnel evaluate imagery, maps, charts, multiple layers of foundation data (such as terrain, elevation, and gravity), and the full spectrum of visible and invisible light in order to help users visualize what is happening at a particular place and time.

NGA is in search of Data Engineers who understand data, data architectures, data pipeline infrastructure, cloud solutions, and security best practices. Data Engineers create data infrastructure and integrated data pipelines, transforming, enriching, and delivering data for consumers. They may work independently and/or as part of a team of data and non-data professionals on projects of varying complexity. They will design logical and physical data models for data warehousing and develop robust data models, data dictionaries, and data flow diagrams. Data Engineers must be able to operate at the tactical and strategic levels while working in a dynamic and sometimes ambiguous environment to deliver impactful results.
They will leverage their strong interpersonal skills to communicate effectively with mission owners at varying levels inside and outside the Agency. As a Data Engineer, you will be tasked against the Agency's biggest data challenges in support of missions such as National Intelligence, National Security, Military Operations, and Disaster Relief and Preparedness, for a diverse audience including the Department of Defense, the Intelligence Community, and senior government officials up to and including the United States Congress and the President of the United States.

Other duties may include:
* Utilize a variety of languages and tools (e.g., scripting languages) to build data pipelines that pull together information from different source systems.
* Design, construct, install, test, and maintain highly scalable data management systems.
* Develop data set processes for data discovery, modeling, mining, and production.
* Integrate and prepare large, complex data sets that meet functional/non-functional business requirements.
* Design and implement high-performance data pipelines for distributed systems and data analytics for customers.
* Build, deploy, operate, and maintain big data analytics infrastructure.
* Orchestrate large, PB-sized data storage and compute clusters across bare-metal and cloud.
* Deploy and manage infrastructures based on Docker, Kubernetes, or OpenStack, and public clouds such as Azure, AWS, or Google Cloud Platform.
* Create tool-chains for analytics and data scientist team members that assist them in building and optimizing AI workflows.
* Engineer data pipelines, from collection and processing to visualization, for SQL and NoSQL cybersecurity data sources used in threat analysis and modeling.
* Work with Agency data and machine learning experts to improve functionality in our data and model life cycle management capabilities.
* Develop enterprise interoperable coding and data standards and create processes to ensure use of enterprise-wide data architecture capabilities.
* Solve problems and deliver solutions with database access and data integration; define data conflation, fusion, and de-confliction rules.
* Collaborate with data architects, data scientists, data stewards, and customers to enable more efficient and effective data-informed decision-making.
* Apply expert knowledge of scripting, tools, programming languages, standards, and software packages to build databases and data pipelines.
* Accomplish data cleansing, preparation, storage, and security.
* Web-based data creation and editing through iD editor or JDSM.
* Data conflation activities through Hootenanny.
* Upgrades to the UI and system capabilities to enhance user experience.
* Use business analytics and data validation to improve campaign focus and output.
* Establish and sustain a NOME User Forum.
* Create and brief presentations around the community and conferences.

Additional Application Requirement: You may be asked to complete one or more assessments in addition to the application you submit on this website as part of your application to NGA. These assessments may include but are not limited to: 1) Online questionnaires or assessments that require you to describe your job-related knowledge, skills, abilities, or other characteristics that are aligned with the mandatory and desirable qualifications of this job posting. The information you provide in the application you submit must support the response you provide to this questionnaire. You will receive an email describing any additional assessments required. Please monitor your email and complete any required assessments as soon as possible.

Mandatory Qualifications

MANDATORY QUALIFICATION CRITERIA: For this particular job, applicants must meet all competencies reflected under the Mandatory Qualification Criteria to include education (if required).
Online applications must demonstrate qualification by providing specific examples and associated results in response to the mandatory criteria specified in this vacancy announcement:
1. Demonstrated proficiency in applied programming and/or manipulation of data with a programming language such as Python, R, or Java.
2. Demonstrated experience working with one or more database management systems, e.g., relational (Oracle, MySQL, PostgreSQL, etc.), NoSQL (MongoDB, Accumulo, etc.), or graph (AllegroGraph, JanusGraph, Neo4j, etc.), to include experience with database retrieval methods and optimizations.
3. Demonstrated experience working with a range of data storage/access options, e.g., file (S3, FTP, etc.) and/or API (WFS, FeatureServer, Elasticsearch, SDK, etc.).
4. Demonstrated experience enabling access to data by way of databases or dashboards.
5. Demonstrated experience cleaning, filtering, transforming, and/or enriching data.

EDUCATION REQUIREMENT:
A. Education: Bachelor's degree from an accredited college or university in Computer Science, Engineering, Information Science, Information Systems Management, Mathematics, Operations Research, Physical Sciences, Statistics, Technology Management, or a degree that provided a minimum of 24 semester hours in one or more of the fields identified above and required the development or adaptation of applications, systems, or networks.
-OR-
B. Combination of Education and Experience: A minimum of 24 semester (36 quarter) hours of coursework in any area listed in option A, plus experience in developing, constructing, testing, and maintaining architectures and/or related technologies, or in a related field that demonstrates the ability to successfully perform the duties associated with this work. As