Job Title: Data Engineer
Location: Hybrid (Up to 4 days in office) 1790 Ash St SE, Washington, DC 2003
Clearance: Ability to Obtain Public Trust Clearance (U.S. Citizenship Required)
Position Type: Full-Time
Salary: $160,000 - $170,000
Position Overview
We are seeking a Data Engineer to support a federal agency. The Data Engineer will design, build, and maintain secure, high-performance data pipelines that integrate diverse systems into the agency’s Integrated Data Environment. This role emphasizes expertise in ETL orchestration, cloud-native architectures, and metadata governance to enable enterprise-scale analytics and AI. The position offers the opportunity to work with modern platforms such as Databricks, Apache Spark, Enlighten BDP, and Collibra, directly contributing to the agency’s digital modernization, compliance readiness, and mission support.
Key Responsibilities
Data Pipeline Development
- Build and orchestrate ETL/ETI workflows using Apache Spark, Airflow, Kafka, and AWS Glue.
- Develop pipelines in Databricks to process both batch and streaming data.
- Translate mission requirements into optimized SQL queries and reusable data structures.
- Identify data gaps, design test cases, and perform data quality assessments to ensure reliable pipelines.
Architecture & Governance
- Implement lakehouse architectures using Enlighten BDP and Delta Lake for secure, ACID-compliant storage.
- Manage metadata, data tagging, and lineage through Collibra to ensure governance and interoperability.
- Support the identification and maintenance of authoritative and trusted data sources across the enterprise.
Cloud & DevOps Integration
- Develop and maintain CI/CD pipelines using GitHub or GitLab for automated deployment.
- Integrate and secure APIs for scalable data exchange across platforms.
- Support A&A compliance activities, including eMASS security assessments, POA&M tracking, and adherence to FedRAMP and FISMA standards.
Collaboration & Support
- Partner with Data Scientists and Automation/AI Engineers to operationalize AI/ML models within production pipelines.
- Leverage Jupyter Notebooks and Anaconda for prototyping and collaborative development.
- Provide technical documentation, data dictionaries, and reproducible workflows.
- Contribute to stakeholder deliverables including reports, dashboards, and strategic briefings, with visualization support using Qlik where applicable.
Required Qualifications
- Bachelor’s degree in Computer Science, Data Engineering, or related field.
- Minimum of 7 years of experience in data engineering, ETL/ETI processes, or big data environments.
- Proficiency with SQL, Python, and Spark-based frameworks.
- Experience with Databricks, Hadoop, Kafka, Airflow, or AWS Glue.
- Familiarity with Git-based version control and CI/CD practices.
Preferred Qualifications
- Certifications such as AWS Data Engineer Associate, AWS Certified Data Analytics, or Microsoft Certified: Azure Data Engineer Associate.
- Knowledge of Collibra and Enlighten BDP environments.
- Experience supporting federal data governance, security compliance, and ATO readiness.
Pay: $160,000.00 - $170,000.00 per year
Benefits:
- 401(k)
- 401(k) matching
- Dental insurance
- Flexible spending account
- Health insurance
- Paid time off
- Vision insurance
Work Location: Hybrid remote in Washington, DC 20032