Must have the ability to obtain a security clearance.
Overview
Join our innovative team as a Data Engineer and become a vital driver of our data-driven decision-making processes. In this dynamic role, you will design, develop, and maintain scalable data pipelines and architectures that empower our organization to harness the full potential of big data. Your expertise will enable seamless integration of diverse data sources, optimize data workflows, and support advanced analytics initiatives. If you thrive in a fast-paced environment and are passionate about transforming raw data into actionable insights, this opportunity is for you!
Responsibilities
- Develop, implement, and optimize robust ETL (Extract, Transform, Load) processes to move data efficiently across systems, using tools such as Informatica and Talend along with shell scripting.
- Design and maintain scalable data warehouses and data lakes on platforms such as Azure Data Lake and Hadoop ecosystems to support large-scale analytics.
- Build and manage complex SQL databases using Microsoft SQL Server, Oracle, and other relational database systems; ensure their performance, security, and reliability.
- Collaborate with cross-functional teams to understand data requirements and translate them into technical solutions utilizing Python, Java, Bash (Unix shell), and RESTful APIs.
- Leverage big data technologies like Apache Hive, Spark, and Hadoop to process vast datasets efficiently while ensuring data quality and consistency.
- Integrate linked data sources to enrich datasets for analysis; use tools such as Looker for visualization and reporting.
- Support model training and analysis activities by providing clean, well-structured datasets; contribute to continuous improvement of data models through iterative testing.
- Participate in Agile development cycles to deliver high-quality solutions rapidly; document processes thoroughly for ongoing maintenance.
Qualifications
- Proven experience designing and implementing large-scale data pipelines using AWS cloud services such as AWS Glue or S3; familiarity with Azure Data Lake is a plus.
- Strong programming skills in Python, Java, VBA, or Bash (Unix shell scripting) for automation and customization tasks.
- Extensive knowledge of SQL databases, including Microsoft SQL Server and Oracle, and experience with data warehouse concepts.
- Hands-on experience with big data frameworks such as Spark and Hadoop ecosystem components (HDFS, Hive).
- Proficiency with ETL tools like Talend or Informatica; understanding of RESTful API integration for data exchange.
- Familiarity with analytics platforms such as Looker for creating dashboards and reports that drive business insights.
- Ability to design efficient database schemas and optimize query performance; strong analysis skills to interpret complex datasets.
- Knowledge of model training techniques for predictive analytics; experience working within Agile development methodologies is preferred.
Join us to leverage your technical expertise in a collaborative environment where innovation meets impact!
Pay: $114,880.73 - $138,350.98 per year
Benefits:
- 401(k)
- Dental insurance
- Health insurance
- Paid time off
- Tuition reimbursement
- Vision insurance
Work Location: In person