Data Engineer - Tool Abstraction

Posted 2 Days Ago
Be an Early Applicant
Rockville, MD
115K-155K Annually
Mid level
Information Technology • Consulting
The Role
The Data Engineer will be responsible for designing and maintaining scalable data pipelines, automating ETL processes, collaborating with data science teams to harmonize datasets, and implementing data standardization practices. Responsibilities also include utilizing workflow management systems and containerization technologies to support clinical and research data projects.
Summary Generated by Built In

(ID: 2024-6798)


Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science, software engineering, and program management, we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH).

Axle is seeking a Data Engineer - Tool Abstraction to join our vibrant team at the National Institutes of Health (NIH) supporting the National Center for Advancing Translational Sciences (NCATS) located in Rockville, MD.


Benefits We Offer:

  • 100% Medical, Dental & Vision Coverage for Employees
  • Paid Time Off and Paid Holidays
  • 401K match up to 5%
  • Educational Benefits for Career Growth
  • Employee Referral Bonus
  • Flexible Spending Accounts:
    • Healthcare (FSA)
    • Parking Reimbursement Account (PRK)
    • Dependent Care Assistant Program (DCAP)
    • Transportation Reimbursement Account (TRN)

Axle Informatics is a bioinformatics and information technology company that offers innovative computer services, informatics, and enterprise solutions to research centers and healthcare organizations worldwide. Our mission is to empower decision-making and accelerate discovery in translational research. We are looking for a Data Engineer to support clinical and research data projects through pipeline development, data ingestion, harmonization, and containerization.

Key Responsibilities: 

  • Design, build, and maintain scalable and efficient data pipelines for clinical and research datasets.

  • Ensure the automation of data extraction, transformation, and loading (ETL) processes to support downstream analysis.

  • Collaborate with data science teams to ensure data is harmonized and consistent across different systems and workflows.

  • Ingest and harmonize large-scale clinical and research datasets from multiple sources.

  • Implement best practices for data standardization and cleaning to support various data users, including researchers, clinicians, and analysts.

  • Work closely with clinical data teams to ensure that datasets comply with research and healthcare standards.

  • Develop and optimize workflows using languages such as Snakemake, Nextflow, or a similar workflow language, to automate data processing.

  • Support continuous integration and delivery practices within data workflows.

  • Utilize Docker for containerization of workflows and data pipelines to ensure reproducibility and scalability.

  • Collaborate with multidisciplinary teams of data scientists, bioinformaticians, and software engineers to ensure data processes align with project goals.

  • Document pipeline processes and workflows to ensure transparency and reproducibility of data workflows.

Qualifications:

  • Master’s degree in Computer Science, Data Engineering, Bioinformatics, or related fields.

  • Proven experience building data pipelines and automating ETL processes.

  • Strong experience working with clinical data and familiarity with healthcare data standards and Common Data Models (e.g. CDISC, OMOP).

  • Hands-on experience with at least one workflow management system (e.g., Snakemake, Nextflow, or a similar tool).

  • Experience with Docker and containerization practices for deploying reproducible workflows.

  • Excellent scripting and programming skills in languages such as SQL, Python or Bash..

Nice to Have:

  • Experience with cloud-based services (e.g., AWS, GCP, or Azure) for data processing and pipeline scaling.

  • Familiarity with big data frameworks (e.g., Apache Spark or Hadoop).

  • Knowledge of version control systems, such as Git, for workflow management and collaboration.


Disclaimer: The above description is meant to illustrate the general nature of work and level of effort being performed by individuals assigned to this position or job description. This is not restricted as a complete list of all skills, responsibilities, duties, and/or assignments required. Individuals may be required to perform duties outside of their position, job description or responsibilities as needed.

The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race, gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation, status with respect to public assistance, and other characteristics protected under state, federal, or local law and to deter those who aid, abet, or induce discrimination or coerce others to discriminate.

Accessibility: If you need an accommodation as part of the employment process please contact: [email protected]

This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidate’s experience, qualifications, skills, and location.

Salary Range

$115,000$155,000 USD

Top Skills

Bash
Python
SQL
The Company
HQ: Rockville, MD
191 Employees
On-site Workplace
Year Founded: 2002

What We Do

Axle Informatics is a bioscience and information technology company that offers advancements in translational research, health informatics, and data science applications to research centers and healthcare organizations around the globe. With experts in biomedical science, software engineering, and program management, we develop and apply research tools and techniques that empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH) by offering the responsiveness of a small business coupled with the experience, breadth, and depth of a large organization.

Similar Jobs

PwC Logo PwC

Cloud Data & Analytics Senior Manager (Financial Services-Insurance)

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Hybrid
Baltimore, MD, USA
364000 Employees
130K-256K Annually

Ahold Delhaize USA Logo Ahold Delhaize USA

In Stock Analyst II

AdTech • eCommerce • Food • Marketing Tech • Retail
Hyattsville, MD, USA
10000 Employees

BAE Systems, Inc. Logo BAE Systems, Inc.

SIGINT Cyber Analyst - Level 2

Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
Hybrid
Fort Meade, MD, USA
40000 Employees
112K-191K Annually

BAE Systems, Inc. Logo BAE Systems, Inc.

SIGINT Cyber Analyst - Level 3

Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
Hybrid
Fort Meade, MD, USA
40000 Employees
127K-215K Annually

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees
HERE Thumbnail
Software • Logistics • Information Technology
Amsterdam, NL
9000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account