Data Engineer

Boston, Massachusetts

NextGen Invent Corporation
Apply for this Job
About the job Data Engineer (Boston, USA)

Department: Data Engineering

Experience: 5+ Years

Job Location: Hybrid /Boston, MA

No. of Position: Multiple

Qualifications: Undergrad orHigher

Work Timings: 8:00 AM - 5:00 PM EST

Job Description:

Weare seeking an experienced Data Engineer with 5+ years of hands-on experience in building scalable datapipelines and data ingestion frameworks. The ideal candidate must have a strongcommand of Python, excellentskills in data pipeline creation, and a deep understanding of healthcaredata systems. Experience with Databricks and/or Snowflake ishighly desirable.

Key Responsibilities:
  • Design, develop, and maintain robust data pipelines to ingest, transform, and store data from multiple sources.
  • Build efficient and scalable ETL processes using Python, PySpark, and SQL.
  • Implement and optimize data workflows on AWS, Azure, or hybrid cloud environments.
  • Leverage Databricks, Snowflake, and/or Azure Data Factory for advanced data engineering solutions.
  • Collaborate with data scientists, analysts, and other engineers to ensure seamless data access and integration.
  • Ensure healthcare data security, compliance, and governance best practices are embedded into solutions.
  • Identify bottlenecks in data ingestion and recommend optimizations.
  • Develop automated solutions for data validation, monitoring, and reporting.
  • Stay current with evolving data engineering practices and tools, particularly in the healthcare sector.
Required Skills and Experience:
  • Bachelors degree in Computer Science, Engineering, or a related technical field.
  • Minimum of 5 years of experience in data engineering roles, with at least 2 years working with healthcare data (mandatory).
  • Strong proficiency in Python and SQL for data engineering tasks.
  • Solid experience with data ingestion, ETL/ELT pipeline creation, and large-scale data integration.
  • Experience with web scraping and data gathering from third-party sources is a plus
  • Familiarity with healthcare interoperability standards like HL7, FHIR, etc. is a plus
  • Exposure to automation tools like UI Path or Power Automate is a plus.
  • Familiarity with cloud platforms such as AWS or Azure.
  • Hands-on experience with Databricks and/or Snowflake (preferred).
  • Experience with PySpark for large data processing.
  • Deep understanding of HIPAA compliance and healthcare-specific data regulations.
  • Excellent analytical, troubleshooting, and communication skills.
  • This position does not offer visa sponsorship; applicants must have valid authorization to work in the U.S.
Date Posted: 01 May 2025
Apply for this Job