Job Title: Data Engineer
Reports to: Sr. Manager, Software Engineering in Technology
Job Type: Full-time
Location: Newport Beach, CA
Salary Range: $120K-$140K
About Our Organization: RIS Rx (pronounced "RISE") is a healthcare technology organization with a strong imprint in the patient access and affordability space. RIS Rx has quickly become an industry leader in delivering impactful solutions to stakeholders across the healthcare continuum. RIS Rx is proud to offer an immersive service portfolio to help address common access barriers. We don't believe in a "one size fits all" approach to our service offerings. Our philosophy is to bring forward innovation, value and service to everything that we do. This approach has allowed us to serve countless patients to help produce better treatment outcomes and improved quality of life. Here at RIS Rx, we invite our partners and colleagues to "Rise Up" with us to bring accessible healthcare and solutions for all.
Summary: We are seeking a highly skilled Data Engineer to join our team and play a crucial role in building and maintaining scalable, high-performance data pipelines and data lake architectures. You will be working with large data sets across multiple products, ensuring real-time data processing, data quality, and governance that support machine learning and model training efforts. The ideal candidate has deep expertise in SQL, PostgreSQL, AWS, data lakes, and big data tools like Spark, along with strong proficiency in Python and Golang.
Duties and Responsibilities include but are/ not limited to the following:
- Design, build, and maintain scalable ETL pipelines for processing structured and unstructured data.
- Develop and optimize data lakes using AWS S3, AWS Lake Formation, and Glue to enable efficient data storage and retrieval.
- Develop real-time data processing solutions to handle streaming data efficiently.
- Automate and monitor data workflows to ensure system reliability and performance.
- Optimize and manage PostgreSQL databases for performance and scalability.
- Utilize AWS services (e.g., S3, Glue, Lambda, Kinesis) for cloud-based data processing.
- Work with Apache Spark or similar big data processing frameworks.
- Ensure data quality, governance, and security best practices are implemented for data lakes and pipelines.
- Collaborate with Engineering team to support model training and deployment.
- Write efficient, well-structured SQL queries for data analysis and transformation.
- Develop and maintain data infrastructure using Python, Golang, Terraform, and Pulumi.
Qualifications/Skills:
To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
- 5+ years of experience in data engineering or a similar role.
- Strong expertise in SQL and database design, specifically in Postgres.
- Hands-on experience with AWS big data tools (e.g., S3, Glue, Lake Formation, Lambda, Kinesis).
- Proficiency in big data processing frameworks such as Apache Spark.
- Strong programming skills in Python/Golang and Terraform/Pulumi.
- Experience with real-time data processing using tools like Kafka or Kinesis.
- Implemented data governance, data quality, and security best practices.
- Experience working with machine learning pipelines and supporting model training.
- Experience with Scrum and Agile processes.
- Preferred experience with Power BI.
- Effective communication and problem-solving skills and the ability to work in a fast-paced environment.