GCP Data Engineer with a client in Richardson Texas.
Mid level : $55/hr. c2c
Description:
GCP Data Engineer (w/ Machine Learning) - Good programming skills in Python, PySpark and knowledge of Client-Ops, building Client models using open source libraries.
- A mix of data engineering and data science skills - the primary focus been on data engineering.
Must Haves - 3+ years of knowledge in Big Data environment:
- GCP Big Query
- Hadoop Architecture
- HDFS commands
- Designing and optimizing queries to build data pipelines.
- 2+ years of experience:
- Building data marts and data models to support Data Science and other internal customers.
- Integrate data from a variety of sources and ensure adherence to data quality and accessibility standards.
- Understanding of Client-ops, Client modelling, track record to wrangle data for exploratory data analysis.
- Strong programming skills in:
- SQL
- Python
- Spark
- Scala / Java ( Good to have but not mandatory)
- Experience:
- Develop, build, and manage large-scale data structures and pipelines,
- Efficient Extract/Load/Transform (ETL) workflows to address complex problems and support business applications.
- Extensive experience with databases as well interpretation and manipulation of related data.
- Excellent verbal and written communication skills. Demonstrated ability to handle multiple assignments.
- GCP Certification - Preferred