Position: Senior Data Scientist
Core Responsibilities:
- Design and deploy scalable machine learning, data mining, and graph-based algorithms across large, complex datasets
- Partner with stakeholders to develop tailored solutions that extract actionable insights from disparate data sources
- Establish and maintain robust data quality control processes across multiple systems
Required Qualifications:
- Must have hands-on experience working with healthcare administrative claims data
- Demonstrated ability to analyze social media, graph, and time series datasets
- Proven experience building scalable algorithms for large datasets (1TB)
- 5+ years of programming in SQL, Cypher, Python, Python-Polars, and Python-SciKitLearn
- Strong understanding of clustering, feature extraction, embeddings, and graph-embedding techniques (eg, GraphSage, FastRP)
- Proficiency in working with relational, graph, and vector databases
- Cloud experience with deploying and scaling machine learning solutions
- Solid background in Linux-based software development
- BS or MS in Computer Science, Physics, or Engineering (Mechanical, Electrical, or Chemical)
Preferred Qualifications:
- PhD in Computer Science, Physics, or Engineering
- Experience with Rust or Mojo programming languages