Job Description

Job Description
An employer is seeking a Data Engineer II (ML Training & Multi-Source Integration) to join a large healthcare client supporting the AI Insight & Next Best Action platform. The project focuses on building and scaling the data layer that powers machine learning models, including integrating multiple data sources, developing feature pipelines, and enabling high-quality, production-ready ML datasets.
Responsibilities will include:

Build and maintain Feature Store pipelines that ingest and process behavioral, clinical, engagement, and Rx data signals
Design and develop ML training datasets, including batch and real-time feature pipelines, dataset versioning, and training/evaluation splits
Integrate and normalize multi-source data such as Kafka event streams, Adobe Analytics data, and healthcare datasets
Develop and optimize large-scale data processing jobs using Apache Spark (Dataproc) for feature engineering and model input preparation
Monitor and...

Ready to Apply?

Take the next step in your AI career. Submit your application to Insight Global today.

Submit Application