Job Description

Job Summary:

We are looking for a detail-oriented AI Data Engineer to join our team. Your primary mission will be to build the foundational datasets used to train our next generation of models. Unlike traditional data roles, this position requires a blend of creative engineering and rigorous scientific evaluation.

You will spend your time navigating the vast landscape of public data and leveraging state-of-the-art Generative AI techniques to fill gaps where real-world data is scarce, biased, or restricted. You will be responsible for the entire data-to-training pipeline, ensuring that every byte of information fed into our models is clean, diverse, and ethically sourced.

Core Responsibilities

  • Design and maintain scalable pipelines to collect data from public sources (web scraping, public APIs, and open-source repositories).
  • Utilize LLMs, GANs, or heuristic-based simulations to generate high-fidelity synthetic data to augment training sets. <...

Ready to Apply?

Take the next step in your AI career. Submit your application to Sophic Automation Sdn Bhd today.

Submit Application