Job Description

Job Description

Senior Software Engineer – Data Engineer: Python/Scala Developer with Spark and Snowflake )

Responsibilities

  • Hands‑on experience implementing an AWS Big Data Lake using EMR and Spark.
  • Working experience with Spark, Hive, message queue or Pub/Sub, and streaming technologies (3+ years).
  • Experience developing data pipelines using languages such as Python, Scala, SQL and open‑source frameworks for data ingest, processing, and analytics.
  • Leveraging open‑source big data processing frameworks such as Apache Spark, Hadoop, and streaming technologies such as Kafka.
  • Hands‑on experience with newer technologies relevant to the data space such as Spark, Airflow, Apache Druid, Snowflake (or any other OLAP database).
  • Developing and deploying data pipelines and real‑time data streams within a cloud‑native infrastructure, preferably AWS.
  • Using CI/CD pipelines (GitLab).
  • Implemen...

Ready to Apply?

Take the next step in your AI career. Submit your application to Capgemini today.

Submit Application