Job Description

Key Responsibilities:

  • Lead the design, development, and optimization of scalable and secure data pipelines using AWS services such as Glue, S3, Lambda, EMR , and Databricks Notebooks , Jobs, and Workflows.
  • Oversee the development and maintenance of data lakes on AWS Databricks , ensuring performance and scalability.
  • Build and manage robust ETL/ELT workflows using Python and SQL , handling both structured and semi-structured data.
  • Implement distributed data processing solutions using Apache Spark PySpark for large-scale data transformation.
  • Collaborate with cross-functional teams including data scientists, analysts, and product managers to ensure data is accurate, accessible, and well-structured.
  • Enforce best practices for data quality, gov...

Ready to Apply?

Take the next step in your AI career. Submit your application to Blend360 India today.

Submit Application