Job Description

  • Design, develop, maintain efficient and scalable solutions using PySpark
  • Ensure data quality and integrity by implementing robust testing, validation and cleansing processes
  • Integrate data from various sources, including databases, APIs, external datasets etc.
  • Optimize and tune PySpark jobs for performance and reliability
  • Document data engineering processes, workflows and best practices
  • Strong understanding of databases, data modelling, and ETL tools and processes
  • String programming skills in python and proficiency with PySpark, SQL
  • Experience with relational databases, Spark, AWS, Python skill
  • Excellent communication and collaboration skills

Key Responsibilities:

  • Design and Development: Create, develop, and maintain robust solutions using PySpark to handle large-scale data processing.
  • Data Quality Assurance: Implemen...

Ready to Apply?

Take the next step in your AI career. Submit your application to Vimerse Infotech today.

Submit Application