Job Description
We are seeking a highly skilled Big Data Engineer with strong experience in Apache Spark, Hadoop ecosystem, and Apache Ozone. The ideal candidate will design, develop, and optimize large-scale data processing systems, ensuring high performance, scalability, and reliability for enterprise-level applications.
Key Responsibilities:
• Design and implement distributed data processing solutions using Apache Spark, Hadoop, Flink
• Develop and maintain Spark applications for data transformation, aggregation, and ETL processes using Scala, Java, or Python
• Utilize Apache Ozone for storing large-scale datasets, ensuring efficient data access and management in a distributed environment
• Manage and optimize HDFS and Apache Ozone, Kafka for scalable and fault-tolerant storage.
• Develop ETL pipelines for batch and real-time data ingestion and transformation.
• Implement and ensure data validation, data security, integrity, and compliance across ...
Ready to Apply?
Take the next step in your AI career. Submit your application to Mphasis today.
Submit Application