Job Description
Citi is looking for a Senior Big Data Engineer to design, build, and optimize large-scale data pipelines and distributed data systems that power critical business intelligence across the organisation. Based in Pune and operating in a hybrid model, you will work within a high-performing engineering team where your expertise in PySpark, the Hadoop ecosystem, and streaming data platforms will directly shape the reliability and performance of Citi's data infrastructure.
**Responsibilities**
+ Build and maintain scalable data pipelines using PySpark within a Big Data environment to process and transform large volumes of structured and unstructured data.
+ Design and develop solutions across the Hadoop ecosystem (including Hive, HDFS, Sqoop, Spark, and Impala), along with Scala, to enable efficient data ingestion, processing, and storage.
+ Develop and manage real-time and batch data workflows using streaming data platforms, ensuring high availability and low-latency data deliv...