Job Description
Role Overview
We’re looking for a Data Engineer who’s passionate about building scalable, high-performance data solutions that empower analytics and business decisions. In this role, you’ll design, develop, and optimize data pipelines using PySpark, Databricks, and Delta Lake, ensuring data integrity and reliability across large distributed systems.
Key Responsibilities
- Build and maintain robust, high-performance data pipelines using PySpark, Databricks, and Delta Lake.
- Develop a strong understanding of data models, lineage, and business logic — not just ETL flow.
- Ensure data quality, consistency, and accuracy through validation, profiling, and automated testing.
- Debug, optimize, and fix issues across large-scale distributed systems
Ready to Apply?
Take the next step in your AI career. Submit your application to Roca Alliances S.A today.
Submit Application