Job Description
Role Introduction
Designs and owns the overall data platform architecture across a hybrid (public cloud + on-prem) topology. Defines how data flows from source systems through the lakehouse to consumption layers, and ensures the platform is governable, scalable, and cost-efficient at enterprise scale.
Features
- Onsite
Requirements
- Define the end-to-end lakehouse architecture using Apache Iceberg, Parquet, and Trino as the core open table format and query layer.
- Architect hybrid topology decisions, what runs on public cloud vs. on-prem, and how data and compute move between them.
- Set standards for table design, partitioning strategy, schema evolution, and metadata/catalog governance across Iceberg tables.
- Own technology selection and integration patterns for Informatica IDMC within the broader platform.
- Define data governance frameworks: lineage, cataloging, access control, and...
Ready to Apply?
Take the next step in your AI career. Submit your application to Confidential today.
Submit Application