You are viewing a preview of this job. Log in or register to view more details about this job.

Data Engineer

About the role

We're looking for a Data Engineer with 3–5 years of hands-on experience to take ownership of complex data infrastructure across client engagements. You'll go beyond building pipelines — you'll architect them, optimize them, and make key design decisions that shape how clients store, move, and use their data.

This role sits at the intersection of deep technical execution and client delivery. You'll be expected to work independently, mentor junior engineers, and lead the technical conversation with clients from requirements through to production.

What you'll do

Design, build, and own end-to-end data pipelines across platforms including Databricks, Palantir Foundry, and cloud-native services
Architect data models and schemas optimized for performance, scalability, and maintainability — including complex ERD design across multi-source environments
Lead data integration efforts connecting disparate databases, APIs, and enterprise applications into unified data environments
Implement and enforce data quality frameworks, validation rules, and pipeline monitoring to ensure reliability in production
Develop and maintain orchestration workflows using tools such as Apache Airflow, dbt, or equivalent
Build and deploy data-driven applications and dashboards on top of the infrastructure you design
Lead technical discovery sessions with clients, translating complex business requirements into scalable data solutions
Review and provide feedback on junior engineers' code, models, and pipeline designs
Identify performance bottlenecks in existing pipelines and drive optimization efforts
Maintain thorough documentation of architecture decisions, data lineage, and pipeline logic

Requirements

3–5 years of experience in data engineering or a closely related role
Advanced proficiency in SQL, including query optimization, indexing strategies, and complex transformations
Strong Python skills applied to data pipeline development, automation, and data processing
Demonstrated experience with enterprise data platforms — Databricks, Palantir Foundry, Snowflake, or similar
Deep understanding of data modeling methodologies including star/snowflake schemas, data vault, and dimensional modeling
Hands-on experience building and maintaining production-grade ETL/ELT pipelines
Familiarity with cloud infrastructure (AWS, Azure, or GCP) as it relates to data architecture and storage
Experience with workflow orchestration tools (Airflow, dbt, Prefect, or similar)
Ability to lead client-facing technical discussions and independently manage deliverables
Strong written and verbal communication skills; comfortable translating technical concepts for non-technical audiences