[Remote] Senior Data Engineer - Forward Deployed
Note: This is a remote role open to candidates in the USA. Andela is a global talent marketplace that connects customers with remote technical talent. In the Senior Data Engineer role, you will work with an enterprise-scale organization in the automotive industry to enhance its data infrastructure and analytics capabilities, focusing on delivering actionable insights and ensuring data reliability.
Responsibilities
- Build and maintain Snowflake data pipelines for the Dealer 360 and Aftermarket workstreams
- Design and implement the dealer and aftermarket feature stores (Layer 1–2)
- Build ingestion pipelines for all external data sources (JD Power, PIN, S&P, Vehicle Registration, competitive scraping, plus 2 TBC)
- Write and maintain dbt models for data transformation, cleaning, and normalisation
- Enforce schema validation, data quality checks, and freshness SLAs across all feeds
- Collaborate with the Data Architect to implement the unified data model
- Produce documented data lineage for every pipeline before any model is trained against it
Skills
- 8+ years in data engineering on cloud platforms
- Snowflake — data modelling, query optimisation, staging environments
- Python — pandas, PySpark, data pipeline scripting
- Experience building feature stores for ML consumption
- Strong understanding of schema design and dimensional modelling
- Experience in automotive, retail, or dealer network data
- Familiarity with CRM data structures (for Aftermarket hire)
- Azure — Data Factory, Blob Storage, or Synapse
- Apache Airflow or similar orchestration tooling
- Azure DevOps for pipeline CI/CD