Senior Data Engineer

May 20, 2026

Job Description

A Senior Data Engineer is responsible for designing, building, and maintaining scalable data systems and pipelines that support analytics, reporting, and business intelligence.

This role focuses on:

  • Data engineering
  • ETL workflows
  • Data warehousing
  • Data lakes
  • Big data processing
  • Data architecture

The engineer ensures that enterprise data is:

  • Reliable
  • Structured
  • Scalable
  • High quality
  • Easily accessible for analytics and reporting

The role also involves collaboration with:

  • Backend teams
  • Frontend teams
  • Data analysts
  • BI teams
  • AI/ML teams

to support modern data-driven applications and platforms.


Responsibilities

Data Pipeline Development

  • Design and build scalable data pipelines
  • Develop ETL and orchestration workflows
  • Handle data ingestion, transformation, and storage
  • Process structured and large-scale datasets

Data Warehousing & Lakehouse

  • Work with modern data platforms and warehouses
  • Implement scalable data lake and lakehouse architectures
  • Ensure proper data modelling and organization

Data Quality & Optimization

  • Ensure data accuracy and consistency
  • Optimize pipeline performance and scalability
  • Troubleshoot data processing issues
  • Improve reliability of data systems

Architecture & Collaboration

  • Participate in technical discussions on data architecture
  • Collaborate with backend/frontend teams
  • Support efficient data access for applications and reporting tools

Documentation & Standards

  • Document:
    • Data flows
    • Transformations
    • Assumptions
    • Architecture decisions
  • Follow best practices in data engineering

Required Skills

Programming Skills

  • Python
  • SQL

Data Engineering Skills

  • Data Processing
  • ETL Pipelines
  • Data Orchestration
  • Data Modelling
  • Data Warehousing

Data Platforms & Warehouses

  • Databricks
  • Azure Synapse Analytics

Big Data & Processing Tools

  • Apache Spark
  • dbt
  • DuckDB
  • Parquet
  • Delta Lake

Data Lake Technologies

  • Amazon S3
  • ADLS (Azure Data Lake Storage)
  • GCS (Google Cloud Storage)

Visualization & Reporting

  • BI Tools
  • Analytics Platforms
  • Reporting Systems

Cloud & DevOps Knowledge

  • CI/CD Practices
  • Cloud Data Platforms
  • AI/ML Workflow Exposure

Preferred Skills

  • AI/ML pipeline exposure
  • Data platform optimization
  • Cross-functional collaboration
  • Modern lakehouse architecture experience