Job Description
McKesson, established in 1833, is a US Fortune 10 global healthcare leader operating across supply chain management, retail pharmacy, healthcare technology, oncology, and specialty care.
McKesson Compile, based in Bangalore, manages one of the most comprehensive healthcare data platforms in the US, covering:
- 2M+ healthcare professionals
- 800K+ healthcare facilities
- Medical & pharmacy claims, Medicare data, and provider affiliations
Compile transforms fragmented healthcare data into actionable intelligence that drives real-world healthcare and life sciences decisions.
🎯 Role Overview
As a Principal Data Platform Engineer, you will be the hands-on technical leader designing and building a modern, scalable, and secure data platform that powers data products across the organization.
This role is ideal for engineers passionate about clean architecture, distributed systems, and healthcare data challenges.
🛠Key Responsibilities
- Architect and lead development of a scalable, reusable data platform
- Design robust ETL / ELT pipelines for healthcare data
- Build high-performance APIs and internal tools using Django
- Orchestrate workflows using Prefect
- Implement distributed computing using Ray or Apache Spark
- Use Databricks for pipeline testing and validation
- Ensure data quality, reliability, and observability (Metaplane or similar)
- Manage data across Postgres, Snowflake, and Snowflake Shares
- Optimize cloud-native solutions in Azure
- Mentor engineers and collaborate across product, data, and platform teams
🧰 Tech Stack
Languages & Frameworks: Python (Django, FastAPI), SQL
Orchestration & Compute: Prefect, Ray, Apache Spark
Data: Postgres, Snowflake, dbt, Snowflake Shares
Cloud: Azure (Blob Storage, Data Factory, Azure Functions)
Testing & CI/CD: Pytest, GitHub Actions, Databricks
Observability: Metaplane or similar tools
Nice to Have:
- Apache Iceberg, Airbyte
- GenAI / LLM concepts (RAG, embeddings, vector stores)
👤 What We’re Looking For
- 15+ years of experience in data engineering or platform architecture
- Strong hands-on experience with ETL frameworks & distributed systems
- Proven expertise in Django-based API development
- Deep understanding of data modeling, warehousing, and pipeline reliability
- Experience with Azure, Snowflake, and Postgres
- Familiarity with data observability tools
- Healthcare or life sciences domain experience is a strong plus
🌱 Work Culture
- High-ownership, collaborative engineering environment
- Lean, fast-moving team solving complex domain problems
- Backed by McKesson, one of the world’s largest healthcare companies
- Careers with purpose, growth, and impact