Job Description
We are looking for an experienced Data Architect with strong expertise in modern cloud data platforms, AI-ready architectures, and large-scale data engineering solutions. This role focuses on designing scalable data ecosystems, integrating AI/GenAI capabilities, and enabling enterprise-wide analytics and intelligent applications.
The ideal candidate should have deep experience in data architecture, lakehouse platforms, cloud technologies, ETL/ELT pipelines, and AI-integrated data systems.
Responsibilities
Architecture & Strategy
- Define and evolve enterprise data architecture across digital platforms
- Translate business requirements into scalable and resilient technical solutions
- Maintain architectural roadmaps, standards, and governance frameworks
- Collaborate with architects, analysts, engineers, and analytics teams
Data Engineering & Platform Development
- Design and implement modern ELT/ETL pipelines using Python, SQL, Spark, Scala, and Databricks
- Build cloud-native data platforms on Azure, AWS, or GCP
- Develop reusable AI accelerators, templates, and frameworks
AI & GenAI Integration
- Design AI-ready data models for:
- RAG (Retrieval-Augmented Generation)
- Agent orchestration
- Multimodal AI pipelines
- Implement vector databases, semantic retrieval, chunking logic, and hybrid search systems
- Support AI/ML and GenAI integration into enterprise platforms
Operations & Governance
- Ensure scalability, security, observability, and cost optimization
- Drive PoC-to-production transitions
- Apply DataOps and DevOps best practices with CI/CD automation
Requirements
- 12+ years of experience in data architecture and engineering
- Strong expertise in cloud data platforms (Azure / AWS / GCP)
- Experience with lakehouse architectures (Databricks, Microsoft Fabric)
- Advanced proficiency in Python, SQL, Spark, and Scala
- Experience with ETL/ELT tools like Informatica, Talend, or Fivetran
- Strong understanding of data modeling (ERD, Star Schema, Snowflake Schema)
- Knowledge of Docker and Kubernetes
- Experience integrating AI/ML and GenAI systems into enterprise platforms
- Familiarity with REST APIs, GraphQL, and event-driven systems
Skills
- Data Architecture
- Data Lakehouse Architecture
- Databricks / Microsoft Fabric
- Python / SQL / Spark / Scala
- ETL / ELT Pipelines
- AI & Generative AI Integration
- RAG Architecture
- Vector Databases & Semantic Search
- Data Modeling (Star/Snowflake Schema)
- Azure / AWS / GCP
- Docker / Kubernetes
- CI/CD & DataOps
- REST APIs / GraphQL