Job Description
The System Analyst – Data Scientist Lead is responsible for leading AI and Data Science initiatives, developing machine learning solutions, and managing end-to-end data science projects. The role involves building predictive models, NLP applications, forecasting systems, and deploying scalable ML solutions that drive business value. The candidate will lead teams, collaborate with stakeholders, and ensure successful delivery of AI-powered products.
Responsibilities
Leadership & Project Management
- Lead data science projects from concept to deployment.
- Mentor and guide data scientists and engineers.
- Coordinate with business and technical stakeholders.
- Drive AI and machine learning strategy within the team.
Machine Learning Development
- Build end-to-end ML pipelines.
- Perform data preprocessing and feature engineering.
- Train, validate, and deploy machine learning models.
- Develop supervised, unsupervised, and deep learning solutions.
Natural Language Processing (NLP)
- Develop NLP applications using:
- LDA (Latent Dirichlet Allocation)
- Embeddings
- Retrieval-Augmented Generation (RAG)
- Build intelligent text analysis and AI-powered solutions.
Predictive Analytics & Forecasting
- Create time-series forecasting models.
- Perform statistical analysis and predictive modeling.
- Generate actionable business insights from data.
MLOps & Deployment
- Deploy machine learning models into production.
- Use MLflow, Kubeflow, and MLOps practices.
- Implement CI/CD pipelines for machine learning workflows.
- Monitor model performance and scalability.
Collaboration
- Work closely with Data Engineers, Developers, and Business Teams.
- Explain technical concepts to non-technical stakeholders.
- Support AI innovation and research initiatives.
Required Skills
Programming
- Python
Machine Learning
- Supervised Learning
- Unsupervised Learning
- Deep Learning
- Predictive Analytics
NLP (Natural Language Processing)
- LDA
- Embeddings
- RAG (Retrieval-Augmented Generation)
- Text Analytics
Frameworks & Libraries
- TensorFlow
- PyTorch
- Scikit-learn
- Django
Cloud & Platforms
- Databricks
- Azure Machine Learning
- Azure Cloud
MLOps
- MLflow
- Kubeflow
- CI/CD for ML
- Model Monitoring
Analytics & Statistics
- Statistical Modeling
- Forecasting
- Time Series Analysis
- Data Analysis
Data Science Workflow
- Data Engineering
- Data Preprocessing
- Feature Engineering
- Model Deployment
Experience Required
- 5+ years in Data Science.
- 2+ years in Team Leadership or Management.
- Experience delivering production-grade machine learning solutions.
Key Technologies
Python • NLP • RAG • LDA • Embeddings • Machine Learning • Deep Learning • TensorFlow • PyTorch • Scikit-learn • Databricks • Azure ML • Azure • MLflow • Kubeflow • Django • Forecasting • Statistical Modeling • Predictive Analytics • MLOps