MLOps Course Repository – Complete Learning Guide

Welcome to the MLOps Course Repo, a hands-on, step-by-step repository designed for students and beginners to advanced learners to build a full production-grade MLOps pipeline.

This repository covers real-world tools, secure practices, and best practices used in the industry.

🚀 Course Roadmap

This course takes you from the basics of ML and DevOps to full production-grade MLOps:

Environment Setup – Python, Docker, Kind, MLflow
Data Management – Raw & processed datasets, version control with DVC
Source Code & Pipelines – Modular ML pipelines: data ingestion, preprocessing, features, training, evaluation, prediction
Experiment Tracking – MLflow experiments and reproducibility
Model Serving – REST API with FastAPI, serverless with KServe and Seldon
Testing & CI/CD – Unit tests, integration tests, GitHub Actions
Orchestration – Airflow DAGs & Kubeflow pipelines
Deployment – Kubernetes manifests, autoscaling, secure namespaces & RBAC
GitOps – Continuous deployment using ArgoCD
Monitoring – Infrastructure & application monitoring with Prometheus & Grafana
Model Monitoring – Drift detection with Evidently AI
Feature Store – Production-grade features using Feast
Infrastructure as Code – Terraform for AWS EKS, S3, IAM, and KMS
End-to-End Projects – Resume-ready, fully functional ML pipelines

📂 Folder Structure & Learning Index

Folder	Description	Learning Outcomes
`01-setup`	Environment setup for Python, Docker, Kind, MLflow	Learn to create a local dev environment and containerized Kubernetes cluster
`02-data`	Raw & processed datasets	Data management, preprocessing, and DVC versioning
`03-src`	ML source code	Data ingestion, validation, preprocessing, feature engineering, training, evaluation, prediction
`03-src/01-data`	Data ingestion & validation	Use Great Expectations for data quality checks
`03-src/02-features`	Feature engineering	Integrate with Feast Feature Store
`03-src/03-models`	Training & evaluation	Log metrics & artifacts using MLflow
`03-src/04-pipelines`	Pipeline scripts	Modular pipelines for local, Airflow, and Kubeflow execution
`03-src/05-utils`	Utility scripts	Logging, configuration, security (secrets management, encryption)
`04-notebooks`	EDA & prototyping	Jupyter notebooks for exploring datasets & features
`05-dvc`	DVC pipelines	Version datasets and maintain reproducibility
`06-experiments`	MLflow experiment tracking	Track parameters, metrics, artifacts, and compare models
`07-serving`	Model serving	Build production-ready REST APIs with FastAPI, KServe, and Seldon
`08-tests`	Unit & integration tests	Test data processing, models, pipelines, and APIs
`09-ci-cd`	GitHub Actions workflows	Automate training, testing, and deployment
`10-k8s`	Kubernetes manifests	Deploy models securely with Namespaces, RBAC, Network Policies, HPA autoscaling
`11-kubeflow`	Kubeflow pipelines	Orchestrate ML workflows for production
`12-gitops`	ArgoCD GitOps deployment	Automate deployment using GitOps practices
`13-monitoring`	Prometheus & Grafana	Monitor infrastructure and applications
`14-monitoring-ml`	Model monitoring & drift detection	Detect data/model drift and generate reports using Evidently AI
`15-feature-store`	Feast Feature Store	Manage and serve production-grade features
`16-terraform`	AWS infrastructure as code	Provision EKS cluster, S3, IAM, KMS securely
`17-projects`	End-to-end ML projects	Hands-on projects for practice and portfolio building

⚡ Quick Start Guide

Bash
# Clone repository
git clone https://github.com/username/mlops.git
cd mlops

# Setup Python environment
bash 01-setup/01-python/setup-venv.sh

# Optional: Start MLflow tracking server
bash 01-setup/04-mlflow/start-mlflow.sh

Follow the folders in numerical order for a sequential, hands-on learning experience.

🛠 Tools & Security Features

Layer	Tools / Practices
ML Tracking	MLflow
Data Versioning	DVC
Feature Store	Feast
Orchestration	Airflow, Kubeflow
Model Serving	FastAPI, KServe, Seldon
CI/CD	GitHub Actions, Argo Workflows, ArgoCD
Deployment	Kubernetes: Namespaces, RBAC, Network Policies, HPA
Monitoring	Prometheus, Grafana, Evidently AI
Infrastructure	Terraform: AWS EKS, S3, IAM, KMS
Security	Secrets encryption, RBAC, Network Policies
Testing	Pytest for unit, integration, and API tests
Containerization	Docker with secure images

💡 Learning Tips for Students

01-setup → Ensure your environment is ready.
02-data → Practice data ingestion, preprocessing, and DVC.
03-src → Implement models, pipelines, and experiment logging.
04-notebooks → Visualize data and models.
07-serving → Deploy your first ML model API.
08-tests → Ensure code quality.
09-ci-cd → Automate training & deployment.
10-k8s → Learn secure Kubernetes practices.
11-kubeflow → Build end-to-end production pipelines.
12-gitops → Automate deployments via GitOps.
13-monitoring & 14-monitoring-ml → Monitor infrastructure and models.
15-feature-store → Manage features effectively.
16-terraform → Provision cloud infrastructure.
17-projects → Build real projects for practice & portfolio.

Note: Each folder contains placeholder scripts to be filled in while following the course. Follow the sequence for maximum learning efficiency.