Materi 13 · Applied AI

MLOps & Deployment

Model di Jupyter Notebook ≠ model production. Pelajari pipeline ML end-to-end: data versioning, model registry, deployment, monitoring, dan continuous training.

⏱ 30 Menit🎯 Advanced📚 Module 13/15

1. Apa Itu MLOps?

MLOps = praktik DevOps untuk machine learning. Mencakup automation, version control, monitoring, dan governance dari ML model di production.

"Hidden Technical Debt in ML Systems"

Paper Google 2015: ML code hanya ~5% dari ML system. 95% sisanya: data pipeline, feature engineering, deployment, monitoring, infrastructure. MLOps mengelola 95% ini.

2. ML Pipeline End-to-End

1Data Collection
2Data Validation
3Feature Engineering
4Model Training
5Model Eval
6Deployment
7Monitoring

3. Tooling Stack Modern

📦

Data Versioning

DVC, LakeFS, Pachyderm — git untuk dataset.

🧪

Experiment Tracking

MLflow, Weights & Biases, Neptune — log run, params, metrics.

🏭

Pipeline Orchestration

Airflow, Kubeflow, Prefect, Dagster.

📚

Model Registry

MLflow Registry, SageMaker Registry — versi model.

🚀

Serving

TorchServe, Triton, BentoML, FastAPI, KServe.

📊

Monitoring

Evidently, Arize, WhyLabs — drift detection.

4. Deployment Patterns

PatternKarakterCocok untuk
Batch PredictionPredict offline, simpan ke DBRecommendations harian
Real-time APIREST/gRPC endpointFraud detection, search
StreamingPredict pada event streamAnomaly detection IoT
Edge / On-deviceModel di smartphone/IoTPrivacy, low-latency
EmbeddedTertanam di app/librarySpeech recognition

5. Monitoring di Production

3 Tipe Drift yang Wajib Di-Monitor

1. Data Drift: distribusi input feature berubah dari training data. Mis. usia user shift dari 25-35 ke 18-25.
2. Concept Drift: hubungan input-output berubah. Mis. preferensi konsumen pasca-pandemic.
3. Model Performance Drift: accuracy turun seiring waktu. Cek dengan ground truth (jika tersedia).

6. CI/CD untuk ML

CI/CD/CT Pipeline CI: unit test, integration test, data validation
CD: deploy model ke staging → production
CT: Continuous Training — auto retrain saat ada
drift, data baru, atau scheduled cadence

7. Deployment Strategy

8. Studi Kasus

🌟 Real World

Gojek: Platform ML untuk 200+ Model di Production

Gojek punya internal platform ML (codename: "Marvel") yang melayani 200+ model di production: dynamic pricing, ETA prediction, fraud detection, recommendation, dst. Investasi besar di MLOps memungkinkan tim data scientist fokus pada modeling, bukan engineering.

Pelajaran: tanpa platform MLOps, scaling ML di company besar = chaos. Setiap tim reinvent pipeline = waste resource & inkonsisten.

📝 Tugas

Deploy First Model

  1. Train sederhana scikit-learn model (mis. iris classifier).
  2. Save model dengan joblib/pickle.
  3. Build FastAPI endpoint /predict yang load model & return prediction.
  4. Containerize dengan Docker.
  5. Deploy ke cloud (Hugging Face Spaces, Railway, atau Render — semua punya free tier).
  6. Test endpoint dengan curl atau Postman.
  7. Bonus: tambahkan logging, basic auth, rate limiting.

Rangkuman