A model in a Jupyter Notebook ≠ a model in production. Learn the end-to-end ML pipeline: data versioning, model registry, deployment, monitoring, and continuous training.
MLOps = DevOps practices applied to machine learning. It covers automation, version control, monitoring, and governance of ML models in production.
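A 2015 Google paper ("Hidden Technical Debt in Machine Learning Systems") argued that ML code is only a small fraction, often quoted as ~5%, of a real ML system. The remaining ~95% is data pipelines, feature engineering, deployment, monitoring, and infrastructure. MLOps manages that 95%.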
DVC, LakeFS, Pachyderm: Git for datasets.
MLflow, Weights & Biases, Neptune: log runs, params, and metrics.
Airflow, Kubeflow, Prefect, Dagster: pipeline orchestration.
MLflow Registry, SageMaker Registry: model versioning.
TorchServe, Triton, BentoML, FastAPI, KServe: model serving.
Evidently, Arize, WhyLabs: drift detection.
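To make the first two tool categories concrete, here is a minimal sketch of the underlying idea: identify a dataset by a hash of its content, then record that hash next to the params and metrics of each run. This is not how DVC or MLflow are implemented or called; the function names (`dataset_fingerprint`, `log_run`) and the sample values are assumptions made up for illustration.

```python
import hashlib
import json


def dataset_fingerprint(rows):
    """Return a content hash for a list of JSON-serializable records."""
    digest = hashlib.sha256()
    for row in rows:
        # sort_keys makes the hash independent of dict key order
        digest.update(json.dumps(row, sort_keys=True).encode("utf-8"))
        digest.update(b"\n")
    return digest.hexdigest()


def log_run(registry, params, metrics, rows):
    """Append one run record that ties params and metrics to a data version."""
    record = {
        "data_version": dataset_fingerprint(rows),
        "params": params,
        "metrics": metrics,
    }
    registry.append(record)
    return record


# Illustrative data and metric values only (hypothetical, not real results)
v1 = [{"age": 31, "label": 1}, {"age": 24, "label": 0}]
v2 = v1 + [{"age": 19, "label": 0}]

runs = []
log_run(runs, {"max_depth": 3}, {"auc": 0.81}, v1)
log_run(runs, {"max_depth": 3}, {"auc": 0.84}, v2)
```

Because the two runs carry different `data_version` values, a change in a metric can be traced to a change in data rather than in hyperparameters, which is the question these tools are meant to answer.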
| Pattern | Characteristics | Best suited for |
|---|---|---|
| Batch Prediction | Predict offline, store results in a DB | Daily recommendations |
| Real-time API | REST/gRPC endpoint | Fraud detection, search |
| Streaming | Predict on an event stream | IoT anomaly detection |
| Edge / On-device | Model runs on a smartphone/IoT device | Privacy, low latency |
| Embedded | Bundled inside an app/library | Speech recognition |
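The batch prediction pattern in the first row can be sketched as follows: a scheduled job scores every user offline and writes the results to a table, and the serving path only does a key lookup. This is a minimal illustration using an in-memory SQLite database; the `score` rule, the table name, and the feature names are hypothetical stand-ins, not a real model or schema.

```python
import sqlite3


def score(features):
    """Hypothetical stand-in for a trained model: a fixed linear rule."""
    return 0.1 * features["clicks"] + 0.5 * features["purchases"]


def run_batch_job(conn, users):
    """Score every user offline and upsert the result into a predictions table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS predictions (user_id TEXT PRIMARY KEY, score REAL)"
    )
    rows = [(u["user_id"], score(u)) for u in users]
    # INSERT OR REPLACE so a rerun overwrites the previous scores
    conn.executemany("INSERT OR REPLACE INTO predictions VALUES (?, ?)", rows)
    conn.commit()


def lookup(conn, user_id):
    """Serving path: a cheap key lookup, no model call at request time."""
    row = conn.execute(
        "SELECT score FROM predictions WHERE user_id = ?", (user_id,)
    ).fetchone()
    return None if row is None else row[0]


conn = sqlite3.connect(":memory:")
run_batch_job(conn, [
    {"user_id": "u1", "clicks": 10, "purchases": 2},
    {"user_id": "u2", "clicks": 0, "purchases": 0},
])
```

The trade-off is freshness: `lookup` is fast and cheap, but it returns whatever the last batch run computed, and users who appeared after that run have no score yet.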
1. Data Drift: the distribution of input features shifts away from the training data. E.g. user ages shift from 25-35 to 18-25.
2. Concept Drift: the relationship between input and output changes. E.g. consumer preferences after the pandemic.
3. Model Performance Drift: accuracy degrades over time. Check it against ground truth (when available).
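Data drift on a single numeric feature can be quantified with the Population Stability Index (PSI), one of several common drift scores. The sketch below is a plain-Python illustration, not the method of any specific monitoring tool; the bin edges and sample ages are made up to mirror the age example above, and it assumes every value lies within the given edges.

```python
import math


def psi(expected, actual, edges):
    """Population Stability Index between two samples over shared bin edges.

    Assumes every value lies within [edges[0], edges[-1]].
    """
    def proportions(values):
        counts = [0] * (len(edges) - 1)
        for v in values:
            for i in range(len(counts)):
                is_last = i == len(counts) - 1
                # bins are half-open, except the last one which includes its right edge
                if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                    counts[i] += 1
                    break
        total = sum(counts)
        # a small floor avoids log(0) for empty bins
        return [max(c / total, 1e-6) for c in counts]

    p = proportions(expected)
    q = proportions(actual)
    return sum((qi - pi) * math.log(qi / pi) for pi, qi in zip(p, q))


# Hypothetical samples: training users aged 25-35, live users aged 18-25
train_ages = [25, 27, 29, 30, 31, 33, 34, 35]
live_ages = [18, 19, 20, 21, 22, 23, 24, 25]
edges = [18, 25, 35]

drift_score = psi(train_ages, live_ages, edges)
```

A widely used rule of thumb treats PSI below 0.1 as stable and above 0.25 as a significant shift, but these cutoffs are conventions rather than guarantees and should be tuned per feature.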
CI: unit tests, integration tests, data validation
CD: deploy the model to staging → production
CT (Continuous Training): automatic retraining when drift is detected, when new data arrives, or on a scheduled cadence
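The three Continuous Training triggers (drift, new data, scheduled cadence) can be expressed as a small policy check that an orchestrator would call before launching a retraining job. This is a sketch under assumptions: the class name `RetrainPolicy`, its thresholds, and the inputs are all hypothetical and would come from your own monitoring and data pipeline.

```python
from dataclasses import dataclass


@dataclass
class RetrainPolicy:
    drift_threshold: float = 0.25  # assumed cutoff on a drift score
    min_new_rows: int = 10_000     # assumed volume of fresh labeled data
    max_age_days: int = 30         # assumed scheduled cadence


def retrain_reasons(policy, drift_score, new_rows, model_age_days):
    """Return the list of triggers that fired; an empty list means no retraining."""
    reasons = []
    if drift_score > policy.drift_threshold:
        reasons.append("drift")
    if new_rows >= policy.min_new_rows:
        reasons.append("new_data")
    if model_age_days >= policy.max_age_days:
        reasons.append("schedule")
    return reasons


policy = RetrainPolicy()
decision = retrain_reasons(policy, drift_score=0.4, new_rows=1_200, model_age_days=12)
```

Returning the reasons rather than a bare boolean makes each retraining run auditable, since the trigger can be logged alongside the resulting model version.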
Gojek has an internal ML platform (codename: "Marvel") that serves 200+ models in production: dynamic pricing, ETA prediction, fraud detection, recommendation, and more. Heavy investment in MLOps lets the data science team focus on modeling rather than engineering.
Lesson: without an MLOps platform, scaling ML at a large company turns into chaos. Every team reinvents its own pipeline, which wastes resources and produces inconsistent results.
/predict that loads a model and returns a prediction.
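A `/predict` endpoint of this kind might look like the sketch below. To stay self-contained it uses Python's standard-library `http.server` instead of a framework such as FastAPI, and it swaps in a trivial threshold rule for a real trained model; `ThresholdModel`, the JSON artifact format, and the `amount` field are all assumptions for illustration.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


class ThresholdModel:
    """Hypothetical stand-in for a trained model."""

    def __init__(self, threshold):
        self.threshold = threshold

    def predict(self, amount):
        return int(amount > self.threshold)


def load_model(artifact_json):
    """Build the model once at startup from a (hypothetical) JSON artifact."""
    return ThresholdModel(json.loads(artifact_json)["threshold"])


def handle_predict(model, body):
    """Pure request logic: parse JSON, validate, predict, return (status, payload)."""
    try:
        amount = float(json.loads(body)["amount"])
    except (ValueError, KeyError, TypeError):
        return 400, {"error": "expected a JSON body with a numeric 'amount' field"}
    return 200, {"prediction": model.predict(amount)}


def make_handler(model):
    """Bind the loaded model to an HTTP handler class."""

    class PredictHandler(BaseHTTPRequestHandler):
        def do_POST(self):
            if self.path != "/predict":
                self.send_error(404)
                return
            length = int(self.headers.get("Content-Length", 0))
            status, payload = handle_predict(model, self.rfile.read(length))
            data = json.dumps(payload).encode("utf-8")
            self.send_response(status)
            self.send_header("Content-Type", "application/json")
            self.send_header("Content-Length", str(len(data)))
            self.end_headers()
            self.wfile.write(data)

    return PredictHandler


MODEL = load_model('{"threshold": 100.0}')
# To serve locally (blocks until interrupted):
# HTTPServer(("127.0.0.1", 8000), make_handler(MODEL)).serve_forever()
```

Keeping `handle_predict` separate from the HTTP layer means the prediction logic can be unit-tested in CI without starting a server, and the model is loaded once rather than on every request.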