Workflow Element Store

  1. Binning / Discretization
  2. Handling Imbalanced Classes
  3. Dimensionality Reduction
  4. Annotation
  5. Interaction Features
  6. Domain-Specific Feature Engineering
  7. Feature Extraction from Images
  8. Handling Missing Data
  9. Data Scaling and Normalization
  10. Auto-Preprocessing libraries
  11. Data Partitioning - Train, Validation, & Test
  12. Time-Based Features
  13. Data Transformations
  14. Textual Feature Extraction
  15. Dealing with Outliers
  16. Handling Noisy Data
  17. Feature Selection
  18. Augmentation
  19. Polynomial Features
  20. Handling Categorical Data
  21. AutoEDA libraries
  22. Handling Time-Series Data
  1. Clustering
  2. Forecasting Techniques
  3. Regular Monitoring and Logging
  4. Model Interpretability
  5. Natural Language Processing
  6. Regularization Techniques
  7. Transfer Learning
  8. Multiclass Classification Techniques
  9. Cross-Validation
  10. Word Embeddings
  11. Evaluation Metrics
  12. Batch Normalization
  13. Binary Classification Techniques
  14. Regularization
  15. AutoML
  16. Model Comparison
  17. Ensemble Techniques
  18. Data Augmentation
  19. Recommendation Engine
  20. Regression Analysis
  21. Batch Size Selection
  22. Early Stopping
  23. Learning Rate Scheduling
  24. Association Rules
  25. External Validation
  26. Weight Initialization
  27. Hyperparameter Tuning
  28. Blackbox - Neural Network Models
  29. Cross-Validation
  30. Network Analytics / Geospatial Analytics
  31. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  32. Reinforcement Learning
  33. Transfer Learning
  34. Performance Visualization
  1. Data Preprocessing pipeline models
  2. Kafka Brokers
  3. Code repository
  4. GitHub Actions
  5. GitHub
  6. Databases
  7. Model registry
  8. Evidently.ai
  9. Apache Airflow
  10. Data warehouse
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB / Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)
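
The two prediction paths named for the model serving component, batch and streaming, could be sketched roughly as follows. This is a minimal illustration only: `toy_model`, `predict_batch`, and `predict_stream` are hypothetical names that do not appear in the diagram, and the "model" is a placeholder scoring rule rather than a model loaded from the registry.

```python
from typing import Callable, Iterable, Iterator, List

# Hypothetical stand-in for a trained model pulled from the model registry.
Model = Callable[[List[float]], float]


def toy_model(features: List[float]) -> float:
    """Placeholder scoring rule (assumption): the mean of the feature values."""
    return sum(features) / len(features)


def predict_batch(model: Model, rows: List[List[float]]) -> List[float]:
    """Batch path: score a complete set of rows in one call."""
    return [model(row) for row in rows]


def predict_stream(model: Model, events: Iterable[List[float]]) -> Iterator[float]:
    """Streaming path: score each event lazily, one at a time, as it arrives."""
    for event in events:
        yield model(event)


# Batch data: all rows are available up front.
batch_scores = predict_batch(toy_model, [[1.0, 3.0], [2.0, 4.0]])

# Streaming data: an iterator stands in for a message consumer.
stream_scores = list(predict_stream(toy_model, iter([[10.0, 20.0]])))
```

Both paths share the same model callable, so the serving component only differs in how inputs are delivered, not in how they are scored.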