Workflow Element Store

  1. Binning / Discretization
  2. Annotation
  3. Handling Imbalanced Classes
  4. Polynomial Features
  5. Handling Categorical Data
  6. Augmentation
  7. Handling Missing Data
  8. Dimensionality Reduction
  9. Handling Noisy Data
  10. Dealing with Outliers
  11. Domain-Specific Feature Engineering
  12. Data Transformations
  13. Time-Based Features
  14. Data Scaling and Normalization
  15. Data Partitioning - Train, Validation, & Test
  16. Interaction Features
  17. Auto-Preprocessing libraries
  18. Feature Extraction from Images
  19. Textual Feature Extraction
  20. Feature Selection
  21. Handling Time-Series Data
  22. AutoEDA libraries
  1. Early Stopping
  2. Model Comparison
  3. Transfer Learning
  4. Batch Size Selection
  5. Regression Analysis
  6. Ensemble Techniques
  7. Natural Language Processing
  8. Data Augmentation
  9. Evaluation Metrics
  10. Batch Normalization
  11. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  12. Transfer Learning
  13. Cross-Validation
  14. Weight Initialization
  15. Word Embeddings
  16. Recommendation Engine
  17. Multiclass Classification Techniques
  18. Regularization
  19. AutoML
  20. External Validation
  21. Performance Visualization
  22. Network Analytics/ GeoSpatial Analytics
  23. Reinforcement Learning
  24. Association Rules
  25. Cross-Validation
  26. Forecasting Techniques
  27. Binary Classification Techniques
  28. Regularization Techniques
  29. Clustering
  30. Model Interpretability
  31. Regular Monitoring and Logging
  32. Blackbox - Neural Network Models
  33. Learning Rate Scheduling
  34. Hyperparameter Tuning
  1. Evidently.ai
  2. Apache Airflow
  3. Datawarehouse
  4. Data Preprocessing pipeline models
  5. Github Actions
  6. Github
  7. Kafka Brokers
  8. Databases
  9. code repository
  10. model registry
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)