Workflow Element Store

  1. Feature Selection
  2. Domain-Specific Feature Engineering
  3. AutoEDA libraries
  4. Data Scaling and Normalization
  5. Time-Based Features
  6. Binning / Discretization
  7. Dealing with Outliers
  8. Data Partitioning - Train, Validation, & Test
  9. Handling Noisy Data
  10. Textual Feature Extraction
  11. Handling Time-Series Data
  12. Augmentation
  13. Handling Imbalanced Classes
  14. Handling Missing Data
  15. Interaction Features
  16. Annotation
  17. Dimensionality Reduction
  18. Handling Categorical Data
  19. Polynomial Features
  20. Data Transformations
  21. Feature Extraction from Images
  22. Auto-Preprocessing libraries
  1. Recommendation Engine
  2. External Validation
  3. Regularization
  4. Model Interpretability
  5. Natural Language Processing
  6. Association Rules
  7. Transfer Learning
  8. Batch Normalization
  9. Weight Initialization
  10. Batch Size Selection
  11. Evaluation Metrics
  12. Word Embeddings
  13. Ensemble Techniques
  14. AutoML
  15. Data Augmentation
  16. Performance Visualization
  17. Model Comparison
  18. Cross-Validation
  19. Clustering
  20. Binary Classification Techniques
  21. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  22. Multiclass Classification Techniques
  23. Learning Rate Scheduling
  24. Regular Monitoring and Logging
  25. Transfer Learning
  26. Hyperparameter Tuning
  27. Regularization Techniques
  28. Reinforcement Learning
  29. Network Analytics/ GeoSpatial Analytics
  30. Early Stopping
  31. Forecasting Techniques
  32. Cross-Validation
  33. Blackbox - Neural Network Models
  34. Regression Analysis
  1. Datawarehouse
  2. Github
  3. Apache Airflow
  4. code repository
  5. model registry
  6. Data Preprocessing pipeline models
  7. Kafka Brokers
  8. Databases
  9. Evidently.ai
  10. Github Actions
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)