Workflow Element Store

  1. Augmentation
  2. Feature Selection
  3. Polynomial Features
  4. Interaction Features
  5. Feature Extraction from Images
  6. Auto-Preprocessing libraries
  7. Handling Missing Data
  8. Handling Categorical Data
  9. Data Transformations
  10. Annotation
  11. Handling Time-Series Data
  12. Data Partitioning - Train, Validation, & Test
  13. Time-Based Features
  14. Dealing with Outliers
  15. Handling Imbalanced Classes
  16. Domain-Specific Feature Engineering
  17. Binning / Discretization
  18. Textual Feature Extraction
  19. Data Scaling and Normalization
  20. Dimensionality Reduction
  21. AutoEDA libraries
  22. Handling Noisy Data
  1. Transfer Learning
  2. Blackbox - Neural Network Models
  3. Regression Analysis
  4. Data Augmentation
  5. Multiclass Classification Techniques
  6. Early Stopping
  7. Model Interpretability
  8. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  9. Ensemble Techniques
  10. Forecasting Techniques
  11. Performance Visualization
  12. Hyperparameter Tuning
  13. Learning Rate Scheduling
  14. Natural Language Processing
  15. Cross-Validation
  16. AutoML
  17. Transfer Learning
  18. Recommendation Engine
  19. Batch Size Selection
  20. Network Analytics/ GeoSpatial Analytics
  21. Reinforcement Learning
  22. Word Embeddings
  23. Clustering
  24. Regularization Techniques
  25. External Validation
  26. Evaluation Metrics
  27. Association Rules
  28. Cross-Validation
  29. Batch Normalization
  30. Regular Monitoring and Logging
  31. Model Comparison
  32. Binary Classification Techniques
  33. Weight Initialization
  34. Regularization
  1. Datawarehouse
  2. Databases
  3. Data Preprocessing pipeline models
  4. Kafka Brokers
  5. code repository
  6. Evidently.ai
  7. Apache Airflow
  8. Github Actions
  9. model registry
  10. Github
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)