Workflow Element Store

  1. Interaction Features
  2. Handling Time-Series Data
  3. Time-Based Features
  4. Handling Missing Data
  5. Textual Feature Extraction
  6. Data Transformations
  7. Feature Extraction from Images
  8. Auto-Preprocessing libraries
  9. AutoEDA libraries
  10. Handling Imbalanced Classes
  11. Handling Noisy Data
  12. Dimensionality Reduction
  13. Augmentation
  14. Data Scaling and Normalization
  15. Data Partitioning - Train, Validation, & Test
  16. Annotation
  17. Handling Categorical Data
  18. Polynomial Features
  19. Feature Selection
  20. Dealing with Outliers
  21. Domain-Specific Feature Engineering
  22. Binning / Discretization
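
Item 15 above (Data Partitioning - Train, Validation, & Test) can be illustrated with a minimal sketch. The function name `partition`, the default fractions, and the seed are assumptions chosen for illustration; they are not part of the diagram.

```python
import random

def partition(rows, val_frac=0.15, test_frac=0.15, seed=0):
    """Shuffle rows and split them into train, validation, and test lists.

    Hypothetical helper: the fractions and seed are illustrative defaults.
    """
    if not 0 <= val_frac + test_frac < 1:
        raise ValueError("val_frac + test_frac must be in [0, 1)")
    shuffled = list(rows)
    # A seeded generator keeps the split reproducible across runs.
    random.Random(seed).shuffle(shuffled)
    n = len(shuffled)
    n_test = int(n * test_frac)
    n_val = int(n * val_frac)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test

train, val, test = partition(range(100))
```

The three lists are disjoint and together cover every input row. For time-series data a chronological split is usually preferable to a shuffled one.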
  1. Data Augmentation
  2. Regression Analysis
  3. AutoML
  4. Transfer Learning
  5. Natural Language Processing
  6. Evaluation Metrics
  7. Batch Size Selection
  8. Binary Classification Techniques
  9. External Validation
  10. Transfer Learning
  11. Regularization Techniques
  12. Clustering
  13. Black-Box Neural Network Models
  14. Model Interpretability
  15. Model Comparison
  16. Network Analytics / Geospatial Analytics
  17. Multiclass Classification Techniques
  18. Regularization
  19. Cross-Validation
  20. Weight Initialization
  21. Forecasting Techniques
  22. Cross-Validation
  23. Batch Normalization
  24. Early Stopping
  25. Hyperparameter Tuning
  26. Reinforcement Learning
  27. Association Rules
  28. Regular Monitoring and Logging
  29. Word Embeddings
  30. Recommendation Engine
  31. Learning Rate Scheduling
  32. Ensemble Techniques
  33. Performance Visualization
  34. GridSearchCV, RandomizedSearchCV, BayesSearchCV
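
Regularization, cross-validation, and hyperparameter tuning from the list above can be shown working together in one small sketch. It is written in plain Python to show the idea behind an exhaustive grid search with k-fold cross-validation; it is not scikit-learn's implementation. The one-parameter ridge model, the helper names, and the toy data are all assumptions made for illustration.

```python
from itertools import product

def k_fold_indices(n, k):
    """Yield (train_idx, val_idx) pairs for k contiguous folds over n rows."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = list(range(start, start + size))
        train_idx = [i for i in range(n) if i < start or i >= start + size]
        yield train_idx, val_idx
        start += size

def fit_ridge_1d(xs, ys, alpha):
    """Closed-form slope for y ~ w * x with an L2 penalty alpha * w**2."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + alpha)

def cv_mse(xs, ys, alpha, k):
    """Mean squared validation error averaged over k folds."""
    fold_errors = []
    for train_idx, val_idx in k_fold_indices(len(xs), k):
        w = fit_ridge_1d([xs[i] for i in train_idx],
                         [ys[i] for i in train_idx], alpha)
        errors = [(ys[i] - w * xs[i]) ** 2 for i in val_idx]
        fold_errors.append(sum(errors) / len(errors))
    return sum(fold_errors) / len(fold_errors)

def grid_search(xs, ys, param_grid, k=5):
    """Try every parameter combination and keep the lowest CV error."""
    names = sorted(param_grid)
    best = None
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = cv_mse(xs, ys, k=k, **params)
        if best is None or score < best[1]:
            best = (params, score)
    return best

# Toy, noise-free data (y = 2x), so the unpenalized fit is exact.
xs = [float(i) for i in range(1, 11)]
ys = [2.0 * x for x in xs]
best_params, best_score = grid_search(xs, ys, {"alpha": [0.0, 1.0, 10.0]}, k=5)
```

A randomized search would sample a fixed number of combinations instead of enumerating the full product, and a Bayesian search would choose the next combination based on earlier scores.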
  1. Data Preprocessing pipeline models
  2. Model registry
  3. GitHub Actions
  4. Data warehouse
  5. Apache Airflow
  6. Code repository
  7. Databases
  8. Kafka Brokers
  9. Evidently.ai
  10. GitHub
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB

Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)
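
The two prediction modes named for the model serving component can be contrasted with a minimal sketch. The names `predict_batch`, `predict_stream`, and `toy_model` are hypothetical; `toy_model` stands in for whatever model would be loaded from the model registry.

```python
from typing import Callable, Iterable, Iterator, List, Sequence

Features = Sequence[float]
Model = Callable[[Features], float]

def toy_model(features: Features) -> float:
    """Hypothetical stand-in for a trained model: sums its inputs."""
    return float(sum(features))

def predict_batch(model: Model, rows: Iterable[Features]) -> List[float]:
    """Score a whole batch at once and return all predictions together."""
    return [model(row) for row in rows]

def predict_stream(model: Model, events: Iterable[Features]) -> Iterator[float]:
    """Score events one at a time, yielding each prediction as it is ready."""
    for event in events:
        yield model(event)

batch_predictions = predict_batch(toy_model, [[1.0, 2.0], [3.0, 4.0]])
stream_predictions = list(predict_stream(toy_model, iter([[1.0], [2.0]])))
```

In a real deployment the batch path would typically read from the data warehouse on a schedule, while the streaming path would consume from a message broker such as Kafka; both are left out here to keep the sketch self-contained.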