Workflow Element Store

  1. Data Partitioning - Train, Validation, & Test
  2. Feature Extraction from Images
  3. Augmentation
  4. AutoEDA libraries
  5. Handling Noisy Data
  6. Auto-Preprocessing libraries
  7. Polynomial Features
  8. Dealing with Outliers
  9. Time-Based Features
  10. Interaction Features
  11. Handling Imbalanced Classes
  12. Handling Missing Data
  13. Annotation
  14. Binning / Discretization
  15. Handling Time-Series Data
  16. Data Transformations
  17. Data Scaling and Normalization
  18. Dimensionality Reduction
  19. Domain-Specific Feature Engineering
  20. Handling Categorical Data
  21. Feature Selection
  22. Textual Feature Extraction
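
A minimal sketch of element 1 (Data Partitioning - Train, Validation, & Test), using only the Python standard library. The function name `partition`, the 70/15/15 default fractions, and the fixed seed are assumptions chosen for illustration; the diagram does not prescribe them.

```python
import random


def partition(records, train_frac=0.7, val_frac=0.15, seed=0):
    """Shuffle records and split them into train / validation / test lists.

    The fractions and seed are illustrative defaults, not values taken
    from the workflow diagram.
    """
    if not (0 < train_frac < 1 and 0 <= val_frac < 1 and train_frac + val_frac < 1):
        raise ValueError("fractions must leave a non-empty share for the test set")

    shuffled = list(records)
    # A dedicated Random instance keeps the split reproducible
    # without touching the global random state.
    random.Random(seed).shuffle(shuffled)

    n_train = round(len(shuffled) * train_frac)
    n_val = round(len(shuffled) * val_frac)

    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # whatever remains goes to the test set
    return train, val, test


train, val, test = partition(list(range(100)))
```

For time-ordered data (element 15), a random shuffle like this would leak future information into training, so a chronological split would likely be the better choice there.
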
  1. Natural Language Processing
  2. Recommendation Engine
  3. Transfer Learning
  4. Early Stopping
  5. Binary Classification Techniques
  6. Performance Visualization
  7. Cross-Validation
  8. Cross-Validation
  9. Learning Rate Scheduling
  10. Reinforcement Learning
  11. Hyperparameter Tuning
  12. Batch Size Selection
  13. Transfer Learning
  14. Word Embeddings
  15. AutoML
  16. Evaluation Metrics
  17. Regularization Techniques
  18. Data Augmentation
  19. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  20. Forecasting Techniques
  21. Model Interpretability
  22. Weight Initialization
  23. Regression Analysis
  24. Clustering
  25. Batch Normalization
  26. External Validation
  27. Network Analytics / Geospatial Analytics
  28. Association Rules
  29. Black Box - Neural Network Models
  30. Regular Monitoring and Logging
  31. Multiclass Classification Techniques
  32. Ensemble Techniques
  33. Model Comparison
  34. Regularization
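
The Cross-Validation, Hyperparameter Tuning, and GridSearchCV elements above fit together roughly as sketched below. This is a standard-library illustration of the idea behind an exhaustive grid search with k-fold cross-validation, not scikit-learn's implementation. The helper names (`k_fold_indices`, `grid_search_cv`, `ridge_slope_score`), the toy data, and the `lam` grid are all assumptions made for the example.

```python
from itertools import product
from statistics import mean


def k_fold_indices(n_samples, k):
    """Yield (train_idx, val_idx) pairs for k contiguous, near-equal folds."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, val_idx
        start += size


def grid_search_cv(fit_and_score, param_grid, n_samples, k=5):
    """Try every parameter combination; keep the one with the best mean CV score."""
    names = sorted(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        scores = [fit_and_score(params, tr, va)
                  for tr, va in k_fold_indices(n_samples, k)]
        score = mean(scores)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


# Toy data (hypothetical): y = 2x with no noise.
xs = [float(i) for i in range(1, 11)]
ys = [2.0 * x for x in xs]


def ridge_slope_score(params, train_idx, val_idx):
    """Fit y = slope * x with an L2 penalty `lam`; return negative validation MSE."""
    lam = params["lam"]
    num = sum(xs[i] * ys[i] for i in train_idx)
    den = sum(xs[i] ** 2 for i in train_idx) + lam
    slope = num / den
    mse = mean([(ys[i] - slope * xs[i]) ** 2 for i in val_idx])
    return -mse  # higher is better, so the search maximizes this value


best_params, best_score = grid_search_cv(
    ridge_slope_score, {"lam": [0.0, 1.0, 10.0]}, n_samples=len(xs), k=5
)
```

Because the toy data is noise-free, the unpenalized setting wins here; on noisy data a nonzero penalty may score better. A randomized search would sample combinations from the grid instead of enumerating all of them.
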
  1. Kafka Brokers
  2. Evidently.ai
  3. Databases
  4. Data Preprocessing Pipeline Models
  5. Data Warehouse
  6. Code Repository
  7. GitHub Actions
  8. Model Registry
  9. GitHub
  10. Apache Airflow
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD Component

Continuous Integration / Continuous Delivery

Continuous Deployment

Artifact Store

Feature Store System

Offline DB

Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow Orchestration Component

Automation ML Workflow Pipeline

Monitoring Component

Model Serving Component

(Prediction on new batch or streaming data)
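
The two serving modes named above (batch and streaming) can be sketched as follows. This is only an illustration under stated assumptions: the `Model` type, the function names, and the stand-in `double` model are hypothetical and do not come from the diagram.

```python
from typing import Callable, Iterable, Iterator, List

# Hypothetical model interface: any callable mapping one input to one prediction.
Model = Callable[[float], float]


def predict_batch(model: Model, rows: Iterable[float]) -> List[float]:
    """Score a complete batch at once and return all predictions together."""
    return [model(row) for row in rows]


def predict_stream(model: Model, events: Iterable[float]) -> Iterator[float]:
    """Score events one at a time, yielding each prediction as it is produced."""
    for event in events:
        yield model(event)


def double(x: float) -> float:
    """Stand-in for a trained model loaded from the model registry."""
    return 2.0 * x


batch_predictions = predict_batch(double, [1.0, 2.0, 3.0])
stream_predictions = predict_stream(double, iter([4.0, 5.0]))
```

The streaming variant is a generator, so it never needs the whole input in memory; in a real deployment the events would plausibly arrive from a message broker rather than a Python iterator.
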