Workflow Element Store

  1. Handling Noisy Data
  2. Dimensionality Reduction
  3. Handling Categorical Data
  4. Dealing with Outliers
  5. Binning / Discretization
  6. Auto-Preprocessing Libraries
  7. Feature Selection
  8. Polynomial Features
  9. Handling Missing Data
  10. Data Scaling and Normalization
  11. Domain-Specific Feature Engineering
  12. Data Partitioning - Train, Validation, & Test
  13. Handling Time-Series Data
  14. AutoEDA Libraries
  15. Textual Feature Extraction
  16. Augmentation
  17. Interaction Features
  18. Data Transformations
  19. Time-Based Features
  20. Handling Imbalanced Classes
  21. Annotation
  22. Feature Extraction from Images

  1. Reinforcement Learning
  2. Forecasting Techniques
  3. Binary Classification Techniques
  4. Early Stopping
  5. Model Interpretability
  6. Transfer Learning
  7. Cross-Validation
  8. Data Augmentation
  9. Clustering
  10. Regularization Techniques
  11. Weight Initialization
  12. Hyperparameter Tuning
  13. Network Analytics / Geospatial Analytics
  14. Regular Monitoring and Logging
  15. External Validation
  16. Multiclass Classification Techniques
  17. AutoML
  18. Transfer Learning
  19. Performance Visualization
  20. Batch Size Selection
  21. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  22. Association Rules
  23. Blackbox - Neural Network Models
  24. Word Embeddings
  25. Regression Analysis
  26. Cross-Validation
  27. Model Comparison
  28. Regularization
  29. Learning Rate Scheduling
  30. Recommendation Engine
  31. Natural Language Processing
  32. Evaluation Metrics
  33. Batch Normalization
  34. Ensemble Techniques

  1. Model Registry
  2. Apache Airflow
  3. GitHub Actions
  4. Evidently.ai
  5. GitHub
  6. Databases
  7. Kafka Brokers
  8. Data Warehouse
  9. Data Preprocessing Pipeline Models
  10. Code Repository
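
The numbered lists above read as a lookup catalog of workflow elements. A minimal sketch of such a store in Python, under stated assumptions: the group labels "data", "modeling", and "infrastructure" are hypothetical (the source does not title the three lists), and the Element and ElementStore names are illustrative only, not part of any real library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    group: str   # hypothetical group label; the source lists are untitled
    number: int  # number as printed in the list
    name: str


class ElementStore:
    """Illustrative in-memory catalog keyed by (group, number)."""

    def __init__(self):
        self._items = {}

    def add(self, group, number, name):
        key = (group, number)
        if key in self._items:
            raise KeyError(f"duplicate element id: {key}")
        self._items[key] = Element(group, number, name)

    def get(self, group, number):
        return self._items[(group, number)]

    def find(self, text):
        # Case-insensitive substring search over element names,
        # returned in insertion order.
        needle = text.lower()
        return [e for e in self._items.values() if needle in e.name.lower()]


store = ElementStore()
store.add("data", 9, "Handling Missing Data")
store.add("data", 12, "Data Partitioning - Train, Validation, & Test")
store.add("modeling", 7, "Cross-Validation")
store.add("modeling", 26, "Cross-Validation")
store.add("infrastructure", 2, "Apache Airflow")

matches = store.find("cross-validation")
```

Keying on (group, number) rather than on the name alone keeps entries that appear twice in a list, such as Cross-Validation, addressable as separate elements.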
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB

Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)
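
The component labels above suggest a flow from data sources through to serving and monitoring. The sketch below orders a subset of those components with Python's standard graphlib module. The dependency edges are assumptions made for illustration, since the diagram's arrows are not recoverable from the labels alone; only the component names come from the source.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# component -> set of components it depends on.
# ASSUMPTION: these edges are illustrative; the source gives labels only.
DEPENDS_ON = {
    "Feature Engineering Pipeline": {"Data Sources"},
    "Feature Store System": {"Feature Engineering Pipeline"},
    "Experimentation": {"Feature Store System"},
    "ML Model": {"Experimentation"},
    "Model Registry": {"ML Model"},
    "Model serving component": {"Model Registry", "Feature Store System"},
    "Monitoring Component": {"Model serving component"},
}

# static_order() yields every component after all of its dependencies.
order = list(TopologicalSorter(DEPENDS_ON).static_order())

for step, component in enumerate(order, start=1):
    print(f"{step}. {component}")
```

A scheduler or workflow orchestration component could use an ordering of this kind to decide which stage to run next. If the assumed edges ever formed a cycle, TopologicalSorter would raise graphlib.CycleError instead of returning an order.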