Workflow Element Store

  1. Handling Missing Data
  2. Data Partitioning - Train, Validation, & Test
  3. Binning / Discretization
  4. Handling Noisy Data
  5. Dealing with Outliers
  6. Handling Imbalanced Classes
  7. AutoEDA libraries
  8. Textual Feature Extraction
  9. Feature Extraction from Images
  10. Handling Time-Series Data
  11. Annotation
  12. Domain-Specific Feature Engineering
  13. Time-Based Features
  14. Dimensionality Reduction
  15. Data Scaling and Normalization
  16. Auto-Preprocessing libraries
  17. Augmentation
  18. Handling Categorical Data
  19. Polynomial Features
  20. Interaction Features
  21. Feature Selection
  22. Data Transformations
  1. Association Rules
  2. Evaluation Metrics
  3. Transfer Learning
  4. Batch Normalization
  5. AutoML
  6. Performance Visualization
  7. Model Interpretability
  8. Regular Monitoring and Logging
  9. Cross-Validation
  10. Transfer Learning
  11. Natural Language Processing
  12. Binary Classification Techniques
  13. External Validation
  14. Reinforcement Learning
  15. Cross-Validation
  16. Learning Rate Scheduling
  17. Data Augmentation
  18. Batch Size Selection
  19. Regression Analysis
  20. Forecasting Techniques
  21. Hyperparameter Tuning
  22. Regularization Techniques
  23. Model Comparison
  24. Regularization
  25. Weight Initialization
  26. Recommendation Engine
  27. Network Analytics/ GeoSpatial Analytics
  28. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  29. Clustering
  30. Word Embeddings
  31. Early Stopping
  32. Blackbox - Neural Network Models
  33. Ensemble Techniques
  34. Multiclass Classification Techniques
  1. Datawarehouse
  2. code repository
  3. Databases
  4. model registry
  5. Apache Airflow
  6. Github Actions
  7. Data Preprocessing pipeline models
  8. Evidently.ai
  9. Kafka Brokers
  10. Github
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Orchestration Component

Artifact Store

CI/CD Component

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)