Workflow Element Store

  1. AutoEDA libraries
  2. Handling Time-Series Data
  3. Polynomial Features
  4. Data Scaling and Normalization
  5. Feature Extraction from Images
  6. Time-Based Features
  7. Auto-Preprocessing libraries
  8. Annotation
  9. Interaction Features
  10. Feature Selection
  11. Handling Categorical Data
  12. Handling Imbalanced Classes
  13. Dimensionality Reduction
  14. Textual Feature Extraction
  15. Data Transformations
  16. Domain-Specific Feature Engineering
  17. Dealing with Outliers
  18. Binning / Discretization
  19. Handling Missing Data
  20. Data Partitioning - Train, Validation, & Test
  21. Handling Noisy Data
  22. Augmentation
  1. External Validation
  2. Weight Initialization
  3. Binary Classification Techniques
  4. Network Analytics / Geospatial Analytics
  5. Model Interpretability
  6. Word Embeddings
  7. Blackbox - Neural Network Models
  8. Clustering
  9. Cross-Validation
  10. Recommendation Engine
  11. Ensemble Techniques
  12. Hyperparameter Tuning
  13. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  14. Regression Analysis
  15. Transfer Learning
  16. Learning Rate Scheduling
  17. Data Augmentation
  18. Regularization
  19. Natural Language Processing
  20. Regularization Techniques
  21. Reinforcement Learning
  22. Forecasting Techniques
  23. Regular Monitoring and Logging
  24. Batch Size Selection
  25. Model Comparison
  26. Association Rules
  27. Performance Visualization
  28. Batch Normalization
  29. AutoML
  30. Evaluation Metrics
  31. Early Stopping
  32. Multiclass Classification Techniques
  1. Data Preprocessing pipeline models
  2. GitHub Actions
  3. GitHub
  4. Data Warehouse
  5. Kafka Brokers
  6. Apache Airflow
  7. Code Repository
  8. Evidently.ai
  9. Model Registry
  10. Databases
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB / Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automated ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)