Every block you need to build ML pipelines
Drag-and-drop components for tabular data, computer vision, NLP, time series, and geospatial analysis — all in one studio.
Showing 69 blocks
CSV Upload
TabularData Sources
Load structured data from CSV files for analysis and model training.
Excel File
TabularData Sources
Import data from Excel spreadsheets (.xlsx, .xls) including multi-sheet files.
Parquet File
TabularData Sources
Load columnar-format Parquet files — ideal for large datasets.
SQL Database
TabularData Sources
Connect to relational databases (PostgreSQL, MySQL, SQLite) and run custom queries.
API Webhook
TabularData Sources
Fetch data from REST APIs with support for authentication and pagination.
Remove Nulls
TabularData Prep
Handle missing values through deletion or statistical imputation.
Data Imputer
TabularData Prep
Advanced missing value imputation using KNN or iterative model-based strategies.
Feature Scale
TabularData Prep
Normalise or standardise numeric features to comparable scales.
Encode Categories
TabularData Prep
Convert categorical variables to numeric representations for ML models.
SMOTE
TabularData Prep
Synthetic Minority Over-sampling Technique for imbalanced classification.
Remove Outliers
TabularData Prep
Detect and remove or cap statistical outliers from numeric features.
Split Data
TabularData Prep
Divide dataset into training and test sets with optional stratification.
PCA
TabularFeature Engineering
Principal Component Analysis — reduce dimensionality while preserving variance.
Target Encoding
TabularFeature Engineering
Replace categorical values with smoothed mean of the target variable.
Polynomial Features
TabularFeature Engineering
Generate interaction terms and polynomial combinations of numeric features.
Random Forest
TabularClassical Models
Ensemble of decision trees — robust, accurate, and scale-invariant.
XGBoost
TabularClassical Models
Optimised gradient boosting — frequently wins structured-data competitions.
LightGBM
TabularClassical Models
Gradient boosting optimised for speed and large datasets.
Logistic Regression
TabularClassical Models
Linear classifier with probabilistic output — fast and interpretable.
SVM
TabularClassical Models
Support Vector Machine — effective in high-dimensional spaces.
Decision Tree
TabularClassical Models
Interpretable tree-based model — transparent decision rules.
Neural Network (MLP)
TabularDeep Learning
Multi-layer perceptron for complex non-linear patterns in tabular data.
LSTM
TabularDeep Learning
Long Short-Term Memory — recurrent network for sequential tabular data.
Transformer
TabularDeep Learning
Self-attention based model — state of the art for tabular and sequence tasks.
Accuracy Score
TabularEvaluation
Classification accuracy, precision, recall, and F1 — all in one block.
ROC AUC
TabularEvaluation
Area under the ROC curve — threshold-independent classifier evaluation.
F1 Score
TabularEvaluation
Harmonic mean of precision and recall — best for imbalanced classification.
Confusion Matrix
TabularEvaluation
Visualise all correct and incorrect predictions across every class.
Correlation Matrix
TabularAnalysis
Visualize pairwise correlations between numeric features as a color-coded heatmap matrix.
Correlation Table
TabularAnalysis
Display pairwise correlations in a sortable table format with numeric precision.
SHAP Values
TabularEvaluation
Game-theory feature importance — explains how each feature drives predictions.
Cross-Validation
TabularEvaluation
K-fold cross-validation for robust, unbiased performance estimates.
Image Folder
VisionVision Sources
Load image datasets from a folder organised by class subdirectories.
COCO Dataset
VisionVision Sources
Load object detection datasets in COCO JSON annotation format.
Resize
VisionVision Prep
Resize images to a fixed resolution required by vision models.
Normalize
VisionVision Prep
Normalise pixel values to ImageNet mean/std for pretrained model compatibility.
Random Flip
VisionVision Prep
Data augmentation via horizontal and vertical random flipping.
EfficientNet-B0
VisionVision Models
Efficient and accurate image classifier — best accuracy/parameter ratio.
ResNet-50
VisionVision Models
Classic 50-layer residual network — reliable baseline for image tasks.
YOLOv8
VisionVision Models
Real-time object detection — state of the art speed and accuracy.
mAP Score
VisionVision Evaluation
Mean Average Precision — the standard metric for object detection.
Text CSV
NLPText Sources
Load text data from CSV files for NLP pipelines.
PDF Loader
NLPText Sources
Extract text from PDF files for document analysis pipelines.
Clean Text
NLPText Prep
Remove noise from text — HTML, URLs, special characters, stopwords.
TF-IDF Vectorizer
NLPText Prep
Convert text to numeric TF-IDF feature vectors for classical ML models.
Text Embeddings
NLPText Prep
Dense sentence embeddings using sentence-transformers models.
BERT
NLPNLP Models
Bidirectional transformer — state of the art for text classification and NER.
DistilBERT
NLPNLP Models
Lightweight BERT distillation — 40% smaller, 60% faster, 97% of performance.
LLM Processor
NLPNLP Models
Connect to GPT-4, Claude, or Llama via API for zero-shot NLP tasks.
Time Series CSV
Time SeriesTS Sources
Load time series data from CSV with automatic datetime parsing.
Financial API
Time SeriesTS Sources
Stream historical stock, crypto, or forex prices from market data APIs.
Resample
Time SeriesTS Prep
Change time series frequency — upsample or downsample with aggregation.
Differencing
Time SeriesTS Prep
Remove trends and make time series stationary via differencing.
Lag Features
Time SeriesTS Features
Create lagged versions of the target to use as input features.
Rolling Statistics
Time SeriesTS Features
Rolling mean, std, min, max over a sliding time window.
Prophet
Time SeriesTS Models
Facebook Prophet — robust forecasting with trend, seasonality, and holidays.
ARIMA
Time SeriesTS Models
AutoRegressive Integrated Moving Average — classic statistical forecasting.
XGBoost (TS)
Time SeriesTS Models
XGBoost trained on lag and rolling features for time series regression.
MAE
Time SeriesTS Evaluation
Mean Absolute Error — average magnitude of forecast errors.
RMSE
Time SeriesTS Evaluation
Root Mean Squared Error — penalises large forecast errors more heavily.
Shapefile Loader
GeospatialGeo Sources
Load vector geospatial data from Shapefile or GeoJSON.
GeoJSON Loader
GeospatialGeo Sources
Load GeoJSON feature collections directly from file or URL.
Reproject
GeospatialGeo Prep
Transform geometries between coordinate reference systems (CRS).
Grid Sampling
GeospatialGeo Prep
Sample spatial data onto a regular grid at a specified resolution.
NDVI / Spectral
GeospatialGeo Features
Compute vegetation indices (NDVI, EVI) from multispectral satellite bands.
Distance Features
GeospatialGeo Features
Compute distances from points to reference geometries (roads, cities, POIs).
Spatial RF
GeospatialGeo Models
Random Forest with spatial cross-validation to prevent leakage.
Kriging
GeospatialGeo Models
Gaussian process spatial interpolation — optimal linear unbiased estimator.
Spatial Accuracy
GeospatialGeo Evaluation
Evaluate predictions with spatially-aware metrics (RMSE, coverage error).