Tutorials¶
Interactive Jupyter notebooks that walk through end-to-end workflows in VLA Foundry. These tutorials are hands-on and beginner-friendly, designed to help you understand how to use the framework by running actual training jobs with small models and sample data.
All tutorials are located in the tutorials/ directory of the repository.
Getting Started¶
Prerequisites: - GPU with ≥16 GB VRAM recommended - Install the Jupyter kernel once from the repo root:
- Select Python (vla_foundry) as your kernel when running notebooksAll notebooks are standalone and download required data automatically.
Available Tutorials¶
🎯 Training LLM, VLM, and VLA¶
The full three-stage training pipeline from scratch: train a 100M parameter language model on text data, add vision capabilities with image-caption training, and add action prediction with robotics data.
This is the recommended starting point if you're new to VLA Foundry.
🔄 LLM & VLM Inference¶
Load trained models and run inference: load LLM/VLM checkpoints, generate text completions and image captions, and use the processor and tokenizer APIs.
📊 Data Visualization¶
Inspect and visualize robotics datasets: visualize camera streams, plot action trajectories, examine proprioceptive data, and debug data loading issues.
🤖 Simulation Evaluation¶
Evaluate VLA policies in simulation: set up the evaluation environment, load a trained VLA checkpoint, run rollouts in simulation, and analyze success rates and failure modes.
📦 Adding New Datasets¶
Integrate custom datasets into VLA Foundry: understand the WebDataset tar format, convert your data to VLA Foundry format, write dataset manifests, and configure dataset mixing and weighting.
🔧 Converting Spartan Data to Tar Shards¶
Preprocess LBM/Spartan format robotics data: convert Spartan episodes to WebDataset shards, generate dataset statistics, and create manifests for training.
🦾 LeRobot Integration¶
Work with LeRobot datasets: download datasets from the LeRobot hub, convert LeRobot format to VLA Foundry format, and preprocess and train on LeRobot data.
What's Next?¶
After completing the tutorials, check out:
- Examples -- Copy-paste-ready bash scripts for production workflows
- Guides -- In-depth how-to guides for specific tasks
- Reference -- Detailed API documentation
Troubleshooting¶
Kernel not found:
Out of memory: - Reduce per_gpu_batch_size in training commands - Use smaller model configs (e.g., transformer_100m.yaml instead of larger variants) - Close other GPU processes
Data download fails: - Check your internet connection - Some image URLs in PixelProse may be unavailable (normal -- the tutorial will retry) - For large datasets, consider downloading outside the notebook and pointing to local paths