Skip to content

Tutorials

Interactive Jupyter notebooks that walk through end-to-end workflows in VLA Foundry. These tutorials are hands-on and beginner-friendly, designed to help you understand how to use the framework by running actual training jobs with small models and sample data.

All tutorials are located in the tutorials/ directory of the repository.


Getting Started

Prerequisites: - GPU with ≥16 GB VRAM recommended - Install the Jupyter kernel once from the repo root:

bash tutorials/install_kernel.sh
- Select Python (vla_foundry) as your kernel when running notebooks

All notebooks are standalone and download required data automatically.


Available Tutorials

🎯 Training LLM, VLM, and VLA

The full three-stage training pipeline from scratch: train a 100M parameter language model on text data, add vision capabilities with image-caption training, and add action prediction with robotics data.

This is the recommended starting point if you're new to VLA Foundry.


🔄 LLM & VLM Inference

Load trained models and run inference: load LLM/VLM checkpoints, generate text completions and image captions, and use the processor and tokenizer APIs.


📊 Data Visualization

Inspect and visualize robotics datasets: visualize camera streams, plot action trajectories, examine proprioceptive data, and debug data loading issues.


🤖 Simulation Evaluation

Evaluate VLA policies in simulation: set up the evaluation environment, load a trained VLA checkpoint, run rollouts in simulation, and analyze success rates and failure modes.


📦 Adding New Datasets

Integrate custom datasets into VLA Foundry: understand the WebDataset tar format, convert your data to VLA Foundry format, write dataset manifests, and configure dataset mixing and weighting.


🔧 Converting Spartan Data to Tar Shards

Preprocess LBM/Spartan format robotics data: convert Spartan episodes to WebDataset shards, generate dataset statistics, and create manifests for training.


🦾 LeRobot Integration

Work with LeRobot datasets: download datasets from the LeRobot hub, convert LeRobot format to VLA Foundry format, and preprocess and train on LeRobot data.


What's Next?

After completing the tutorials, check out:

  • Examples -- Copy-paste-ready bash scripts for production workflows
  • Guides -- In-depth how-to guides for specific tasks
  • Reference -- Detailed API documentation

Troubleshooting

Kernel not found:

bash tutorials/install_kernel.sh
# Then restart Jupyter and select "Python (vla_foundry)"

Out of memory: - Reduce per_gpu_batch_size in training commands - Use smaller model configs (e.g., transformer_100m.yaml instead of larger variants) - Close other GPU processes

Data download fails: - Check your internet connection - Some image URLs in PixelProse may be unavailable (normal -- the tutorial will retry) - For large datasets, consider downloading outside the notebook and pointing to local paths

For more help, see the FAQ or open an issue on GitHub.