Module 4 — Vision-Language-Action (VLA)
Welcome to Module 4 of the AI-Native Textbook. This module covers Vision-Language-Action systems and capstone projects.
Curriculum Overview
This module spans 4 weeks and covers:
- Week 9: Voice-to-Action with OpenAI Whisper
- Week 10: Cognitive Planning — LLMs Translating Natural Language to ROS 2 Actions
- Weeks 11-13: Capstone — Autonomous Humanoid Deployment & Testing
Prerequisites
- Completion of Modules 1, 2, and 3
- Understanding of natural language processing
- Experience with LLMs and AI systems
Learning Objectives
By the end of this module, you will understand:
- Voice-to-action systems using OpenAI Whisper
- Cognitive planning with LLMs
- Natural language to ROS 2 action translation
- Autonomous humanoid deployment and testing