Depth-Aware Vision-Language-Action (VLA) Architectures for Embodied AI
Exploring how spatial depth perception can be integrated into large-scale VLA models to make robotic manipulation more robust and to bridge the gap between high-level reasoning and low-level physical interaction.
- Embodied AI
- VLA Models
- Robotic Policy Learning