The human brain extracts complex information from visual inputs, including objects, their spatial and semantic interrelations, and their interactions with the environment. However, a quantitative ...
ChartMuseum is a chart question answering benchmark designed to evaluate the reasoning capabilities of large vision-language models (LVLMs) over real-world chart images. The benchmark consists of 1,162 ...
Abstract: Large language models (LLMs) have gained increasing popularity in robotic task planning due to their exceptional abilities in text analytics and generation, as well as their broad knowledge ...
Abstract: Enabling home-assistant robots to perceive and manipulate a diverse range of 3D objects based on human language instructions is a pivotal challenge. Prior research has predominantly focused ...