This demo presents the full PhyInfoEQA pipeline, including embodied exploration, scene graph construction, physical-information fusion, active reasoning, and industrial question answering in complex environments.
Fig. 1. Overview of the proposed TriPMA framework.
A concrete workflow example of TriPMA. Given the question ``Why did conveyor belt A fail and stop?'', reasoning over physical and informational spaces, the agent navigates across regions (Assembly line work area ——> Electrical room), maintains an object information list, collects physical observations and digital information on demand, and dynamically updates the scene graph until the question becomes answerable.
Fig. 2. Dataset examples and statistics of PhyInfoEQA.
Task examples and dataset statistics of the PhyInfoEQA Dataset
PhyInfoEQA addresses a critical limitation in existing Embodied Question Answering systems: most current benchmarks rely solely on physical perception and lack support for dynamic informational reasoning.
We introduce PhyInfoEQA, the first unified benchmark that requires embodied agents to jointly reason across physical observations and acquired digital information.
We further propose TriPMA, a question-driven active cognitive framework integrating planning, scene understanding, active exploration, and information acquisition for industrial embodied intelligence.
Fig. 3 . Detailed architecture of TriPMA.
Overview of the proposed TriPMA framework for PhyInfoEQA task. TriPMA consists of three functional modules: TriPlanner (Planner 1 for answerability judgment, Planner 2 for target decomposition, Planner 3 for navigation decision), Manager (maintaining an object information list and a hierarchical scene graph), and Actor (executing cross-region navigation and in-region exploration via multi-sourced value maps).
Fig. 4. Qualitative reasoning case of TriPMA.
The two RGB-D images used by TriPMA for information acquisition: (a) capturing the equipment status and intactness information of the robotic arm; and (b) capturing the occupancy status of the vehicle bays.