Picture for Mengna Wang

Mengna Wang

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

Add code
Mar 27, 2025
Viaarxiv icon

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Add code
Jul 10, 2024
Figure 1 for Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
Figure 2 for Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
Figure 3 for Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
Figure 4 for Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
Viaarxiv icon

Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization

Add code
Feb 27, 2024
Figure 1 for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Figure 2 for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Figure 3 for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Figure 4 for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Viaarxiv icon