Picture for Yueqi Song

Yueqi Song

Say Something Else: Rethinking Contextual Privacy as Information Sufficiency

Add code
Apr 07, 2026
Viaarxiv icon

IndoorR2X: Indoor Robot-to-Everything Coordination with LLM-Driven Planning

Add code
Mar 20, 2026
Viaarxiv icon

Grounding Multilingual Multimodal LLMs With Cultural Knowledge

Add code
Aug 12, 2025
Viaarxiv icon

Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics

Add code
Jun 14, 2025
Viaarxiv icon

FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks

Add code
May 26, 2025
Viaarxiv icon

VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge

Add code
Apr 15, 2025
Viaarxiv icon

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Add code
Apr 09, 2025
Figure 1 for SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills
Figure 2 for SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills
Figure 3 for SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills
Figure 4 for SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills
Viaarxiv icon

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Add code
Mar 10, 2025
Figure 1 for Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
Figure 2 for Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
Figure 3 for Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
Figure 4 for Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
Viaarxiv icon

Beyond Browsing: API-Based Web Agents

Add code
Oct 21, 2024
Figure 1 for Beyond Browsing: API-Based Web Agents
Figure 2 for Beyond Browsing: API-Based Web Agents
Figure 3 for Beyond Browsing: API-Based Web Agents
Figure 4 for Beyond Browsing: API-Based Web Agents
Viaarxiv icon

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Add code
Oct 21, 2024
Viaarxiv icon