Manling Li

LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models

Dec 03, 2024
Via arXiv

IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos

Nov 18, 2024
Via arXiv

HourVideo: 1-Hour Video-Language Understanding

Nov 07, 2024
Via arXiv

MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders

Oct 09, 2024
Via arXiv

Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Oct 09, 2024
Via arXiv

Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models

Jul 10, 2024
Via arXiv

Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action

May 28, 2024
Via arXiv

Text-Based Reasoning About Vector Graphics

Apr 10, 2024
Via arXiv

Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models

Mar 26, 2024
Via arXiv

Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate

Feb 12, 2024
Via arXiv