Picture for Wenwei Zhang

Wenwei Zhang

Training Language Models to Critique With Multi-agent Feedback

Add code
Oct 20, 2024
Viaarxiv icon

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Add code
Sep 26, 2024
Figure 1 for LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Figure 2 for LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Figure 3 for LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Figure 4 for LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Viaarxiv icon

SLAM assisted 3D tracking system for laparoscopic surgery

Add code
Sep 18, 2024
Viaarxiv icon

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Add code
Jul 29, 2024
Viaarxiv icon

CIBench: Evaluating Your LLMs with a Code Interpreter Plugin

Add code
Jul 15, 2024
Viaarxiv icon

4D Contrastive Superflows are Dense 3D Representation Learners

Add code
Jul 10, 2024
Viaarxiv icon

ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models

Add code
Jul 05, 2024
Viaarxiv icon

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Add code
Jul 03, 2024
Figure 1 for InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Figure 2 for InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Figure 3 for InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Figure 4 for InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Viaarxiv icon

Empowering 3D Visual Grounding with Reasoning Capabilities

Add code
Jul 02, 2024
Viaarxiv icon

InternLM-Law: An Open Source Chinese Legal Large Language Model

Add code
Jun 21, 2024
Viaarxiv icon