Picture for Ruochen Xu

Ruochen Xu

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration

Add code
Nov 25, 2024
Figure 1 for ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
Figure 2 for ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
Figure 3 for ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
Figure 4 for ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
Viaarxiv icon

OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding

Add code
Jul 06, 2024
Figure 1 for OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding
Figure 2 for OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding
Figure 3 for OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding
Figure 4 for OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding
Viaarxiv icon

Preserving Knowledge in Large Language Model: A Model-Agnostic Self-Decompression Approach

Add code
Jun 17, 2024
Viaarxiv icon

Rho-1: Not All Tokens Are What You Need

Add code
Apr 11, 2024
Figure 1 for Rho-1: Not All Tokens Are What You Need
Figure 2 for Rho-1: Not All Tokens Are What You Need
Figure 3 for Rho-1: Not All Tokens Are What You Need
Figure 4 for Rho-1: Not All Tokens Are What You Need
Viaarxiv icon

ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models

Add code
Mar 08, 2024
Viaarxiv icon

SciAgent: Tool-augmented Language Models for Scientific Reasoning

Add code
Feb 21, 2024
Viaarxiv icon

DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents

Add code
Feb 21, 2024
Figure 1 for DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents
Figure 2 for DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents
Figure 3 for DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents
Figure 4 for DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents
Viaarxiv icon

Supervised Knowledge Makes Large Language Models Better In-context Learners

Add code
Dec 26, 2023
Figure 1 for Supervised Knowledge Makes Large Language Models Better In-context Learners
Figure 2 for Supervised Knowledge Makes Large Language Models Better In-context Learners
Figure 3 for Supervised Knowledge Makes Large Language Models Better In-context Learners
Figure 4 for Supervised Knowledge Makes Large Language Models Better In-context Learners
Viaarxiv icon

Language Models can be Logical Solvers

Add code
Nov 10, 2023
Figure 1 for Language Models can be Logical Solvers
Figure 2 for Language Models can be Logical Solvers
Figure 3 for Language Models can be Logical Solvers
Figure 4 for Language Models can be Logical Solvers
Viaarxiv icon

In-Context Demonstration Selection with Cross Entropy Difference

Add code
May 24, 2023
Viaarxiv icon