Zesen Cheng

Large Language Models Can Self-Improve in Long-context Reasoning

Nov 12, 2024
Via arXiv

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Oct 22, 2024
Via arXiv

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Oct 16, 2024
Via arXiv

A Survey on the Honesty of Large Language Models

Sep 27, 2024
Via arXiv

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Jul 15, 2024
Via arXiv

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Jun 11, 2024
Via arXiv

GraCo: Granularity-Controllable Interactive Segmentation

May 01, 2024
Via arXiv

Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation

Jan 18, 2024
Via arXiv

FreestyleRet: Retrieving Images from Style-Diversified Queries

Dec 08, 2023
Via arXiv

NewsDialogues: Towards Proactive News Grounded Conversation

Aug 12, 2023
Via arXiv