Picture for Tong Xiao

Tong Xiao

Jack

TLDR: Token-Level Detective Reward Model for Large Vision Language Models

Add code
Oct 07, 2024
Figure 1 for TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Figure 2 for TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Figure 3 for TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Figure 4 for TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Viaarxiv icon

Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-context Models

Add code
Oct 07, 2024
Viaarxiv icon

LRHP: Learning Representations for Human Preferences via Preference Pairs

Add code
Oct 06, 2024
Viaarxiv icon

A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation

Add code
Sep 24, 2024
Figure 1 for A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation
Figure 2 for A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation
Figure 3 for A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation
Figure 4 for A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation
Viaarxiv icon

More Effective LLM Compressed Tokens with Uniformly Spread Position Identifiers and Compression Loss

Add code
Sep 22, 2024
Viaarxiv icon

SpMis: An Investigation of Synthetic Spoken Misinformation Detection

Add code
Sep 17, 2024
Figure 1 for SpMis: An Investigation of Synthetic Spoken Misinformation Detection
Figure 2 for SpMis: An Investigation of Synthetic Spoken Misinformation Detection
Figure 3 for SpMis: An Investigation of Synthetic Spoken Misinformation Detection
Figure 4 for SpMis: An Investigation of Synthetic Spoken Misinformation Detection
Viaarxiv icon

NDP: Next Distribution Prediction as a More Broad Target

Add code
Aug 30, 2024
Viaarxiv icon

RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data

Add code
Aug 22, 2024
Figure 1 for RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data
Figure 2 for RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data
Figure 3 for RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data
Figure 4 for RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data
Viaarxiv icon

Cross-layer Attention Sharing for Large Language Models

Add code
Aug 04, 2024
Figure 1 for Cross-layer Attention Sharing for Large Language Models
Figure 2 for Cross-layer Attention Sharing for Large Language Models
Figure 3 for Cross-layer Attention Sharing for Large Language Models
Figure 4 for Cross-layer Attention Sharing for Large Language Models
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon