Xing Sun

T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs

Dec 02, 2024
Via arXiv

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

Nov 22, 2024
Via arXiv

Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Nov 01, 2024
Via arXiv

Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing

Sep 25, 2024
Via arXiv

CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data

Sep 25, 2024
Via arXiv

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Aug 28, 2024
Via arXiv

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Aug 09, 2024
Via arXiv

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Aug 07, 2024
Via arXiv

Multimodal Label Relevance Ranking via Reinforcement Learning

Jul 18, 2024
Via arXiv

Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence

Jun 16, 2024
Via arXiv