Picture for Dongyan Zhao

Dongyan Zhao

Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding

Add code
Dec 23, 2024
Viaarxiv icon

VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

Add code
Nov 27, 2024
Viaarxiv icon

FIRP: Faster LLM inference via future intermediate representation prediction

Add code
Oct 27, 2024
Viaarxiv icon

Understanding Multimodal Hallucination with Parameter-Free Representation Alignment

Add code
Sep 02, 2024
Viaarxiv icon

ReMamba: Equip Mamba with Effective Long-Sequence Modeling

Add code
Sep 01, 2024
Viaarxiv icon

Evidence-Enhanced Triplet Generation Framework for Hallucination Alleviation in Generative Question Answering

Add code
Aug 27, 2024
Viaarxiv icon

Internal and External Knowledge Interactive Refinement Framework for Knowledge-Intensive Question Answering

Add code
Aug 23, 2024
Viaarxiv icon

In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting

Add code
Aug 23, 2024
Viaarxiv icon

End-to-End Video Question Answering with Frame Scoring Mechanisms and Adaptive Sampling

Add code
Jul 23, 2024
Viaarxiv icon

Graph-Structured Speculative Decoding

Add code
Jul 23, 2024
Figure 1 for Graph-Structured Speculative Decoding
Figure 2 for Graph-Structured Speculative Decoding
Figure 3 for Graph-Structured Speculative Decoding
Figure 4 for Graph-Structured Speculative Decoding
Viaarxiv icon