Huishuai Zhang

VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

Nov 27, 2024

AIDBench: A benchmark for evaluating the authorship identification capability of large language models

Nov 20, 2024

Understanding Multimodal Hallucination with Parameter-Free Representation Alignment

Sep 02, 2024

ReMamba: Equip Mamba with Effective Long-Sequence Modeling

Sep 01, 2024

Evidence-Enhanced Triplet Generation Framework for Hallucination Alleviation in Generative Question Answering

Aug 27, 2024

Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules

Jul 09, 2024

Efficient Continual Pre-training by Mitigating the Stability Gap

Jun 21, 2024

Automatic Jailbreaking of the Text-to-Image Generative AI Systems

May 28, 2024

xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

May 22, 2024

©Plug-in Authorization for Human Content Copyright Protection in Text-to-Image Model

Apr 18, 2024