Picture for Kai Hu

Kai Hu

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Add code
Dec 13, 2024
Viaarxiv icon

DocTabQA: Answering Questions from Long Documents Using Tables

Add code
Aug 21, 2024
Viaarxiv icon

Mutagenesis screen to map the functionals of parameters of Large Language Models

Add code
Aug 21, 2024
Viaarxiv icon

Empowering Graph Invariance Learning with Deep Spurious Infomax

Add code
Jul 13, 2024
Figure 1 for Empowering Graph Invariance Learning with Deep Spurious Infomax
Figure 2 for Empowering Graph Invariance Learning with Deep Spurious Infomax
Figure 3 for Empowering Graph Invariance Learning with Deep Spurious Infomax
Figure 4 for Empowering Graph Invariance Learning with Deep Spurious Infomax
Viaarxiv icon

CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens

Add code
Jul 09, 2024
Viaarxiv icon

RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection

Add code
May 30, 2024
Viaarxiv icon

Slight Corruption in Pre-training Data Makes Better Diffusion Models

Add code
May 30, 2024
Viaarxiv icon

StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models

Add code
May 24, 2024
Viaarxiv icon

DLAFormer: An End-to-End Transformer For Document Layout Analysis

Add code
May 20, 2024
Viaarxiv icon

Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization

Add code
May 15, 2024
Viaarxiv icon