Picture for Kai Hu

Kai Hu

DeepSeek-V3 Technical Report

Add code
Dec 27, 2024
Figure 1 for DeepSeek-V3 Technical Report
Figure 2 for DeepSeek-V3 Technical Report
Figure 3 for DeepSeek-V3 Technical Report
Figure 4 for DeepSeek-V3 Technical Report
Viaarxiv icon

TravelAgent: Generative Agents in the Built Environment

Add code
Dec 25, 2024
Viaarxiv icon

Explicit Relational Reasoning Network for Scene Text Detection

Add code
Dec 19, 2024
Figure 1 for Explicit Relational Reasoning Network for Scene Text Detection
Figure 2 for Explicit Relational Reasoning Network for Scene Text Detection
Figure 3 for Explicit Relational Reasoning Network for Scene Text Detection
Figure 4 for Explicit Relational Reasoning Network for Scene Text Detection
Viaarxiv icon

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Add code
Dec 13, 2024
Figure 1 for DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Figure 2 for DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Figure 3 for DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Figure 4 for DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Viaarxiv icon

DocTabQA: Answering Questions from Long Documents Using Tables

Add code
Aug 21, 2024
Viaarxiv icon

Mutagenesis screen to map the functionals of parameters of Large Language Models

Add code
Aug 21, 2024
Viaarxiv icon

Empowering Graph Invariance Learning with Deep Spurious Infomax

Add code
Jul 13, 2024
Figure 1 for Empowering Graph Invariance Learning with Deep Spurious Infomax
Figure 2 for Empowering Graph Invariance Learning with Deep Spurious Infomax
Figure 3 for Empowering Graph Invariance Learning with Deep Spurious Infomax
Figure 4 for Empowering Graph Invariance Learning with Deep Spurious Infomax
Viaarxiv icon

CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens

Add code
Jul 09, 2024
Viaarxiv icon

RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection

Add code
May 30, 2024
Figure 1 for RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Figure 2 for RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Figure 3 for RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Figure 4 for RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Viaarxiv icon

Slight Corruption in Pre-training Data Makes Better Diffusion Models

Add code
May 30, 2024
Figure 1 for Slight Corruption in Pre-training Data Makes Better Diffusion Models
Figure 2 for Slight Corruption in Pre-training Data Makes Better Diffusion Models
Figure 3 for Slight Corruption in Pre-training Data Makes Better Diffusion Models
Figure 4 for Slight Corruption in Pre-training Data Makes Better Diffusion Models
Viaarxiv icon