Picture for Jingyun Hua

Jingyun Hua

DiaDem: Advancing Dialogue Descriptions in Audiovisual Video Captioning for Multimodal Large Language Models

Add code
Jan 27, 2026
Viaarxiv icon

KlingAvatar 2.0 Technical Report

Add code
Dec 15, 2025
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Add code
Apr 14, 2025
Figure 1 for Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
Figure 2 for Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
Figure 3 for Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
Figure 4 for Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
Viaarxiv icon

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Add code
Feb 28, 2025
Figure 1 for HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Figure 2 for HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Figure 3 for HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Figure 4 for HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Viaarxiv icon

Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese

Add code
Oct 14, 2021
Figure 1 for Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese
Figure 2 for Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese
Figure 3 for Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese
Figure 4 for Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese
Viaarxiv icon