Picture for Kai Ma

Kai Ma

School of Electrical Engineering, Yanshan University, Qinhuangdao, China

VRoPE: Rotary Position Embedding for Video Large Language Models

Add code
Feb 17, 2025
Viaarxiv icon

Unveiling the Capabilities of Large Language Models in Detecting Offensive Language with Annotation Disagreement

Add code
Feb 10, 2025
Viaarxiv icon

Motion-Aware Generative Frame Interpolation

Add code
Jan 07, 2025
Viaarxiv icon

Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters

Add code
Nov 27, 2024
Figure 1 for Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Figure 2 for Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Figure 3 for Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Figure 4 for Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Viaarxiv icon

SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks

Add code
Oct 02, 2024
Figure 1 for SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks
Figure 2 for SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks
Figure 3 for SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks
Figure 4 for SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks
Viaarxiv icon

FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction

Add code
Sep 26, 2024
Viaarxiv icon

IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation

Add code
Sep 12, 2024
Figure 1 for IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation
Figure 2 for IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation
Figure 3 for IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation
Figure 4 for IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation
Viaarxiv icon

Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector

Add code
Sep 10, 2024
Figure 1 for Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector
Figure 2 for Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector
Figure 3 for Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector
Figure 4 for Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector
Viaarxiv icon

Dynamic and Compressive Adaptation of Transformers From Images to Videos

Add code
Aug 14, 2024
Figure 1 for Dynamic and Compressive Adaptation of Transformers From Images to Videos
Figure 2 for Dynamic and Compressive Adaptation of Transformers From Images to Videos
Figure 3 for Dynamic and Compressive Adaptation of Transformers From Images to Videos
Figure 4 for Dynamic and Compressive Adaptation of Transformers From Images to Videos
Viaarxiv icon

VFIMamba: Video Frame Interpolation with State Space Models

Add code
Jul 02, 2024
Viaarxiv icon