Picture for Kai Ma

Kai Ma

School of Electrical Engineering, Yanshan University, Qinhuangdao, China

SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks

Add code
Oct 02, 2024
Viaarxiv icon

FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction

Add code
Sep 26, 2024
Viaarxiv icon

IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation

Add code
Sep 12, 2024
Viaarxiv icon

Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector

Add code
Sep 10, 2024
Figure 1 for Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector
Figure 2 for Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector
Figure 3 for Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector
Figure 4 for Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector
Viaarxiv icon

Dynamic and Compressive Adaptation of Transformers From Images to Videos

Add code
Aug 14, 2024
Figure 1 for Dynamic and Compressive Adaptation of Transformers From Images to Videos
Figure 2 for Dynamic and Compressive Adaptation of Transformers From Images to Videos
Figure 3 for Dynamic and Compressive Adaptation of Transformers From Images to Videos
Figure 4 for Dynamic and Compressive Adaptation of Transformers From Images to Videos
Viaarxiv icon

VFIMamba: Video Frame Interpolation with State Space Models

Add code
Jul 02, 2024
Viaarxiv icon

Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning

Add code
May 30, 2024
Viaarxiv icon

StableDrag: Stable Dragging for Point-based Image Editing

Add code
Mar 07, 2024
Viaarxiv icon

Adversarial Medical Image with Hierarchical Feature Hiding

Add code
Dec 04, 2023
Viaarxiv icon

A New Perspective to Boost Vision Transformer for Medical Image Classification

Add code
Jan 03, 2023
Viaarxiv icon