Fei Richard Yu

Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking

Mar 14, 2025
Via arXiv

CLIP-Optimized Multimodal Image Enhancement via ISP-CNN Fusion for Coal Mine IoVT under Uneven Illumination

Feb 26, 2025
Via arXiv

Exploring Embodied Multimodal Large Models: Development, Datasets, and Future Directions

Feb 21, 2025
Via arXiv

PAFT: Prompt-Agnostic Fine-Tuning

Feb 18, 2025
Via arXiv

EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models

Feb 06, 2025
Via arXiv

Frequency-aware Event Cloud Network

Dec 30, 2024
Via arXiv

PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis

Dec 11, 2024
Via arXiv

A Review of Human Emotion Synthesis Based on Generative Technology

Dec 10, 2024
Via arXiv

EventGPT: Event Stream Understanding with Multimodal Large Language Models

Dec 01, 2024
Via arXiv

GRL-Prompt: Towards Knowledge Graph based Prompt Optimization via Reinforcement Learning

Nov 19, 2024
Via arXiv