Picture for Yang Cao

Yang Cao

Sherman

Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling

Add code
Feb 12, 2025
Viaarxiv icon

TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer

Add code
Feb 05, 2025
Viaarxiv icon

OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds

Add code
Feb 05, 2025
Viaarxiv icon

Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration

Add code
Jan 27, 2025
Figure 1 for Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration
Figure 2 for Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration
Figure 3 for Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration
Figure 4 for Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration
Viaarxiv icon

TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection

Add code
Jan 21, 2025
Figure 1 for TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection
Figure 2 for TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection
Figure 3 for TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection
Figure 4 for TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection
Viaarxiv icon

VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization

Add code
Jan 16, 2025
Viaarxiv icon

RAIN: Real-time Animation of Infinite Video Stream

Add code
Dec 27, 2024
Viaarxiv icon

Grams: Gradient Descent with Adaptive Momentum Scaling

Add code
Dec 22, 2024
Viaarxiv icon

Privacy in Fine-tuning Large Language Models: Attacks, Defenses, and Future Directions

Add code
Dec 21, 2024
Figure 1 for Privacy in Fine-tuning Large Language Models: Attacks, Defenses, and Future Directions
Figure 2 for Privacy in Fine-tuning Large Language Models: Attacks, Defenses, and Future Directions
Figure 3 for Privacy in Fine-tuning Large Language Models: Attacks, Defenses, and Future Directions
Viaarxiv icon

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

Add code
Dec 12, 2024
Viaarxiv icon