Picture for Yu Wang

Yu Wang

School of Control and Computer Engineering, North China Electric Power University

What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study

Add code
Dec 17, 2024
Viaarxiv icon

Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal

Add code
Dec 15, 2024
Viaarxiv icon

A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentation

Add code
Dec 08, 2024
Viaarxiv icon

GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration

Add code
Dec 05, 2024
Figure 1 for GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration
Figure 2 for GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration
Figure 3 for GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration
Figure 4 for GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration
Viaarxiv icon

ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics

Add code
Dec 04, 2024
Viaarxiv icon

Jailbreak Large Vision-Language Models Through Multi-Modal Linkage

Add code
Dec 03, 2024
Viaarxiv icon

Personalized Multimodal Large Language Models: A Survey

Add code
Dec 03, 2024
Viaarxiv icon

DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models

Add code
Nov 27, 2024
Viaarxiv icon

LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization

Add code
Nov 26, 2024
Viaarxiv icon

ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction

Add code
Nov 26, 2024
Figure 1 for ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction
Figure 2 for ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction
Figure 3 for ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction
Figure 4 for ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction
Viaarxiv icon