Picture for Wenming Yang

Wenming Yang

CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs

Add code
Nov 19, 2024
Viaarxiv icon

Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection

Add code
Nov 05, 2024
Viaarxiv icon

MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors

Add code
Oct 25, 2024
Figure 1 for MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Figure 2 for MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Figure 3 for MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Figure 4 for MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Viaarxiv icon

ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution

Add code
Oct 17, 2024
Figure 1 for ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution
Figure 2 for ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution
Figure 3 for ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution
Figure 4 for ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution
Viaarxiv icon

CONSULT: Contrastive Self-Supervised Learning for Few-shot Tumor Detection

Add code
Oct 15, 2024
Viaarxiv icon

RICASSO: Reinforced Imbalance Learning with Class-Aware Self-Supervised Outliers Exposure

Add code
Oct 14, 2024
Figure 1 for RICASSO: Reinforced Imbalance Learning with Class-Aware Self-Supervised Outliers Exposure
Figure 2 for RICASSO: Reinforced Imbalance Learning with Class-Aware Self-Supervised Outliers Exposure
Figure 3 for RICASSO: Reinforced Imbalance Learning with Class-Aware Self-Supervised Outliers Exposure
Figure 4 for RICASSO: Reinforced Imbalance Learning with Class-Aware Self-Supervised Outliers Exposure
Viaarxiv icon

Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation

Add code
Aug 29, 2024
Figure 1 for Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Figure 2 for Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Figure 3 for Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Figure 4 for Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Viaarxiv icon

FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant

Add code
Aug 19, 2024
Viaarxiv icon

Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network

Add code
Aug 07, 2024
Figure 1 for Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
Figure 2 for Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
Figure 3 for Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
Figure 4 for Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
Viaarxiv icon

GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution

Add code
Jul 25, 2024
Viaarxiv icon