Yaowei Wang

Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization

Dec 13, 2024

Towards Long Video Understanding via Fine-detailed Video Story Generation

Dec 09, 2024

Do We Need to Design Specific Diffusion Models for Different Tasks? Try ONE-PIC

Dec 07, 2024

CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs

Nov 19, 2024

OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling

Oct 10, 2024

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Oct 08, 2024

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

Sep 09, 2024

Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS

Aug 29, 2024

Event Stream based Sign Language Translation: A High-Definition Benchmark Dataset and A New Algorithm

Aug 20, 2024

Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers

Jul 26, 2024