Yuran Wang

DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies

Mar 19, 2025

GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation

Mar 12, 2025

Baichuan-Omni-1.5 Technical Report

Jan 26, 2025

Ocean-OCR: Towards General OCR Application via a Vision-Language Model

Jan 26, 2025

Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching

Nov 14, 2024

ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding

Oct 29, 2024

Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs

Oct 15, 2024

Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation

Sep 30, 2024

Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition

Jun 17, 2024

Terrain Point Cloud Inpainting via Signal Decomposition

Apr 04, 2024