Picture for Yu-Chiang Frank Wang

Yu-Chiang Frank Wang

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Add code
Jan 14, 2025
Viaarxiv icon

Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits

Add code
Jan 07, 2025
Figure 1 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 2 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 3 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 4 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Viaarxiv icon

Toward Scene Graph and Layout Guided Complex 3D Scene Generation

Add code
Dec 29, 2024
Viaarxiv icon

Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering

Add code
Dec 05, 2024
Viaarxiv icon

VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

Add code
Dec 02, 2024
Viaarxiv icon

Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models

Add code
Nov 28, 2024
Figure 1 for Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models
Figure 2 for Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models
Figure 3 for Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models
Figure 4 for Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models
Viaarxiv icon

PromptHSI: Universal Hyperspectral Image Restoration Framework for Composite Degradation

Add code
Nov 24, 2024
Viaarxiv icon

NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts

Add code
Nov 08, 2024
Viaarxiv icon

EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

Add code
Oct 28, 2024
Viaarxiv icon

Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data

Add code
Sep 30, 2024
Figure 1 for Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data
Figure 2 for Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data
Figure 3 for Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data
Figure 4 for Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data
Viaarxiv icon