Picture for Yanfeng Wang

Yanfeng Wang

Cooperative Medianet Innovation Center, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, China

VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality Generation

Add code
Apr 05, 2025
Viaarxiv icon

COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking

Add code
Apr 02, 2025
Viaarxiv icon

RARE: Retrieval-Augmented Reasoning Modeling

Add code
Mar 30, 2025
Viaarxiv icon

Learning to Instruct for Visual Instruction Tuning

Add code
Mar 28, 2025
Viaarxiv icon

ChatBEV: A Visual Language Model that Understands BEV Maps

Add code
Mar 21, 2025
Viaarxiv icon

FedMABench: Benchmarking Mobile Agents on Decentralized Heterogeneous User Data

Add code
Mar 07, 2025
Viaarxiv icon

Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases

Add code
Mar 06, 2025
Figure 1 for Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
Figure 2 for Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
Viaarxiv icon

DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models

Add code
Mar 05, 2025
Viaarxiv icon

M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging

Add code
Feb 27, 2025
Viaarxiv icon

Contrast-Unity for Partially-Supervised Temporal Sentence Grounding

Add code
Feb 18, 2025
Viaarxiv icon