Picture for Chao Wang

Chao Wang

Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching

Add code
Apr 08, 2025
Viaarxiv icon

StarFlow: Generating Structured Workflow Outputs From Sketch Images

Add code
Mar 27, 2025
Viaarxiv icon

Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings

Add code
Mar 25, 2025
Viaarxiv icon

RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment

Add code
Mar 18, 2025
Viaarxiv icon

DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models

Add code
Mar 17, 2025
Viaarxiv icon

Can LLMs Formally Reason as Abstract Interpreters for Program Analysis?

Add code
Mar 16, 2025
Viaarxiv icon

FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance

Add code
Mar 07, 2025
Figure 1 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 2 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 3 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 4 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Viaarxiv icon

TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction

Add code
Mar 06, 2025
Viaarxiv icon

Advancing vision-language models in front-end development via data synthesis

Add code
Mar 03, 2025
Viaarxiv icon

Can Large Language Models Unveil the Mysteries? An Exploration of Their Ability to Unlock Information in Complex Scenarios

Add code
Feb 27, 2025
Viaarxiv icon