Picture for Junnan Li

Junnan Li

Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models

Add code
Mar 14, 2025
Viaarxiv icon

Generative Frame Sampler for Long Video Understanding

Add code
Mar 12, 2025
Viaarxiv icon

ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks

Add code
Mar 10, 2025
Viaarxiv icon

Reward Models Identify Consistency, Not Causality

Add code
Feb 20, 2025
Viaarxiv icon

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Add code
Jan 31, 2025
Figure 1 for Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Figure 2 for Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Figure 3 for Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Figure 4 for Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Viaarxiv icon

Aria-UI: Visual Grounding for GUI Instructions

Add code
Dec 20, 2024
Viaarxiv icon

OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs

Add code
Dec 12, 2024
Figure 1 for OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Figure 2 for OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Figure 3 for OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Figure 4 for OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Viaarxiv icon

Accelerating Video Diffusion Models via Distribution Matching

Add code
Dec 08, 2024
Figure 1 for Accelerating Video Diffusion Models via Distribution Matching
Figure 2 for Accelerating Video Diffusion Models via Distribution Matching
Figure 3 for Accelerating Video Diffusion Models via Distribution Matching
Figure 4 for Accelerating Video Diffusion Models via Distribution Matching
Viaarxiv icon

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Add code
Nov 20, 2024
Figure 1 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 2 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 3 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 4 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Viaarxiv icon

Aria: An Open Multimodal Native Mixture-of-Experts Model

Add code
Oct 08, 2024
Figure 1 for Aria: An Open Multimodal Native Mixture-of-Experts Model
Figure 2 for Aria: An Open Multimodal Native Mixture-of-Experts Model
Figure 3 for Aria: An Open Multimodal Native Mixture-of-Experts Model
Figure 4 for Aria: An Open Multimodal Native Mixture-of-Experts Model
Viaarxiv icon