Kaixin Li

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Mar 25, 2026

Towards Principled Dataset Distillation: A Spectral Distribution Perspective

Mar 02, 2026

Qwen3-Coder-Next Technical Report

Feb 28, 2026

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

Jan 29, 2026

CrownGen: Patient-customized Crown Generation via Point Diffusion Model

Dec 26, 2025

MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique

Nov 12, 2025

Grounding Computer Use Agents on Human Demonstrations

Nov 10, 2025

MemeArena: Automating Context-Aware Unbiased Evaluation of Harmfulness Understanding for Multimodal Large Language Models

Oct 31, 2025

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Jul 02, 2025

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning

May 18, 2025