Picture for Le Tian

Le Tian

VersaViT: Enhancing MLLM Vision Backbones via Task-Guided Optimization

Add code
Feb 10, 2026
Viaarxiv icon

POINTS-GUI-G: GUI-Grounding Journey

Add code
Feb 06, 2026
Viaarxiv icon

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Add code
Feb 03, 2026
Viaarxiv icon

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Add code
Dec 11, 2024
Figure 1 for POINTS1.5: Building a Vision-Language Model towards Real World Applications
Figure 2 for POINTS1.5: Building a Vision-Language Model towards Real World Applications
Figure 3 for POINTS1.5: Building a Vision-Language Model towards Real World Applications
Figure 4 for POINTS1.5: Building a Vision-Language Model towards Real World Applications
Viaarxiv icon

POINTS: Improving Your Vision-language Model with Affordable Strategies

Add code
Sep 07, 2024
Viaarxiv icon

Rethinking Overlooked Aspects in Vision-Language Models

Add code
May 20, 2024
Viaarxiv icon