Yiping Bao

Kimi K2.5: Visual Agentic Intelligence

Feb 02, 2026
Via arXiv

WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models

Jan 28, 2026
Via arXiv

Towards Pixel-Level VLM Perception via Simple Points Prediction

Jan 27, 2026
Via arXiv

BabyVision: Visual Reasoning Beyond Language

Jan 10, 2026
Via arXiv

Kimi K2: Open Agentic Intelligence

Jul 28, 2025
Via arXiv

Kimi-VL Technical Report

Apr 10, 2025
Via arXiv

Multi-modal Relation Distillation for Unified 3D Representation Learning

Jul 19, 2024
Via arXiv

W2N:Switching From Weak Supervision to Noisy Supervision for Object Detection

Jul 25, 2022
Via arXiv

Prototypical Contrastive Language Image Pretraining

Jun 22, 2022
Via arXiv

Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association

Nov 25, 2021
Via arXiv