Zhiqi Huang

Kimi K2.5: Visual Agentic Intelligence

Feb 02, 2026
Via arXiv

WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models

Jan 28, 2026
Via arXiv

Towards Pixel-Level VLM Perception via Simple Points Prediction

Jan 27, 2026
Via arXiv

BabyVision: Visual Reasoning Beyond Language

Jan 10, 2026
Via arXiv

MMErroR: A Benchmark for Erroneous Reasoning in Vision-Language Models

Jan 06, 2026
Via arXiv

Kimi K2: Open Agentic Intelligence

Jul 28, 2025
Via arXiv

G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

May 19, 2025
Via arXiv

Kimi-VL Technical Report

Apr 10, 2025
Via arXiv

Efficient Inference for Large Reasoning Models: A Survey

Mar 29, 2025
Via arXiv

A Survey of Model Architectures in Information Retrieval

Feb 20, 2025
Via arXiv