Picture for Xiangyan Liu

Xiangyan Liu

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Add code
Sep 26, 2025
Figure 1 for Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Figure 2 for Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Figure 3 for Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Figure 4 for Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Viaarxiv icon

Fostering Video Reasoning via Next-Event Prediction

Add code
May 28, 2025
Figure 1 for Fostering Video Reasoning via Next-Event Prediction
Figure 2 for Fostering Video Reasoning via Next-Event Prediction
Figure 3 for Fostering Video Reasoning via Next-Event Prediction
Figure 4 for Fostering Video Reasoning via Next-Event Prediction
Viaarxiv icon

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Add code
Apr 17, 2025
Viaarxiv icon

CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases

Add code
Aug 07, 2024
Figure 1 for CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Figure 2 for CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Figure 3 for CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Figure 4 for CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Viaarxiv icon

Described Spatial-Temporal Video Detection

Add code
Jul 08, 2024
Figure 1 for Described Spatial-Temporal Video Detection
Figure 2 for Described Spatial-Temporal Video Detection
Figure 3 for Described Spatial-Temporal Video Detection
Figure 4 for Described Spatial-Temporal Video Detection
Viaarxiv icon

An Empirical Study of Training ID-Agnostic Multi-modal Sequential Recommenders

Add code
Mar 30, 2024
Figure 1 for An Empirical Study of Training ID-Agnostic Multi-modal Sequential Recommenders
Figure 2 for An Empirical Study of Training ID-Agnostic Multi-modal Sequential Recommenders
Figure 3 for An Empirical Study of Training ID-Agnostic Multi-modal Sequential Recommenders
Figure 4 for An Empirical Study of Training ID-Agnostic Multi-modal Sequential Recommenders
Viaarxiv icon

Towards Robust Multi-Modal Reasoning via Model Selection

Add code
Oct 12, 2023
Viaarxiv icon

Towards Complex-query Referring Image Segmentation: A Novel Benchmark

Add code
Sep 29, 2023
Figure 1 for Towards Complex-query Referring Image Segmentation: A Novel Benchmark
Figure 2 for Towards Complex-query Referring Image Segmentation: A Novel Benchmark
Figure 3 for Towards Complex-query Referring Image Segmentation: A Novel Benchmark
Figure 4 for Towards Complex-query Referring Image Segmentation: A Novel Benchmark
Viaarxiv icon

A Content-Driven Micro-Video Recommendation Dataset at Scale

Add code
Sep 27, 2023
Figure 1 for A Content-Driven Micro-Video Recommendation Dataset at Scale
Figure 2 for A Content-Driven Micro-Video Recommendation Dataset at Scale
Figure 3 for A Content-Driven Micro-Video Recommendation Dataset at Scale
Figure 4 for A Content-Driven Micro-Video Recommendation Dataset at Scale
Viaarxiv icon