Picture for Jiaming Li

Jiaming Li

Kimi-VL Technical Report

Add code
Apr 10, 2025
Viaarxiv icon

Med-LEGO: Editing and Adapting toward Generalist Medical Image Diagnosis

Add code
Mar 03, 2025
Viaarxiv icon

MITracker: Multi-View Integration for Visual Object Tracking

Add code
Feb 27, 2025
Viaarxiv icon

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

Add code
Jan 08, 2025
Viaarxiv icon

Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding

Add code
Jan 03, 2025
Viaarxiv icon

PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation

Add code
Oct 02, 2024
Figure 1 for PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation
Figure 2 for PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation
Figure 3 for PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation
Figure 4 for PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation
Viaarxiv icon

Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models

Add code
Sep 27, 2024
Figure 1 for Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
Figure 2 for Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
Figure 3 for Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
Figure 4 for Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
Viaarxiv icon

Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs

Add code
Jun 27, 2024
Viaarxiv icon

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models

Add code
Jun 11, 2024
Figure 1 for II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models
Figure 2 for II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models
Figure 3 for II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models
Figure 4 for II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models
Viaarxiv icon

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection

Add code
Jun 01, 2024
Viaarxiv icon