Linxin Song

CoAct-1: Computer-using Agents with Coding as Actions

Aug 05, 2025
Via arXiv

The Hallucination Tax of Reinforcement Finetuning

May 20, 2025
Via arXiv

A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

May 20, 2025
Via arXiv

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning

Apr 07, 2025
Via arXiv

Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification

Apr 06, 2025
Via arXiv

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base

Mar 30, 2025
Via arXiv

Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training

Dec 11, 2024
Via arXiv

ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models

Dec 09, 2024
Via arXiv

Disentangling Likes and Dislikes in Personalized Generative Explainable Recommendation

Oct 17, 2024
Via arXiv

Rethinking LLM-based Preference Evaluation

Jul 01, 2024
Via arXiv