Yulei Qin

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Dec 31, 2025
Via arXiv

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Dec 26, 2025
Via arXiv

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Sep 26, 2025
Via arXiv

Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

Dec 19, 2024
Via arXiv

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Aug 28, 2024
Via arXiv

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Aug 07, 2024
Via arXiv

RESTORE: Towards Feature Shift for Vision-Language Prompt Learning

Mar 10, 2024
Via arXiv

Sinkhorn Distance Minimization for Knowledge Distillation

Feb 27, 2024
Via arXiv

Towards Robust Text Retrieval with Progressive Learning

Nov 20, 2023
Via arXiv

CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes

Oct 15, 2023
Via arXiv