Picture for Yutao Zhu

Yutao Zhu

MCLRL: A Multi-Domain Contrastive Learning with Reinforcement Learning Framework for Few-Shot Modulation Recognition

Add code
Feb 26, 2025
Viaarxiv icon

mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data

Add code
Feb 12, 2025
Viaarxiv icon

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Add code
Jan 09, 2025
Figure 1 for Search-o1: Agentic Search-Enhanced Large Reasoning Models
Figure 2 for Search-o1: Agentic Search-Enhanced Large Reasoning Models
Figure 3 for Search-o1: Agentic Search-Enhanced Large Reasoning Models
Figure 4 for Search-o1: Agentic Search-Enhanced Large Reasoning Models
Viaarxiv icon

YuLan-Mini: An Open Data-efficient Language Model

Add code
Dec 24, 2024
Figure 1 for YuLan-Mini: An Open Data-efficient Language Model
Figure 2 for YuLan-Mini: An Open Data-efficient Language Model
Figure 3 for YuLan-Mini: An Open Data-efficient Language Model
Figure 4 for YuLan-Mini: An Open Data-efficient Language Model
Viaarxiv icon

Progressive Multimodal Reasoning via Active Retrieval

Add code
Dec 19, 2024
Figure 1 for Progressive Multimodal Reasoning via Active Retrieval
Figure 2 for Progressive Multimodal Reasoning via Active Retrieval
Figure 3 for Progressive Multimodal Reasoning via Active Retrieval
Figure 4 for Progressive Multimodal Reasoning via Active Retrieval
Viaarxiv icon

Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models

Add code
Dec 19, 2024
Viaarxiv icon

Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model

Add code
Dec 11, 2024
Viaarxiv icon

From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning

Add code
Nov 06, 2024
Figure 1 for From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Figure 2 for From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Figure 3 for From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Figure 4 for From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Viaarxiv icon

Little Giants: Synthesizing High-Quality Embedding Data at Scale

Add code
Oct 24, 2024
Viaarxiv icon

A Survey of Conversational Search

Add code
Oct 21, 2024
Figure 1 for A Survey of Conversational Search
Figure 2 for A Survey of Conversational Search
Figure 3 for A Survey of Conversational Search
Figure 4 for A Survey of Conversational Search
Viaarxiv icon