Picture for Yuchuan Wu

Yuchuan Wu

EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning

Add code
Feb 18, 2025
Viaarxiv icon

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

Add code
Jan 08, 2025
Viaarxiv icon

SDPO: Segment-Level Direct Preference Optimization for Social Agents

Add code
Jan 03, 2025
Figure 1 for SDPO: Segment-Level Direct Preference Optimization for Social Agents
Figure 2 for SDPO: Segment-Level Direct Preference Optimization for Social Agents
Figure 3 for SDPO: Segment-Level Direct Preference Optimization for Social Agents
Figure 4 for SDPO: Segment-Level Direct Preference Optimization for Social Agents
Viaarxiv icon

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Add code
Sep 09, 2024
Figure 1 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 2 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 3 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 4 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Viaarxiv icon

FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents

Add code
Jun 21, 2024
Figure 1 for FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Figure 2 for FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Figure 3 for FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Figure 4 for FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Viaarxiv icon

A Survey on Self-Evolution of Large Language Models

Add code
Apr 22, 2024
Figure 1 for A Survey on Self-Evolution of Large Language Models
Figure 2 for A Survey on Self-Evolution of Large Language Models
Figure 3 for A Survey on Self-Evolution of Large Language Models
Figure 4 for A Survey on Self-Evolution of Large Language Models
Viaarxiv icon

Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning

Add code
Mar 29, 2024
Figure 1 for Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Figure 2 for Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Figure 3 for Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Figure 4 for Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Viaarxiv icon

Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer

Add code
Mar 29, 2024
Viaarxiv icon

Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models

Add code
Mar 04, 2024
Viaarxiv icon

Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use

Add code
Dec 07, 2023
Figure 1 for Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use
Figure 2 for Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use
Figure 3 for Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use
Figure 4 for Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use
Viaarxiv icon