Yunxuan Li

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Jul 22, 2024
Via arXiv

Improving Multi-Agent Debate with Sparse Communication Topology

Jun 17, 2024
Via arXiv

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Jun 05, 2024
Via arXiv

Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision

Feb 05, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

Evidential Active Recognition: Intelligent and Prudent Open-World Embodied Perception

Nov 23, 2023
Via arXiv

Enable Language Models to Implicitly Learn Self-Improvement From Data

Oct 05, 2023
Via arXiv

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts

May 24, 2023
Via arXiv