Yunxuan Li

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Jul 22, 2024
Via arXiv

Improving Multi-Agent Debate with Sparse Communication Topology

Jun 17, 2024
Via arXiv

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Jun 05, 2024
Via arXiv

Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision

Feb 05, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

Evidential Active Recognition: Intelligent and Prudent Open-World Embodied Perception

Nov 23, 2023
Via arXiv

Enable Language Models to Implicitly Learn Self-Improvement From Data

Oct 05, 2023
Via arXiv

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts

May 24, 2023
Via arXiv