Yao Liu

From Demonstrations to Rewards: Alignment Without Explicit Human Preferences

Mar 15, 2025

D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning

Mar 14, 2025

Human Cognition Inspired RAG with Knowledge Graph for Complex Problem Solving

Mar 09, 2025

Regularized Multi-LLMs Collaboration for Enhanced Score-based Causal Discovery

Nov 27, 2024

Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens

Oct 18, 2024

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents

Oct 17, 2024

Causality-Aware Transformer Networks for Robotic Navigation

Sep 04, 2024

EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data

Jun 25, 2024

Learning the Target Network in Function Space

Jun 03, 2024

Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising

May 12, 2024