Picture for Yaobo Liang

Yaobo Liang

RLRF4Rec: Reinforcement Learning from Recsys Feedback for Enhanced Recommendation Reranking

Add code
Oct 08, 2024
Viaarxiv icon

PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion

Add code
Mar 06, 2024
Viaarxiv icon

Competition-Level Problems are Effective LLM Evaluators

Add code
Dec 05, 2023
Viaarxiv icon

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Add code
Nov 07, 2023
Viaarxiv icon

EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation

Add code
Oct 12, 2023
Viaarxiv icon

GameEval: Evaluating LLMs on Conversational Games

Add code
Aug 19, 2023
Viaarxiv icon

Machine-Created Universal Language for Cross-lingual Transfer

Add code
May 22, 2023
Viaarxiv icon

Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast

Add code
May 19, 2023
Viaarxiv icon

Learning to Program with Natural Language

Add code
Apr 23, 2023
Viaarxiv icon

Low-code LLM: Visual Programming over LLMs

Add code
Apr 20, 2023
Viaarxiv icon