Limao Xiong

Multi-Programming Language Sandbox for LLMs

Oct 30, 2024
Via arXiv

RMB: Comprehensively Benchmarking Reward Models in LLM Alignment

Oct 13, 2024
Via arXiv

MetaRM: Shifted Distributions Alignment via Meta-Learning

May 01, 2024
Via arXiv

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

Mar 18, 2024
Via arXiv

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

Feb 05, 2024
Via arXiv

The Rise and Potential of Large Language Model Based Agents: A Survey

Sep 19, 2023
Via arXiv

Secrets of RLHF in Large Language Models Part I: PPO

Jul 18, 2023
Via arXiv

A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition

May 21, 2023
Via arXiv

MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective

Apr 09, 2022
Via arXiv