Picture for Songyang Gao

Songyang Gao

Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data

Add code
Aug 27, 2024
Viaarxiv icon

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Add code
Jun 06, 2024
Viaarxiv icon

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Add code
Apr 09, 2024
Figure 1 for Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Figure 2 for Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Figure 3 for Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Figure 4 for Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Viaarxiv icon

The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis

Add code
Apr 01, 2024
Viaarxiv icon

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

Add code
Mar 18, 2024
Viaarxiv icon

ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages

Add code
Feb 16, 2024
Viaarxiv icon

Navigating the OverKill in Large Language Models

Add code
Jan 31, 2024
Viaarxiv icon

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

Add code
Jan 21, 2024
Viaarxiv icon

RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning

Add code
Jan 19, 2024
Viaarxiv icon

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios

Add code
Jan 14, 2024
Viaarxiv icon