Picture for Xianghe Pang

Xianghe Pang

SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents

Add code
Dec 17, 2024
Viaarxiv icon

Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review

Add code
Dec 02, 2024
Figure 1 for Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review
Figure 2 for Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review
Figure 3 for Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review
Figure 4 for Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review
Viaarxiv icon

Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation

Add code
Oct 18, 2024
Viaarxiv icon

Self-Alignment of Large Language Models via Monopolylogue-based Social Scene Simulation

Add code
Feb 08, 2024
Viaarxiv icon

Pragmatic Communication in Multi-Agent Collaborative Perception

Add code
Jan 23, 2024
Viaarxiv icon