Jiaao Chen

Position: Standard Benchmarks Fail -- LLM Agents Present Overlooked Risks for Financial Applications

Feb 21, 2025

Dynamic Skill Adaptation for Large Language Models

Dec 26, 2024

Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review

Dec 02, 2024

DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph

Jun 25, 2024

From Scroll to Misbelief: Modeling the Unobservable Susceptibility to Misinformation on Social Media

Nov 16, 2023

Unlearn What You Want to Forget: Efficient Unlearning for LLMs

Oct 31, 2023

DyVal: Graph-informed Dynamic Evaluation of Large Language Models

Oct 05, 2023

Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models

Aug 14, 2023

Informative Path Planning of Autonomous Vehicle for Parking Occupancy Estimation

Aug 01, 2023

Can Large Language Models Transform Computational Social Science?

Apr 12, 2023