Picture for Wenjie Tang

Wenjie Tang

TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection

Add code
Apr 01, 2025
Viaarxiv icon

DSGBench: A Diverse Strategic Game Benchmark for Evaluating LLM-based Agents in Complex Decision-Making Environments

Add code
Mar 08, 2025
Viaarxiv icon