Picture for Nanyun Peng

Nanyun Peng

Shammie

TaoBench: Do Automated Theorem Prover LLMs Generalize Beyond MathLib?

Add code
Mar 13, 2026
Viaarxiv icon

Learning Structured Reasoning via Tractable Trajectory Control

Add code
Mar 02, 2026
Viaarxiv icon

CoLyricist: Enhancing Lyric Writing with AI through Workflow-Aligned Support

Add code
Feb 26, 2026
Viaarxiv icon

Translation as a Scalable Proxy for Multilingual Evaluation

Add code
Jan 16, 2026
Viaarxiv icon

MMGR: Multi-Modal Generative Reasoning

Add code
Dec 17, 2025
Figure 1 for MMGR: Multi-Modal Generative Reasoning
Figure 2 for MMGR: Multi-Modal Generative Reasoning
Figure 3 for MMGR: Multi-Modal Generative Reasoning
Figure 4 for MMGR: Multi-Modal Generative Reasoning
Viaarxiv icon

MMPersuade: A Dataset and Evaluation Framework for Multimodal Persuasion

Add code
Oct 26, 2025
Viaarxiv icon

DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation

Add code
Oct 16, 2025
Figure 1 for DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation
Figure 2 for DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation
Figure 3 for DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation
Figure 4 for DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation
Viaarxiv icon

LLM-REVal: Can We Trust LLM Reviewers Yet?

Add code
Oct 14, 2025
Figure 1 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Figure 2 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Figure 3 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Figure 4 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Viaarxiv icon

Multilingual Routing in Mixture-of-Experts

Add code
Oct 06, 2025
Viaarxiv icon

VaPR -- Vision-language Preference alignment for Reasoning

Add code
Oct 02, 2025
Viaarxiv icon