Picture for Zihan Ma

Zihan Ma

JudgeFlow: Agentic Workflow Optimization via Block Judge

Add code
Jan 12, 2026
Viaarxiv icon

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Add code
Nov 18, 2025
Viaarxiv icon

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

Add code
Nov 11, 2025
Figure 1 for How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
Figure 2 for How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
Figure 3 for How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
Figure 4 for How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
Viaarxiv icon

BrainMCLIP: Brain Image Decoding with Multi-Layer feature Fusion of CLIP

Add code
Oct 22, 2025
Viaarxiv icon

DiFaR: Enhancing Multimodal Misinformation Detection with Diverse, Factual, and Relevant Rationales

Add code
Aug 14, 2025
Viaarxiv icon

TrajEvo: Trajectory Prediction Heuristics Design via LLM-driven Evolution

Add code
Aug 07, 2025
Viaarxiv icon

Rethinking Verification for LLM Code Generation: From Generation to Testing

Add code
Jul 09, 2025
Viaarxiv icon

Coding Triangle: How Does Large Language Model Understand Code?

Add code
Jul 08, 2025
Viaarxiv icon

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Add code
May 26, 2025
Viaarxiv icon

TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution

Add code
May 07, 2025
Figure 1 for TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution
Figure 2 for TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution
Figure 3 for TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution
Figure 4 for TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution
Viaarxiv icon