Picture for Haodong Duan

Haodong Duan

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Add code
Dec 30, 2025
Viaarxiv icon

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Add code
Nov 19, 2025
Figure 1 for Think Visually, Reason Textually: Vision-Language Synergy in ARC
Figure 2 for Think Visually, Reason Textually: Vision-Language Synergy in ARC
Figure 3 for Think Visually, Reason Textually: Vision-Language Synergy in ARC
Figure 4 for Think Visually, Reason Textually: Vision-Language Synergy in ARC
Viaarxiv icon

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Add code
Nov 18, 2025
Viaarxiv icon

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Add code
Oct 31, 2025
Viaarxiv icon

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Add code
Sep 26, 2025
Figure 1 for SPARK: Synergistic Policy And Reward Co-Evolving Framework
Figure 2 for SPARK: Synergistic Policy And Reward Co-Evolving Framework
Figure 3 for SPARK: Synergistic Policy And Reward Co-Evolving Framework
Figure 4 for SPARK: Synergistic Policy And Reward Co-Evolving Framework
Viaarxiv icon

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Figure 1 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 2 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 3 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 4 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Viaarxiv icon

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Add code
Aug 25, 2025
Figure 1 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 2 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 3 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 4 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Viaarxiv icon

OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems

Add code
Jun 12, 2025
Figure 1 for OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems
Figure 2 for OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems
Figure 3 for OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems
Figure 4 for OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems
Viaarxiv icon

Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings

Add code
Jun 05, 2025
Viaarxiv icon

MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Add code
May 29, 2025
Viaarxiv icon