Picture for Hao Yang

Hao Yang

CompactRAG: Reducing LLM Calls and Token Overhead in Multi-Hop Question Answering

Add code
Feb 05, 2026
Viaarxiv icon

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models

Add code
Jan 28, 2026
Viaarxiv icon

Towards Pixel-Level VLM Perception via Simple Points Prediction

Add code
Jan 27, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon

Stephanie2: Thinking, Waiting, and Making Decisions Like Humans in Step-by-Step AI Social Chat

Add code
Jan 09, 2026
Viaarxiv icon

TransactionGPT

Add code
Nov 12, 2025
Viaarxiv icon

CARE-Bench: A Benchmark of Diverse Client Simulations Guided by Expert Principles for Evaluating LLMs in Psychological Counseling

Add code
Nov 12, 2025
Viaarxiv icon

What Makes Reasoning Invalid: Echo Reflection Mitigation for Large Language Models

Add code
Nov 09, 2025
Figure 1 for What Makes Reasoning Invalid: Echo Reflection Mitigation for Large Language Models
Figure 2 for What Makes Reasoning Invalid: Echo Reflection Mitigation for Large Language Models
Figure 3 for What Makes Reasoning Invalid: Echo Reflection Mitigation for Large Language Models
Figure 4 for What Makes Reasoning Invalid: Echo Reflection Mitigation for Large Language Models
Viaarxiv icon

Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation

Add code
Sep 04, 2025
Figure 1 for Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation
Figure 2 for Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation
Figure 3 for Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation
Figure 4 for Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation
Viaarxiv icon