Picture for Zhonghong Ou

Zhonghong Ou

GUI-AC: Enhancing Continual Learning in GUI Agents

Add code
Jun 09, 2026
Viaarxiv icon

5% > 100%: Flatness Preference is All You Need for Multimodal Parameter-Efficient Fine-Tuning

Add code
Jun 09, 2026
Viaarxiv icon

ERGeoBench:A Comprehensive Benchmark for Embodied Reasoning and Geo-localization in Multimodal Large Language Models

Add code
May 29, 2026
Viaarxiv icon

SliderQuant: Accurate Post-Training Quantization for LLMs

Add code
Mar 26, 2026
Viaarxiv icon

OmniRAG-Agent: Agentic Omnimodal Reasoning for Low-Resource Long Audio-Video Question Answering

Add code
Feb 03, 2026
Viaarxiv icon

CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product

Add code
Nov 17, 2025
Viaarxiv icon

A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning

Add code
Sep 04, 2025
Viaarxiv icon

Efficient Robotic Policy Learning via Latent Space Backward Planning

Add code
May 11, 2025
Viaarxiv icon

SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL

Add code
Feb 17, 2025
Figure 1 for SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL
Figure 2 for SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL
Figure 3 for SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL
Figure 4 for SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL
Viaarxiv icon

TSVC:Tripartite Learning with Semantic Variation Consistency for Robust Image-Text Retrieval

Add code
Jan 19, 2025
Viaarxiv icon