Picture for Bo Xu

Bo Xu

EchoTrail-GUI: Building Actionable Memory for GUI Agents via Critic-Guided Self-Exploration

Add code
Dec 22, 2025
Viaarxiv icon

An Anatomy of Vision-Language-Action Models: From Modules to Milestones and Challenges

Add code
Dec 19, 2025
Viaarxiv icon

Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition

Add code
Nov 14, 2025
Viaarxiv icon

MrCoM: A Meta-Regularized World-Model Generalizing Across Multi-Scenarios

Add code
Nov 09, 2025
Viaarxiv icon

TinyChemVL: Advancing Chemical Vision-Language Models via Efficient Visual Token Reduction and Complex Reaction Tasks

Add code
Nov 09, 2025
Viaarxiv icon

4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos

Add code
Nov 07, 2025
Viaarxiv icon

Empowering LLMs with Parameterized Skills for Adversarial Long-Horizon Planning

Add code
Sep 16, 2025
Viaarxiv icon

SpikingBrain Technical Report: Spiking Brain-inspired Large Models

Add code
Sep 05, 2025
Viaarxiv icon

Self-Guided Function Calling in Large Language Models via Stepwise Experience Recall

Add code
Aug 21, 2025
Viaarxiv icon

SC2Arena and StarEvolve: Benchmark and Self-Improvement Framework for LLMs in Complex Decision-Making Tasks

Add code
Aug 14, 2025
Viaarxiv icon