Picture for Xiaomin Yu

Xiaomin Yu

Chain of Mindset: Reasoning with Adaptive Cognitive Modes

Add code
Feb 10, 2026
Viaarxiv icon

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Add code
Feb 06, 2026
Viaarxiv icon

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Add code
Feb 02, 2026
Viaarxiv icon

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Add code
Jan 14, 2026
Viaarxiv icon

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Add code
Jan 11, 2026
Viaarxiv icon

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Add code
May 18, 2025
Viaarxiv icon

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Add code
Mar 28, 2025
Viaarxiv icon

CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model

Add code
Mar 25, 2025
Figure 1 for CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model
Figure 2 for CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model
Figure 3 for CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model
Figure 4 for CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model
Viaarxiv icon

Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities

Add code
Apr 25, 2024
Figure 1 for Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Figure 2 for Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Figure 3 for Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Figure 4 for Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Viaarxiv icon

ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks

Add code
Feb 27, 2024
Figure 1 for ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks
Figure 2 for ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks
Figure 3 for ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks
Figure 4 for ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks
Viaarxiv icon