Picture for Rui Qian

Rui Qian

CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning

Add code
Mar 30, 2026
Viaarxiv icon

Unrewarded Exploration in Large Language Models Reveals Latent Learning from Psychology

Add code
Jan 30, 2026
Viaarxiv icon

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Add code
Sep 19, 2025
Figure 1 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 2 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 3 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 4 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Viaarxiv icon

STARC: See-Through-Wall Augmented Reality Framework for Human-Robot Collaboration in Emergency Response

Add code
Sep 19, 2025
Viaarxiv icon

Energy-Constrained Navigation for Planetary Rovers under Hybrid RTG-Solar Power

Add code
Sep 18, 2025
Viaarxiv icon

CogStream: Context-guided Streaming Video Question Answering

Add code
Jun 12, 2025
Figure 1 for CogStream: Context-guided Streaming Video Question Answering
Figure 2 for CogStream: Context-guided Streaming Video Question Answering
Figure 3 for CogStream: Context-guided Streaming Video Question Answering
Figure 4 for CogStream: Context-guided Streaming Video Question Answering
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Figure 1 for Seed1.5-VL Technical Report
Figure 2 for Seed1.5-VL Technical Report
Figure 3 for Seed1.5-VL Technical Report
Figure 4 for Seed1.5-VL Technical Report
Viaarxiv icon

FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields

Add code
Mar 15, 2025
Figure 1 for FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields
Figure 2 for FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields
Figure 3 for FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields
Figure 4 for FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields
Viaarxiv icon

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation

Add code
Mar 13, 2025
Figure 1 for DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
Figure 2 for DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
Figure 3 for DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
Figure 4 for DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
Viaarxiv icon

DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning

Add code
Mar 09, 2025
Viaarxiv icon