Picture for Botian Shi

Botian Shi

LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking

Add code
Jan 14, 2025
Viaarxiv icon

Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback

Add code
Jan 07, 2025
Viaarxiv icon

GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training

Add code
Dec 16, 2024
Viaarxiv icon

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Add code
Dec 10, 2024
Figure 1 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Figure 2 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Figure 3 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Figure 4 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Viaarxiv icon

Chimera: Improving Generalist Model with Domain-Specific Experts

Add code
Dec 08, 2024
Viaarxiv icon

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Add code
Dec 06, 2024
Viaarxiv icon

ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving

Add code
Nov 08, 2024
Figure 1 for ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving
Figure 2 for ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving
Figure 3 for ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving
Figure 4 for ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving
Viaarxiv icon

Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

Add code
Oct 13, 2024
Figure 1 for Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Figure 2 for Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Figure 3 for Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Figure 4 for Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Viaarxiv icon

MinerU: An Open-Source Solution for Precise Document Content Extraction

Add code
Sep 27, 2024
Figure 1 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 2 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 3 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 4 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Viaarxiv icon

DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes

Add code
Sep 06, 2024
Figure 1 for DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Figure 2 for DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Figure 3 for DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Figure 4 for DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Viaarxiv icon