Picture for Bohan Li

Bohan Li

LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding

Add code
Dec 24, 2024
Viaarxiv icon

Can Large Language Models Understand You Better? An MBTI Personality Detection Dataset Aligned with Population Traits

Add code
Dec 17, 2024
Viaarxiv icon

OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation

Add code
Dec 15, 2024
Viaarxiv icon

Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction

Add code
Dec 11, 2024
Figure 1 for Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction
Figure 2 for Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction
Figure 3 for Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction
Figure 4 for Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction
Viaarxiv icon

UniScene: Unified Occupancy-centric Driving Scene Generation

Add code
Dec 06, 2024
Viaarxiv icon

Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding

Add code
Oct 29, 2024
Figure 1 for Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding
Figure 2 for Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding
Figure 3 for Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding
Figure 4 for Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding
Viaarxiv icon

R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models

Add code
Oct 23, 2024
Viaarxiv icon

TAPTRv2: Attention-based Position Update Improves Tracking Any Point

Add code
Jul 23, 2024
Figure 1 for TAPTRv2: Attention-based Position Update Improves Tracking Any Point
Figure 2 for TAPTRv2: Attention-based Position Update Improves Tracking Any Point
Figure 3 for TAPTRv2: Attention-based Position Update Improves Tracking Any Point
Figure 4 for TAPTRv2: Attention-based Position Update Improves Tracking Any Point
Viaarxiv icon

On the Effectiveness of Acoustic BPE in Decoder-Only TTS

Add code
Jul 04, 2024
Viaarxiv icon

Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion

Add code
Jul 02, 2024
Viaarxiv icon