Picture for Ling Shao

Ling Shao

Terminus Group, Beijing, China

Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey

Add code
Nov 05, 2024
Viaarxiv icon

GWQ: Gradient-Aware Weight Quantization for Large Language Models

Add code
Oct 30, 2024
Viaarxiv icon

Historical Test-time Prompt Tuning for Vision Foundation Models

Add code
Oct 27, 2024
Viaarxiv icon

LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models

Add code
Oct 15, 2024
Viaarxiv icon

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

Add code
May 13, 2024
Viaarxiv icon

StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting

Add code
Mar 12, 2024
Viaarxiv icon

Latent Semantic Consensus For Deterministic Geometric Model Fitting

Add code
Mar 11, 2024
Figure 1 for Latent Semantic Consensus For Deterministic Geometric Model Fitting
Figure 2 for Latent Semantic Consensus For Deterministic Geometric Model Fitting
Figure 3 for Latent Semantic Consensus For Deterministic Geometric Model Fitting
Figure 4 for Latent Semantic Consensus For Deterministic Geometric Model Fitting
Viaarxiv icon

Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

Add code
Feb 05, 2024
Viaarxiv icon

Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation

Add code
Jan 15, 2024
Viaarxiv icon

Domain Adaptation for Large-Vocabulary Object Detectors

Add code
Jan 13, 2024
Viaarxiv icon