Shuo Sun

MERaLiON-AudioLLM: Technical Report

Dec 13, 2024
Via arXiv

RMP-YOLO: A Robust Motion Predictor for Partially Observable Scenarios even if You Only Look Once

Sep 18, 2024
Via arXiv

MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders

Sep 10, 2024
Via arXiv

DRAMA: An Efficient End-to-end Motion Planner for Autonomous Driving with Mamba

Aug 07, 2024
Via arXiv

Table-Filling via Mean Teacher for Cross-domain Aspect Sentiment Triplet Extraction

Jul 23, 2024
Via arXiv

AudioBench: A Universal Benchmark for Audio Large Language Models

Jun 25, 2024
Via arXiv

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Jun 14, 2024
Via arXiv

ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties

May 01, 2024
Via arXiv

ControlMTR: Control-Guided Motion Transformer with Scene-Compliant Intention Points for Feasible Motion Prediction

Apr 17, 2024
Via arXiv

High-Fidelity SLAM Using Gaussian Splatting with Rendering-Guided Densification and Regularized Optimization

Mar 19, 2024
Via arXiv