Picture for Runjian Chen

Runjian Chen

JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data

Add code
Mar 13, 2025
Viaarxiv icon

Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation

Add code
Mar 10, 2025
Viaarxiv icon

CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning

Add code
Dec 04, 2024
Figure 1 for CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning
Figure 2 for CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning
Figure 3 for CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning
Figure 4 for CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning
Viaarxiv icon

TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception

Add code
Dec 04, 2024
Figure 1 for TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception
Figure 2 for TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception
Figure 3 for TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception
Figure 4 for TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception
Viaarxiv icon

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Add code
Apr 24, 2024
Figure 1 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Figure 2 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Figure 3 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Figure 4 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Viaarxiv icon

Towards Implicit Prompt For Text-To-Image Models

Add code
Mar 08, 2024
Viaarxiv icon

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis

Add code
Feb 25, 2024
Viaarxiv icon

CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement

Add code
Nov 20, 2023
Figure 1 for CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement
Figure 2 for CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement
Figure 3 for CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement
Figure 4 for CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement
Viaarxiv icon

SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving

Add code
Sep 25, 2023
Figure 1 for SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving
Figure 2 for SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving
Figure 3 for SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving
Figure 4 for SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving
Viaarxiv icon

MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training

Add code
Mar 23, 2023
Figure 1 for MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
Figure 2 for MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
Figure 3 for MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
Figure 4 for MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
Viaarxiv icon