Masanori Suganuma

TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos

Jan 10, 2025
Via arXiv

RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting

Dec 13, 2024
Via arXiv

Rethinking Annotation for Object Detection: Is Annotating Small-size Instances Worth Its Cost?

Dec 07, 2024
Via arXiv

Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images

Dec 03, 2024
Via arXiv

Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability

Oct 20, 2024
Via arXiv

An Improved Method for Personalizing Diffusion Models

Jul 07, 2024
Via arXiv

SBCFormer: Lightweight Network Capable of Full-size ImageNet Classification at 1 FPS on Single Board Computers

Nov 07, 2023
Via arXiv

Visual Abductive Reasoning Meets Driving Hazard Prediction: Problem Formulation and Dataset

Oct 10, 2023
Via arXiv

Contextual Affinity Distillation for Image Anomaly Detection

Jul 06, 2023
Via arXiv

That's BAD: Blind Anomaly Detection by Implicit Local Feature Clustering

Jul 06, 2023
Via arXiv