Picture for Bing Li

Bing Li

Helen

NuWa: Deriving Lightweight Task-Specific Vision Transformers for Edge Devices

Add code
Apr 04, 2025
Viaarxiv icon

Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios

Add code
Mar 31, 2025
Viaarxiv icon

Can Video Diffusion Model Reconstruct 4D Geometry?

Add code
Mar 27, 2025
Viaarxiv icon

Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models

Add code
Feb 03, 2025
Figure 1 for Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models
Figure 2 for Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models
Figure 3 for Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models
Figure 4 for Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models
Viaarxiv icon

Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering

Add code
Dec 24, 2024
Viaarxiv icon

WiFi CSI Based Temporal Activity Detection Via Dual Pyramid Network

Add code
Dec 19, 2024
Viaarxiv icon

Belted and Ensembled Neural Network for Linear and Nonlinear Sufficient Dimension Reduction

Add code
Dec 12, 2024
Figure 1 for Belted and Ensembled Neural Network for Linear and Nonlinear Sufficient Dimension Reduction
Figure 2 for Belted and Ensembled Neural Network for Linear and Nonlinear Sufficient Dimension Reduction
Figure 3 for Belted and Ensembled Neural Network for Linear and Nonlinear Sufficient Dimension Reduction
Figure 4 for Belted and Ensembled Neural Network for Linear and Nonlinear Sufficient Dimension Reduction
Viaarxiv icon

mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA

Add code
Nov 22, 2024
Figure 1 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 2 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 3 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 4 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Viaarxiv icon

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning

Add code
Nov 15, 2024
Figure 1 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 2 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 3 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 4 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Viaarxiv icon

DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning

Add code
Nov 13, 2024
Figure 1 for DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning
Figure 2 for DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning
Figure 3 for DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning
Figure 4 for DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning
Viaarxiv icon