Picture for Bing Li

Bing Li

Helen

Belted and Ensembled Neural Network for Linear and Nonlinear Sufficient Dimension Reduction

Add code
Dec 12, 2024
Viaarxiv icon

mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA

Add code
Nov 22, 2024
Figure 1 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 2 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 3 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 4 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Viaarxiv icon

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning

Add code
Nov 15, 2024
Figure 1 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 2 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 3 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 4 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Viaarxiv icon

DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning

Add code
Nov 13, 2024
Figure 1 for DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning
Figure 2 for DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning
Figure 3 for DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning
Figure 4 for DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning
Viaarxiv icon

HSTrack: Bootstrap End-to-End Multi-Camera 3D Multi-object Tracking with Hybrid Supervision

Add code
Nov 11, 2024
Viaarxiv icon

VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization

Add code
Nov 03, 2024
Figure 1 for VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization
Figure 2 for VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization
Figure 3 for VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization
Figure 4 for VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization
Viaarxiv icon

SAM-Guided Masked Token Prediction for 3D Scene Understanding

Add code
Oct 17, 2024
Figure 1 for SAM-Guided Masked Token Prediction for 3D Scene Understanding
Figure 2 for SAM-Guided Masked Token Prediction for 3D Scene Understanding
Figure 3 for SAM-Guided Masked Token Prediction for 3D Scene Understanding
Figure 4 for SAM-Guided Masked Token Prediction for 3D Scene Understanding
Viaarxiv icon

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Add code
Oct 13, 2024
Figure 1 for SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Figure 2 for SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Figure 3 for SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Figure 4 for SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Viaarxiv icon

Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression

Add code
Oct 02, 2024
Viaarxiv icon

Token Caching for Diffusion Transformer Acceleration

Add code
Sep 27, 2024
Viaarxiv icon