Bing Li

Helen

Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering

Dec 24, 2024
Via arXiv

WiFi CSI Based Temporal Activity Detection Via Dual Pyramid Network

Dec 19, 2024
Via arXiv

Belted and Ensembled Neural Network for Linear and Nonlinear Sufficient Dimension Reduction

Dec 12, 2024
Via arXiv

mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA

Nov 22, 2024
Via arXiv

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning

Nov 15, 2024
Via arXiv

DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning

Nov 13, 2024
Via arXiv

HSTrack: Bootstrap End-to-End Multi-Camera 3D Multi-object Tracking with Hybrid Supervision

Nov 11, 2024
Via arXiv

VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization

Nov 03, 2024
Via arXiv

SAM-Guided Masked Token Prediction for 3D Scene Understanding

Oct 17, 2024
Via arXiv

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Oct 13, 2024
Via arXiv