Weisi Lin

VQA$^2$: Visual Question Answering for Video Quality Assessment

Nov 06, 2024

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?

Oct 07, 2024

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs

Sep 30, 2024

Explore the Hallucination on Low-level Perception for MLLMs

Sep 15, 2024

MRSE: An Efficient Multi-modality Retrieval System for Large Scale E-commerce

Aug 27, 2024

UNQA: Unified No-Reference Quality Assessment for Audio, Image, Video, and Audio-Visual Content

Jul 29, 2024

Q-Ground: Image Quality Grounding with Large Multi-modality Models

Jul 24, 2024

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

Jul 02, 2024

DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection

Jul 02, 2024

CMC-Bench: Towards a New Paradigm of Visual Signal Compression

Jun 13, 2024