Mingrui Chen

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Feb 18, 2025

Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer

May 22, 2024

Vision Transformer with Sparse Scan Prior

May 22, 2024

RMT: Retentive Networks Meet Vision Transformers

Sep 20, 2023

Occ$^2$Net: Robust Image Matching Based on 3D Occupancy Estimation for Occluded Regions

Aug 14, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Jun 05, 2023

On the Hidden Mystery of OCR in Large Multimodal Models

May 13, 2023

ICDAR 2023 Competition on Reading the Seal Title

Apr 24, 2023

OccDepth: A Depth-Aware Method for 3D Semantic Scene Completion

Feb 27, 2023

Ternary and Binary Quantization for Improved Classification

Mar 31, 2022