Picture for Zuheng Ming

Zuheng Ming

Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing

Add code
Sep 13, 2024
Viaarxiv icon

Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing Images

Add code
Oct 21, 2023
Viaarxiv icon

TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language

Add code
Sep 11, 2023
Viaarxiv icon

MMFormer: Multimodal Transformer Using Multiscale Self-Attention for Remote Sensing Image Classification

Add code
Mar 23, 2023
Viaarxiv icon

Identity Documents Authentication based on Forgery Detection of Guilloche Pattern

Add code
Jun 22, 2022
Figure 1 for Identity Documents Authentication based on Forgery Detection of Guilloche Pattern
Figure 2 for Identity Documents Authentication based on Forgery Detection of Guilloche Pattern
Figure 3 for Identity Documents Authentication based on Forgery Detection of Guilloche Pattern
Figure 4 for Identity Documents Authentication based on Forgery Detection of Guilloche Pattern
Viaarxiv icon

VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification

Add code
May 24, 2022
Figure 1 for VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification
Figure 2 for VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification
Figure 3 for VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification
Figure 4 for VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification
Viaarxiv icon

ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection

Add code
Mar 14, 2022
Figure 1 for ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection
Figure 2 for ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection
Figure 3 for ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection
Figure 4 for ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection
Viaarxiv icon

Exploring Multi-Tasking Learning in Document Attribute Classification

Add code
Aug 30, 2021
Figure 1 for Exploring Multi-Tasking Learning in Document Attribute Classification
Figure 2 for Exploring Multi-Tasking Learning in Document Attribute Classification
Figure 3 for Exploring Multi-Tasking Learning in Document Attribute Classification
Figure 4 for Exploring Multi-Tasking Learning in Document Attribute Classification
Viaarxiv icon

MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document Analysis

Add code
Jul 01, 2021
Figure 1 for MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document Analysis
Figure 2 for MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document Analysis
Figure 3 for MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document Analysis
Figure 4 for MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document Analysis
Viaarxiv icon

A Survey On Anti-Spoofing Methods For Face Recognition with RGB Cameras of Generic Consumer Devices

Add code
Oct 08, 2020
Figure 1 for A Survey On Anti-Spoofing Methods For Face Recognition with RGB Cameras of Generic Consumer Devices
Figure 2 for A Survey On Anti-Spoofing Methods For Face Recognition with RGB Cameras of Generic Consumer Devices
Figure 3 for A Survey On Anti-Spoofing Methods For Face Recognition with RGB Cameras of Generic Consumer Devices
Figure 4 for A Survey On Anti-Spoofing Methods For Face Recognition with RGB Cameras of Generic Consumer Devices
Viaarxiv icon