Weihong Lin

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Apr 14, 2025
Via arXiv

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Mar 06, 2025
Via arXiv

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Feb 28, 2025
Via arXiv

UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-like Documents

Jan 17, 2024
Via arXiv

A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images

Apr 17, 2023
Via arXiv

Robust Table Structure Recognition with Dynamic Queries Enhanced Detection Transformer

Mar 21, 2023
Via arXiv

Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning

Oct 03, 2022
Via arXiv

TSRFormer: Table Structure Recognition with Transformers

Aug 09, 2022
Via arXiv

DETRs with Hybrid Matching

Jul 26, 2022
Via arXiv

Robust Table Detection and Structure Recognition from Heterogeneous Document Images

Mar 17, 2022
Via arXiv