Dongsheng Wang

Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks

Oct 13, 2024 (via arXiv)

TsCA: On the Semantic Consistency Alignment via Conditional Transport for Compositional Zero-Shot Learning

Aug 16, 2024 (via arXiv)

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

Aug 09, 2024 (via arXiv)

2M-NER: Contrastive Learning for Multilingual and Multimodal NER with Language and Modal Fusion

Apr 26, 2024 (via arXiv)

BuDDIE: A Business Document Dataset for Multi-task Information Extraction

Apr 05, 2024 (via arXiv)

Large Language Models as Financial Data Annotators: A Study on Effectiveness and Efficiency

Mar 26, 2024 (via arXiv)

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

Mar 20, 2024 (via arXiv)

DocGraphLM: Documental Graph Language Model for Information Extraction

Jan 05, 2024 (via arXiv)

DocLLM: A layout-aware generative language model for multimodal document understanding

Dec 31, 2023 (via arXiv)

Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection

Oct 22, 2023 (via arXiv)