Picture for Wenyuan Xue

Wenyuan Xue

A Survey on Hallucination in Large Vision-Language Models

Add code
Feb 01, 2024
Viaarxiv icon

Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition

Add code
Nov 27, 2023
Figure 1 for Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition
Figure 2 for Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition
Figure 3 for Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition
Figure 4 for Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition
Viaarxiv icon

ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition

Add code
Aug 15, 2023
Viaarxiv icon

TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition

Add code
Jun 28, 2021
Figure 1 for TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition
Figure 2 for TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition
Figure 3 for TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition
Figure 4 for TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition
Viaarxiv icon