Picture for Changcun Bao

Changcun Bao

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration

Add code
Sep 03, 2023
Viaarxiv icon

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA

Add code
Apr 04, 2023
Figure 1 for Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Figure 2 for Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Figure 3 for Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Figure 4 for Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Viaarxiv icon