Zechuan Li

DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights

Oct 02, 2024
Via arXiv

Beyond Full Label: Single-Point Prompt for Infrared Small Target Label Generation

Aug 15, 2024
Via arXiv

OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition

Nov 30, 2023
Via arXiv

First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment

Jun 23, 2023
Via arXiv