Picture for Hongkuan Zhang

Hongkuan Zhang

Optimizing multi-user sound communications in reverberating environments with acoustic reconfigurable metasurfaces

Add code
Aug 03, 2023
Viaarxiv icon

Cross-Modal Similarity-Based Curriculum Learning for Image Captioning

Add code
Dec 14, 2022
Viaarxiv icon

Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images

Add code
Dec 13, 2022
Figure 1 for Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images
Figure 2 for Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images
Figure 3 for Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images
Figure 4 for Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images
Viaarxiv icon