Picture for Binghong Wu

Binghong Wu

Harmonizing Visual Text Comprehension and Generation

Add code
Jul 23, 2024
Figure 1 for Harmonizing Visual Text Comprehension and Generation
Figure 2 for Harmonizing Visual Text Comprehension and Generation
Figure 3 for Harmonizing Visual Text Comprehension and Generation
Figure 4 for Harmonizing Visual Text Comprehension and Generation
Viaarxiv icon

A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding

Add code
Jul 02, 2024
Viaarxiv icon

TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy

Add code
Jun 03, 2024
Viaarxiv icon

TextSquare: Scaling up Text-Centric Visual Instruction Tuning

Add code
Apr 19, 2024
Figure 1 for TextSquare: Scaling up Text-Centric Visual Instruction Tuning
Figure 2 for TextSquare: Scaling up Text-Centric Visual Instruction Tuning
Figure 3 for TextSquare: Scaling up Text-Centric Visual Instruction Tuning
Figure 4 for TextSquare: Scaling up Text-Centric Visual Instruction Tuning
Viaarxiv icon

Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer

Add code
Nov 23, 2023
Viaarxiv icon

Contrastive Centroid Supervision Alleviates Domain Shift in Medical Image Classification

Add code
May 31, 2022
Figure 1 for Contrastive Centroid Supervision Alleviates Domain Shift in Medical Image Classification
Figure 2 for Contrastive Centroid Supervision Alleviates Domain Shift in Medical Image Classification
Figure 3 for Contrastive Centroid Supervision Alleviates Domain Shift in Medical Image Classification
Figure 4 for Contrastive Centroid Supervision Alleviates Domain Shift in Medical Image Classification
Viaarxiv icon

Opinions Vary? Diagnosis First!

Add code
Feb 14, 2022
Figure 1 for Opinions Vary? Diagnosis First!
Figure 2 for Opinions Vary? Diagnosis First!
Figure 3 for Opinions Vary? Diagnosis First!
Figure 4 for Opinions Vary? Diagnosis First!
Viaarxiv icon

Progressive Hard-case Mining across Pyramid Levels in Object Detection

Add code
Sep 15, 2021
Figure 1 for Progressive Hard-case Mining across Pyramid Levels in Object Detection
Figure 2 for Progressive Hard-case Mining across Pyramid Levels in Object Detection
Figure 3 for Progressive Hard-case Mining across Pyramid Levels in Object Detection
Figure 4 for Progressive Hard-case Mining across Pyramid Levels in Object Detection
Viaarxiv icon

Robust Collaborative Learning of Patch-level and Image-level Annotations for Diabetic Retinopathy Grading from Fundus Image

Add code
Aug 03, 2020
Figure 1 for Robust Collaborative Learning of Patch-level and Image-level Annotations for Diabetic Retinopathy Grading from Fundus Image
Figure 2 for Robust Collaborative Learning of Patch-level and Image-level Annotations for Diabetic Retinopathy Grading from Fundus Image
Figure 3 for Robust Collaborative Learning of Patch-level and Image-level Annotations for Diabetic Retinopathy Grading from Fundus Image
Figure 4 for Robust Collaborative Learning of Patch-level and Image-level Annotations for Diabetic Retinopathy Grading from Fundus Image
Viaarxiv icon

Residual-CycleGAN based Camera Adaptation for Robust Diabetic Retinopathy Screening

Add code
Jul 31, 2020
Figure 1 for Residual-CycleGAN based Camera Adaptation for Robust Diabetic Retinopathy Screening
Figure 2 for Residual-CycleGAN based Camera Adaptation for Robust Diabetic Retinopathy Screening
Figure 3 for Residual-CycleGAN based Camera Adaptation for Robust Diabetic Retinopathy Screening
Figure 4 for Residual-CycleGAN based Camera Adaptation for Robust Diabetic Retinopathy Screening
Viaarxiv icon