Picture for Daiqing Wu

Daiqing Wu

Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts

Add code
Dec 27, 2024
Viaarxiv icon

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues

Add code
Dec 17, 2024
Figure 1 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 2 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 3 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 4 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Viaarxiv icon

Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition

Add code
Jul 09, 2024
Figure 1 for Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Figure 2 for Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Figure 3 for Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Figure 4 for Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Viaarxiv icon

Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering

Add code
Mar 24, 2022
Figure 1 for Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering
Figure 2 for Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering
Figure 3 for Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering
Figure 4 for Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering
Viaarxiv icon