Maria Wang

JD

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Feb 19, 2024
Via arXiv

Towards Better Semantic Understanding of Mobile Interfaces

Oct 06, 2022
Via arXiv

ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

Sep 16, 2022
Via arXiv

PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling

Jul 06, 2021
Via arXiv