Picture for Zhaowen Wang

Zhaowen Wang

WAS: Dataset and Methods for Artistic Text Segmentation

Add code
Jul 31, 2024
Figure 1 for WAS: Dataset and Methods for Artistic Text Segmentation
Figure 2 for WAS: Dataset and Methods for Artistic Text Segmentation
Figure 3 for WAS: Dataset and Methods for Artistic Text Segmentation
Figure 4 for WAS: Dataset and Methods for Artistic Text Segmentation
Viaarxiv icon

Scaling Up Video Summarization Pretraining with Large Language Models

Add code
Apr 04, 2024
Viaarxiv icon

Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation

Add code
Nov 30, 2023
Figure 1 for Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation
Figure 2 for Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation
Figure 3 for Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation
Figure 4 for Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation
Viaarxiv icon

DualVector: Unsupervised Vector Font Synthesis with Dual-Part Representation

Add code
May 17, 2023
Figure 1 for DualVector: Unsupervised Vector Font Synthesis with Dual-Part Representation
Figure 2 for DualVector: Unsupervised Vector Font Synthesis with Dual-Part Representation
Figure 3 for DualVector: Unsupervised Vector Font Synthesis with Dual-Part Representation
Figure 4 for DualVector: Unsupervised Vector Font Synthesis with Dual-Part Representation
Viaarxiv icon

Improving Diffusion Models for Scene Text Editing with Dual Encoders

Add code
Apr 12, 2023
Figure 1 for Improving Diffusion Models for Scene Text Editing with Dual Encoders
Figure 2 for Improving Diffusion Models for Scene Text Editing with Dual Encoders
Figure 3 for Improving Diffusion Models for Scene Text Editing with Dual Encoders
Figure 4 for Improving Diffusion Models for Scene Text Editing with Dual Encoders
Viaarxiv icon

Align and Attend: Multimodal Summarization with Dual Contrastive Losses

Add code
Mar 13, 2023
Viaarxiv icon

LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos

Add code
Oct 12, 2022
Figure 1 for LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos
Figure 2 for LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos
Figure 3 for LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos
Figure 4 for LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos
Viaarxiv icon

Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment

Add code
Oct 10, 2022
Figure 1 for Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment
Figure 2 for Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment
Figure 3 for Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment
Figure 4 for Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment
Viaarxiv icon

Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition

Add code
Jul 31, 2022
Figure 1 for Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition
Figure 2 for Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition
Figure 3 for Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition
Figure 4 for Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition
Viaarxiv icon

MHMS: Multimodal Hierarchical Multimedia Summarization

Add code
Apr 07, 2022
Figure 1 for MHMS: Multimodal Hierarchical Multimedia Summarization
Figure 2 for MHMS: Multimodal Hierarchical Multimedia Summarization
Figure 3 for MHMS: Multimodal Hierarchical Multimedia Summarization
Figure 4 for MHMS: Multimodal Hierarchical Multimedia Summarization
Viaarxiv icon