Picture for Tao Wang

Tao Wang

ScribbleVS: Scribble-Supervised Medical Image Segmentation via Dynamic Competitive Pseudo Label Selection

Add code
Nov 15, 2024
Viaarxiv icon

Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation

Add code
Nov 08, 2024
Viaarxiv icon

Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation

Add code
Nov 07, 2024
Viaarxiv icon

Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning

Add code
Nov 06, 2024
Viaarxiv icon

Exploring structure diversity in atomic resolution microscopy with graph neural networks

Add code
Oct 23, 2024
Viaarxiv icon

Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation

Add code
Oct 17, 2024
Figure 1 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Figure 2 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Figure 3 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Figure 4 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Viaarxiv icon

WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification

Add code
Sep 18, 2024
Viaarxiv icon

DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech

Add code
Sep 18, 2024
Figure 1 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Figure 2 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Figure 3 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Figure 4 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Viaarxiv icon

Text Prompt is Not Enough: Sound Event Enhanced Prompt Adapter for Target Style Audio Generation

Add code
Sep 14, 2024
Figure 1 for Text Prompt is Not Enough: Sound Event Enhanced Prompt Adapter for Target Style Audio Generation
Figure 2 for Text Prompt is Not Enough: Sound Event Enhanced Prompt Adapter for Target Style Audio Generation
Figure 3 for Text Prompt is Not Enough: Sound Event Enhanced Prompt Adapter for Target Style Audio Generation
Figure 4 for Text Prompt is Not Enough: Sound Event Enhanced Prompt Adapter for Target Style Audio Generation
Viaarxiv icon

LLM-GAN: Construct Generative Adversarial Network Through Large Language Models For Explainable Fake News Detection

Add code
Sep 03, 2024
Figure 1 for LLM-GAN: Construct Generative Adversarial Network Through Large Language Models For Explainable Fake News Detection
Figure 2 for LLM-GAN: Construct Generative Adversarial Network Through Large Language Models For Explainable Fake News Detection
Figure 3 for LLM-GAN: Construct Generative Adversarial Network Through Large Language Models For Explainable Fake News Detection
Figure 4 for LLM-GAN: Construct Generative Adversarial Network Through Large Language Models For Explainable Fake News Detection
Viaarxiv icon