Wenhao Wang

TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation

Nov 05, 2024
Via arXiv

Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies

Oct 14, 2024
Via arXiv

KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server

Oct 10, 2024
Via arXiv

Image Copy Detection for Diffusion Models

Sep 30, 2024
Via arXiv

Localizing Memorization in SSL Vision Encoders

Sep 27, 2024
Via arXiv

MonoFormer: One Transformer for Both Diffusion and Autoregression

Sep 24, 2024
Via arXiv

AnyPattern: Towards In-context Image Copy Detection

Apr 28, 2024
Via arXiv

VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

Mar 10, 2024
Via arXiv

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

Mar 10, 2024
Via arXiv

OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning

Feb 10, 2024
Via arXiv