Picture for Wengyu Zhang

Wengyu Zhang

Mean of Means: A 10-dollar Solution for Human Localization with Calibration-free and Unconstrained Camera Settings

Add code
Jul 30, 2024
Viaarxiv icon

Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval

Add code
Jul 23, 2024
Figure 1 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Figure 2 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Figure 3 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Figure 4 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Viaarxiv icon

A Survey on Personalized Content Synthesis with Diffusion Models

Add code
May 09, 2024
Viaarxiv icon

A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning

Add code
Mar 22, 2024
Viaarxiv icon

Generative Active Learning for Image Synthesis Personalization

Add code
Mar 22, 2024
Viaarxiv icon