Picture for Guang Dai

Guang Dai

Unbiased General Annotated Dataset Generation

Add code
Dec 14, 2024
Viaarxiv icon

Visual Object Tracking across Diverse Data Modalities: A Review

Add code
Dec 13, 2024
Viaarxiv icon

Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing

Add code
Oct 24, 2024
Viaarxiv icon

On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs

Add code
Oct 16, 2024
Figure 1 for On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs
Figure 2 for On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs
Figure 3 for On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs
Figure 4 for On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs
Viaarxiv icon

Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery

Add code
Sep 29, 2024
Figure 1 for Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Figure 2 for Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Figure 3 for Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Figure 4 for Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Viaarxiv icon

SpotActor: Training-Free Layout-Controlled Consistent Image Generation

Add code
Sep 07, 2024
Viaarxiv icon

Disentangled Noisy Correspondence Learning

Add code
Aug 10, 2024
Figure 1 for Disentangled Noisy Correspondence Learning
Figure 2 for Disentangled Noisy Correspondence Learning
Figure 3 for Disentangled Noisy Correspondence Learning
Figure 4 for Disentangled Noisy Correspondence Learning
Viaarxiv icon

Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models

Add code
Jul 22, 2024
Figure 1 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 2 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 3 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 4 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Viaarxiv icon

Timestep-Aware Correction for Quantized Diffusion Models

Add code
Jul 04, 2024
Figure 1 for Timestep-Aware Correction for Quantized Diffusion Models
Figure 2 for Timestep-Aware Correction for Quantized Diffusion Models
Figure 3 for Timestep-Aware Correction for Quantized Diffusion Models
Figure 4 for Timestep-Aware Correction for Quantized Diffusion Models
Viaarxiv icon

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

Add code
Jun 18, 2024
Figure 1 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 2 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 3 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 4 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Viaarxiv icon