Picture for Kai Li

Kai Li

Department of Computer Science and Technology, Tsinghua University, Beijing, China

SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion

Add code
Feb 17, 2025
Viaarxiv icon

RUN: Reversible Unfolding Network for Concealed Object Segmentation

Add code
Jan 30, 2025
Figure 1 for RUN: Reversible Unfolding Network for Concealed Object Segmentation
Figure 2 for RUN: Reversible Unfolding Network for Concealed Object Segmentation
Figure 3 for RUN: Reversible Unfolding Network for Concealed Object Segmentation
Figure 4 for RUN: Reversible Unfolding Network for Concealed Object Segmentation
Viaarxiv icon

Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification

Add code
Jan 25, 2025
Figure 1 for Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification
Figure 2 for Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification
Figure 3 for Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification
Figure 4 for Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification
Viaarxiv icon

Diffusion Models for Smarter UAVs: Decision-Making and Modeling

Add code
Jan 10, 2025
Viaarxiv icon

GDSR: Global-Detail Integration through Dual-Branch Network with Wavelet Losses for Remote Sensing Image Super-Resolution

Add code
Jan 07, 2025
Viaarxiv icon

Facial Attractiveness Prediction in Live Streaming: A New Benchmark and Multi-modal Method

Add code
Jan 05, 2025
Figure 1 for Facial Attractiveness Prediction in Live Streaming: A New Benchmark and Multi-modal Method
Figure 2 for Facial Attractiveness Prediction in Live Streaming: A New Benchmark and Multi-modal Method
Figure 3 for Facial Attractiveness Prediction in Live Streaming: A New Benchmark and Multi-modal Method
Figure 4 for Facial Attractiveness Prediction in Live Streaming: A New Benchmark and Multi-modal Method
Viaarxiv icon

LLM4AD: A Platform for Algorithm Design with Large Language Model

Add code
Dec 23, 2024
Viaarxiv icon

HES-UNet: A U-Net for Hepatic Echinococcosis Lesion Segmentation

Add code
Dec 09, 2024
Viaarxiv icon

Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation

Add code
Dec 09, 2024
Figure 1 for Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation
Figure 2 for Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation
Figure 3 for Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation
Figure 4 for Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation
Viaarxiv icon

InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences

Add code
Dec 03, 2024
Figure 1 for InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences
Figure 2 for InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences
Figure 3 for InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences
Figure 4 for InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences
Viaarxiv icon