Picture for Dawei Leng

Dawei Leng

HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation

Add code
Oct 18, 2024
Viaarxiv icon

Qihoo-T2X: An Efficiency-Focused Diffusion Transformer via Proxy Tokens for Text-to-Any-Task

Add code
Sep 06, 2024
Viaarxiv icon

IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities

Add code
Aug 23, 2024
Figure 1 for IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities
Figure 2 for IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities
Figure 3 for IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities
Figure 4 for IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities
Viaarxiv icon

FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance

Add code
Aug 15, 2024
Figure 1 for FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance
Figure 2 for FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance
Figure 3 for FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance
Figure 4 for FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance
Viaarxiv icon

Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities

Add code
Sep 02, 2023
Viaarxiv icon

What Makes Good Open-Vocabulary Detector: A Disassembling Perspective

Add code
Sep 01, 2023
Viaarxiv icon

Zero and R2D2: A Large-scale Chinese Cross-modal Benchmark and A Vision-Language Framework

Add code
May 08, 2022
Figure 1 for Zero and R2D2: A Large-scale Chinese Cross-modal Benchmark and A Vision-Language Framework
Figure 2 for Zero and R2D2: A Large-scale Chinese Cross-modal Benchmark and A Vision-Language Framework
Figure 3 for Zero and R2D2: A Large-scale Chinese Cross-modal Benchmark and A Vision-Language Framework
Figure 4 for Zero and R2D2: A Large-scale Chinese Cross-modal Benchmark and A Vision-Language Framework
Viaarxiv icon

Heterogeneous Graph based Deep Learning for Biomedical Network Link Prediction

Add code
Feb 24, 2021
Figure 1 for Heterogeneous Graph based Deep Learning for Biomedical Network Link Prediction
Figure 2 for Heterogeneous Graph based Deep Learning for Biomedical Network Link Prediction
Figure 3 for Heterogeneous Graph based Deep Learning for Biomedical Network Link Prediction
Figure 4 for Heterogeneous Graph based Deep Learning for Biomedical Network Link Prediction
Viaarxiv icon

Sequence-based deep learning antibody design for in silico antibody affinity maturation

Add code
Feb 21, 2021
Figure 1 for Sequence-based deep learning antibody design for in silico antibody affinity maturation
Figure 2 for Sequence-based deep learning antibody design for in silico antibody affinity maturation
Figure 3 for Sequence-based deep learning antibody design for in silico antibody affinity maturation
Figure 4 for Sequence-based deep learning antibody design for in silico antibody affinity maturation
Viaarxiv icon

Real-time tracking of COVID-19 and coronavirus research updates through text mining

Add code
Feb 09, 2021
Figure 1 for Real-time tracking of COVID-19 and coronavirus research updates through text mining
Figure 2 for Real-time tracking of COVID-19 and coronavirus research updates through text mining
Figure 3 for Real-time tracking of COVID-19 and coronavirus research updates through text mining
Figure 4 for Real-time tracking of COVID-19 and coronavirus research updates through text mining
Viaarxiv icon