Picture for Hanling Zhang

Hanling Zhang

SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer

Add code
Jan 23, 2026
Viaarxiv icon

VocabTailor: Dynamic Vocabulary Selection for Downstream Tasks in Small Language Models

Add code
Aug 21, 2025
Viaarxiv icon

VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate

Add code
Apr 16, 2025
Figure 1 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 2 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 3 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 4 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Viaarxiv icon

DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers

Add code
Mar 28, 2025
Figure 1 for DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers
Figure 2 for DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers
Figure 3 for DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers
Figure 4 for DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers
Viaarxiv icon

DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation

Add code
Feb 17, 2025
Figure 1 for DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation
Figure 2 for DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation
Figure 3 for DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation
Figure 4 for DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation
Viaarxiv icon

E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling

Add code
Dec 19, 2024
Figure 1 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Figure 2 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Figure 3 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Figure 4 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Viaarxiv icon

DiTFastAttn: Attention Compression for Diffusion Transformer Models

Add code
Jun 12, 2024
Figure 1 for DiTFastAttn: Attention Compression for Diffusion Transformer Models
Figure 2 for DiTFastAttn: Attention Compression for Diffusion Transformer Models
Figure 3 for DiTFastAttn: Attention Compression for Diffusion Transformer Models
Figure 4 for DiTFastAttn: Attention Compression for Diffusion Transformer Models
Viaarxiv icon

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Add code
Dec 02, 2021
Figure 1 for TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation
Figure 2 for TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation
Figure 3 for TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation
Figure 4 for TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation
Viaarxiv icon

Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild

Add code
Mar 18, 2021
Figure 1 for Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
Figure 2 for Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
Figure 3 for Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
Figure 4 for Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
Viaarxiv icon

A novel graph structure for salient object detection based on divergence background and compact foreground

Add code
Nov 30, 2017
Figure 1 for A novel graph structure for salient object detection based on divergence background and compact foreground
Figure 2 for A novel graph structure for salient object detection based on divergence background and compact foreground
Figure 3 for A novel graph structure for salient object detection based on divergence background and compact foreground
Figure 4 for A novel graph structure for salient object detection based on divergence background and compact foreground
Viaarxiv icon