Picture for Shengju Qian

Shengju Qian

Text-Animator: Controllable Visual Text Video Generation

Add code
Jun 25, 2024
Viaarxiv icon

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

Add code
Apr 23, 2024
Viaarxiv icon

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models

Add code
Mar 25, 2024
Figure 1 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Figure 2 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Figure 3 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Figure 4 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Viaarxiv icon

Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Add code
Dec 07, 2023
Figure 1 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Figure 2 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Figure 3 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Figure 4 for Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Viaarxiv icon

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Add code
Sep 21, 2023
Figure 1 for LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Figure 2 for LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Figure 3 for LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Figure 4 for LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Viaarxiv icon

TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic Segmentation

Add code
Apr 15, 2023
Figure 1 for TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic Segmentation
Figure 2 for TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic Segmentation
Figure 3 for TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic Segmentation
Figure 4 for TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic Segmentation
Viaarxiv icon

StraIT: Non-autoregressive Generation with Stratified Image Transformer

Add code
Mar 01, 2023
Viaarxiv icon

What Makes for Good Tokenizers in Vision Transformer?

Add code
Dec 21, 2022
Viaarxiv icon

Blending Anti-Aliasing into Vision Transformer

Add code
Oct 28, 2021
Figure 1 for Blending Anti-Aliasing into Vision Transformer
Figure 2 for Blending Anti-Aliasing into Vision Transformer
Figure 3 for Blending Anti-Aliasing into Vision Transformer
Figure 4 for Blending Anti-Aliasing into Vision Transformer
Viaarxiv icon

Temporal Interlacing Network

Add code
Jan 17, 2020
Figure 1 for Temporal Interlacing Network
Figure 2 for Temporal Interlacing Network
Figure 3 for Temporal Interlacing Network
Figure 4 for Temporal Interlacing Network
Viaarxiv icon