Picture for Heng Tao Shen

Heng Tao Shen

Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves

Add code
Dec 16, 2024
Viaarxiv icon

GT23D-Bench: A Comprehensive General Text-to-3D Generation Benchmark

Add code
Dec 13, 2024
Viaarxiv icon

SeMv-3D: Towards Semantic and Mutil-view Consistency simultaneously for General Text-to-3D Generation with Triplane Priors

Add code
Oct 10, 2024
Viaarxiv icon

On Efficient Variants of Segment Anything Model: A Survey

Add code
Oct 07, 2024
Figure 1 for On Efficient Variants of Segment Anything Model: A Survey
Figure 2 for On Efficient Variants of Segment Anything Model: A Survey
Figure 3 for On Efficient Variants of Segment Anything Model: A Survey
Figure 4 for On Efficient Variants of Segment Anything Model: A Survey
Viaarxiv icon

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Add code
Sep 09, 2024
Figure 1 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 2 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 3 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 4 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Viaarxiv icon

VQ-Flow: Taming Normalizing Flows for Multi-Class Anomaly Detection via Hierarchical Vector Quantization

Add code
Sep 02, 2024
Viaarxiv icon

DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion

Add code
Aug 13, 2024
Figure 1 for DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
Figure 2 for DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
Figure 3 for DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
Figure 4 for DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
Viaarxiv icon

Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning

Add code
Aug 01, 2024
Figure 1 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 2 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 3 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 4 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Viaarxiv icon

GalleryGPT: Analyzing Paintings with Large Multimodal Models

Add code
Aug 01, 2024
Figure 1 for GalleryGPT: Analyzing Paintings with Large Multimodal Models
Figure 2 for GalleryGPT: Analyzing Paintings with Large Multimodal Models
Figure 3 for GalleryGPT: Analyzing Paintings with Large Multimodal Models
Figure 4 for GalleryGPT: Analyzing Paintings with Large Multimodal Models
Viaarxiv icon

Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization

Add code
May 24, 2024
Viaarxiv icon