Picture for Heng Tao Shen

Heng Tao Shen

SeMv-3D: Towards Semantic and Mutil-view Consistency simultaneously for General Text-to-3D Generation with Triplane Priors

Add code
Oct 10, 2024
Viaarxiv icon

On Efficient Variants of Segment Anything Model: A Survey

Add code
Oct 07, 2024
Figure 1 for On Efficient Variants of Segment Anything Model: A Survey
Figure 2 for On Efficient Variants of Segment Anything Model: A Survey
Figure 3 for On Efficient Variants of Segment Anything Model: A Survey
Figure 4 for On Efficient Variants of Segment Anything Model: A Survey
Viaarxiv icon

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Add code
Sep 09, 2024
Figure 1 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 2 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 3 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 4 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Viaarxiv icon

VQ-Flow: Taming Normalizing Flows for Multi-Class Anomaly Detection via Hierarchical Vector Quantization

Add code
Sep 02, 2024
Viaarxiv icon

DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion

Add code
Aug 13, 2024
Figure 1 for DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
Figure 2 for DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
Figure 3 for DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
Figure 4 for DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
Viaarxiv icon

GalleryGPT: Analyzing Paintings with Large Multimodal Models

Add code
Aug 01, 2024
Viaarxiv icon

Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning

Add code
Aug 01, 2024
Figure 1 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 2 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 3 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 4 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Viaarxiv icon

Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization

Add code
May 24, 2024
Viaarxiv icon

Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning

Add code
Mar 15, 2024
Viaarxiv icon

ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement

Add code
Dec 20, 2023
Viaarxiv icon