Picture for Di Liu

Di Liu

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

Add code
Feb 05, 2025
Figure 1 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 2 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 3 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 4 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Viaarxiv icon

Improved Training Technique for Latent Consistency Models

Add code
Feb 03, 2025
Figure 1 for Improved Training Technique for Latent Consistency Models
Figure 2 for Improved Training Technique for Latent Consistency Models
Figure 3 for Improved Training Technique for Latent Consistency Models
Figure 4 for Improved Training Technique for Latent Consistency Models
Viaarxiv icon

Temporal Action Localization with Cross Layer Task Decoupling and Refinement

Add code
Dec 13, 2024
Viaarxiv icon

Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision

Add code
Nov 03, 2024
Viaarxiv icon

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

Add code
Oct 10, 2024
Viaarxiv icon

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Add code
Sep 16, 2024
Figure 1 for RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
Figure 2 for RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
Figure 3 for RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
Figure 4 for RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
Viaarxiv icon

Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation

Add code
Sep 15, 2024
Viaarxiv icon

SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models

Add code
Jun 03, 2024
Figure 1 for SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Figure 2 for SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Figure 3 for SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Figure 4 for SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Viaarxiv icon

Implicit In-context Learning

Add code
May 23, 2024
Figure 1 for Implicit In-context Learning
Figure 2 for Implicit In-context Learning
Figure 3 for Implicit In-context Learning
Figure 4 for Implicit In-context Learning
Viaarxiv icon

Instantaneous Perception of Moving Objects in 3D

Add code
May 05, 2024
Viaarxiv icon