Picture for Zhenguo Li

Zhenguo Li

Adding Additional Control to One-Step Diffusion with Joint Distribution Matching

Add code
Mar 09, 2025
Viaarxiv icon

Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views

Add code
Mar 04, 2025
Viaarxiv icon

Implicit Search via Discrete Diffusion: A Study on Chess

Add code
Feb 27, 2025
Viaarxiv icon

Self-Adjust Softmax

Add code
Feb 25, 2025
Viaarxiv icon

Improved Diffusion-based Generative Model with Better Adversarial Robustness

Add code
Feb 24, 2025
Viaarxiv icon

Corrupted but Not Broken: Rethinking the Impact of Corrupted Data in Visual Instruction Tuning

Add code
Feb 18, 2025
Viaarxiv icon

Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis

Add code
Jan 30, 2025
Viaarxiv icon

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Add code
Dec 16, 2024
Figure 1 for SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
Figure 2 for SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
Figure 3 for SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
Figure 4 for SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
Viaarxiv icon

Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models

Add code
Nov 29, 2024
Figure 1 for Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models
Figure 2 for Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models
Figure 3 for Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models
Figure 4 for Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models
Viaarxiv icon

Efficient Multi-modal Large Language Models via Visual Token Grouping

Add code
Nov 26, 2024
Figure 1 for Efficient Multi-modal Large Language Models via Visual Token Grouping
Figure 2 for Efficient Multi-modal Large Language Models via Visual Token Grouping
Figure 3 for Efficient Multi-modal Large Language Models via Visual Token Grouping
Figure 4 for Efficient Multi-modal Large Language Models via Visual Token Grouping
Viaarxiv icon