Zhenyang Li

ES-Mem: Event Segmentation-Based Memory for Long-Term Dialogue Agents

Jan 13, 2026

DiffER: Diffusion Entity-Relation Modeling for Reversal Curse in Diffusion Large Language Models

Jan 12, 2026

DiffuGR: Generative Document Retrieval with Diffusion Language Models

Nov 19, 2025

EventTracer: Fast Path Tracing-based Event Stream Rendering

Aug 25, 2025

Enhanced Velocity Field Modeling for Gaussian Video Reconstruction

Jul 31, 2025

Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning

May 21, 2025

Evaluating Model Robustness Using Adaptive Sparse L0 Regularization

Aug 28, 2024

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Jul 09, 2024

Training-free CryoET Tomogram Segmentation

Jul 08, 2024

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR

May 27, 2024