Dongcheng Zhao

Light Alignment Improves LLM Safety via Model Self-Reflection with a Single Neuron

Feb 02, 2026

TEFormer: Structured Bidirectional Temporal Enhancement Modeling in Spiking Transformers

Jan 26, 2026

Towards Reliable Evaluation of Adversarial Robustness for Spiking Neural Networks

Dec 27, 2025

Efficient LLM Safety Evaluation through Multi-Agent Debate

Nov 09, 2025

MVPBench: A Benchmark and Fine-Tuning Framework for Aligning Large Language Models with Diverse Human Values

Sep 09, 2025

PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks

May 22, 2025

STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking

May 16, 2025

Incorporating brain-inspired mechanisms for multimodal learning in artificial intelligence

May 15, 2025

Redefining Superalignment: From Weak-to-Strong Alignment to Human-AI Co-Alignment to Sustainable Symbiotic Society

Apr 24, 2025

Biologically Inspired Spiking Diffusion Model with Adaptive Lateral Selection Mechanism

Mar 31, 2025