Yi Zeng

VESTA: A Fully Automated Scenario Generation and Safety Evaluation Framework for LLM Agents

Jun 07, 2026, via arXiv

CogManip: Benchmarking Manipulative Behavior in Multi-Turn Interactions with Large Language Model

Jun 04, 2026, via arXiv

Natural Gradient Bayesian Filtering: Geometry-Aware Filter for Dynamical Systems

May 04, 2026, via arXiv

Covering-radius and Collinearity-Minimizing Pilots for Channel Estimation in TDD Systems

Apr 07, 2026, via arXiv

ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI

Feb 15, 2026, via arXiv

Light Alignment Improves LLM Safety via Model Self-Reflection with a Single Neuron

Feb 02, 2026, via arXiv

TEFormer: Structured Bidirectional Temporal Enhancement Modeling in Spiking Transformers

Jan 26, 2026, via arXiv

EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions

Jan 23, 2026, via arXiv

CogToM: A Comprehensive Theory of Mind Benchmark inspired by Human Cognition for Large Language Models

Jan 22, 2026, via arXiv

TiMem: Temporal-Hierarchical Memory Consolidation for Long-Horizon Conversational Agents

Jan 06, 2026, via arXiv