Picture for Xiangyu Zhang

Xiangyu Zhang

Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models

Add code
Oct 10, 2025
Viaarxiv icon

An Energy-Efficient Edge Coprocessor for Neural Rendering with Explicit Data Reuse Strategies

Add code
Oct 09, 2025
Viaarxiv icon

Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech

Add code
Sep 19, 2025
Figure 1 for Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech
Figure 2 for Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech
Figure 3 for Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech
Figure 4 for Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech
Viaarxiv icon

MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation

Add code
Aug 26, 2025
Figure 1 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Figure 2 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Figure 3 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Figure 4 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Viaarxiv icon

ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants

Add code
Aug 05, 2025
Viaarxiv icon

StepFun-Prover Preview: Let's Think and Verify Step by Step

Add code
Jul 27, 2025
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Viaarxiv icon

Holistic Tokenizer for Autoregressive Image Generation

Add code
Jul 03, 2025
Viaarxiv icon

MGC: A Compiler Framework Exploiting Compositional Blindness in Aligned LLMs for Malware Generation

Add code
Jul 02, 2025
Figure 1 for MGC: A Compiler Framework Exploiting Compositional Blindness in Aligned LLMs for Malware Generation
Figure 2 for MGC: A Compiler Framework Exploiting Compositional Blindness in Aligned LLMs for Malware Generation
Figure 3 for MGC: A Compiler Framework Exploiting Compositional Blindness in Aligned LLMs for Malware Generation
Figure 4 for MGC: A Compiler Framework Exploiting Compositional Blindness in Aligned LLMs for Malware Generation
Viaarxiv icon

Can Mixture-of-Experts Surpass Dense LLMs Under Strictly Equal Resources?

Add code
Jun 13, 2025
Viaarxiv icon