Picture for Yang Zhao

Yang Zhao

Frank

ShardMemo: Masked MoE Routing for Sharded Agentic LLM Memory

Add code
Jan 29, 2026
Viaarxiv icon

PhyG-MoE: A Physics-Guided Mixture-of-Experts Framework for Energy-Efficient GNSS Interference Recognition

Add code
Jan 19, 2026
Viaarxiv icon

SKANet: A Cognitive Dual-Stream Framework with Adaptive Modality Fusion for Robust Compound GNSS Interference Classification

Add code
Jan 19, 2026
Viaarxiv icon

Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration

Add code
Jan 12, 2026
Viaarxiv icon

MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization

Add code
Jan 12, 2026
Viaarxiv icon

HiSciBench: A Hierarchical Multi-disciplinary Benchmark for Scientific Intelligence from Reading to Discovery

Add code
Dec 28, 2025
Viaarxiv icon

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Add code
Dec 23, 2025
Viaarxiv icon

End-to-End Training for Autoregressive Video Diffusion via Self-Resampling

Add code
Dec 17, 2025
Figure 1 for End-to-End Training for Autoregressive Video Diffusion via Self-Resampling
Figure 2 for End-to-End Training for Autoregressive Video Diffusion via Self-Resampling
Figure 3 for End-to-End Training for Autoregressive Video Diffusion via Self-Resampling
Figure 4 for End-to-End Training for Autoregressive Video Diffusion via Self-Resampling
Viaarxiv icon

VaseVQA-3D: Benchmarking 3D VLMs on Ancient Greek Pottery

Add code
Oct 06, 2025
Figure 1 for VaseVQA-3D: Benchmarking 3D VLMs on Ancient Greek Pottery
Figure 2 for VaseVQA-3D: Benchmarking 3D VLMs on Ancient Greek Pottery
Figure 3 for VaseVQA-3D: Benchmarking 3D VLMs on Ancient Greek Pottery
Figure 4 for VaseVQA-3D: Benchmarking 3D VLMs on Ancient Greek Pottery
Viaarxiv icon

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Add code
Sep 19, 2025
Figure 1 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 2 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 3 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 4 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Viaarxiv icon