Picture for Zhenheng Tang

Zhenheng Tang

Dissecting Outlier Dynamics in LLM NVFP4 Pretraining

Add code
Feb 02, 2026
Viaarxiv icon

On the Spectral Flattening of Quantized Embeddings

Add code
Feb 01, 2026
Viaarxiv icon

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Add code
Jan 14, 2026
Viaarxiv icon

CloneMem: Benchmarking Long-Term Memory for AI Clones

Add code
Jan 11, 2026
Viaarxiv icon

Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism

Add code
Dec 25, 2025
Figure 1 for Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism
Figure 2 for Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism
Figure 3 for Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism
Figure 4 for Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism
Viaarxiv icon

Octopus: Agentic Multimodal Reasoning with Six-Capability Orchestration

Add code
Nov 19, 2025
Viaarxiv icon

GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging

Add code
Aug 26, 2025
Figure 1 for GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging
Figure 2 for GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging
Figure 3 for GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging
Figure 4 for GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging
Viaarxiv icon

AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models

Add code
Jun 24, 2025
Viaarxiv icon

Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression

Add code
May 26, 2025
Figure 1 for Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
Figure 2 for Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
Figure 3 for Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
Figure 4 for Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
Viaarxiv icon

Assessing Judging Bias in Large Reasoning Models: An Empirical Study

Add code
Apr 14, 2025
Figure 1 for Assessing Judging Bias in Large Reasoning Models: An Empirical Study
Figure 2 for Assessing Judging Bias in Large Reasoning Models: An Empirical Study
Figure 3 for Assessing Judging Bias in Large Reasoning Models: An Empirical Study
Figure 4 for Assessing Judging Bias in Large Reasoning Models: An Empirical Study
Viaarxiv icon