Jian Cheng

DALI: A Workload-Aware Offloading Framework for Efficient MoE Inference on Local PCs

Feb 03, 2026
Via arXiv

IntraSlice: Towards High-Performance Structural Pruning with Block-Intra PCA for LLMs

Feb 02, 2026
Via arXiv

Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE

Feb 02, 2026
Via arXiv

Game-Theoretic Co-Evolution for LLM-Based Heuristic Discovery

Jan 30, 2026
Via arXiv

RoboNeuron: A Modular Framework Linking Foundation Models and ROS for Embodied AI

Dec 11, 2025
Via arXiv

Deep (Predictive) Discounted Counterfactual Regret Minimization

Nov 11, 2025
Via arXiv

DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization

Nov 06, 2025
Via arXiv

Block Rotation is All You Need for MXFP4 Quantization

Nov 06, 2025
Via arXiv

Goal-Oriented Skill Abstraction for Offline Multi-Task Reinforcement Learning

Jul 09, 2025
Via arXiv

Synergizing Reinforcement Learning and Genetic Algorithms for Neural Combinatorial Optimization

Jun 11, 2025
Via arXiv