Picture for Zhi Zhang

Zhi Zhang

Dense ReLU Neural Networks for Temporal-spatial Model

Add code
Nov 15, 2024
Viaarxiv icon

Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory

Add code
Nov 01, 2024
Viaarxiv icon

SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training

Add code
Oct 20, 2024
Figure 1 for SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
Figure 2 for SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
Figure 3 for SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
Figure 4 for SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
Viaarxiv icon

DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting

Add code
Oct 15, 2024
Figure 1 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Figure 2 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Figure 3 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Figure 4 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Viaarxiv icon

MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router

Add code
Oct 15, 2024
Figure 1 for MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Figure 2 for MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Figure 3 for MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Figure 4 for MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Viaarxiv icon

Just say what you want: only-prompting self-rewarding online preference optimization

Add code
Sep 26, 2024
Viaarxiv icon

Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis

Add code
Aug 10, 2024
Figure 1 for Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
Figure 2 for Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
Figure 3 for Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
Figure 4 for Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
Viaarxiv icon

Let the Code LLM Edit Itself When You Edit the Code

Add code
Jul 03, 2024
Viaarxiv icon

Interference Cancellation Based Neural Receiver for Superimposed Pilot in Multi-Layer Transmission

Add code
Jun 27, 2024
Viaarxiv icon

ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization

Add code
May 23, 2024
Figure 1 for ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization
Figure 2 for ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization
Figure 3 for ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization
Figure 4 for ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization
Viaarxiv icon