Picture for Jintao Zhang

Jintao Zhang

Accurate INT8 Training Through Dynamic Block-Level Fallback

Add code
Mar 11, 2025
Viaarxiv icon

SAGE: A Framework of Precise Retrieval for RAG

Add code
Mar 03, 2025
Viaarxiv icon

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Add code
Feb 25, 2025
Viaarxiv icon

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

Add code
Feb 03, 2025
Figure 1 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 2 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 3 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 4 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Viaarxiv icon

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Add code
Nov 17, 2024
Figure 1 for SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
Figure 2 for SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
Figure 3 for SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
Figure 4 for SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
Viaarxiv icon

FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling

Add code
Oct 17, 2024
Figure 1 for FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling
Figure 2 for FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling
Figure 3 for FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling
Figure 4 for FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling
Viaarxiv icon

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Add code
Oct 03, 2024
Figure 1 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Figure 2 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Figure 3 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Figure 4 for SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Viaarxiv icon

HMF: A Hybrid Multi-Factor Framework for Dynamic Intraoperative Hypotension Prediction

Add code
Sep 17, 2024
Viaarxiv icon

A Point-Neighborhood Learning Framework for Nasal Endoscope Image Segmentation

Add code
May 30, 2024
Viaarxiv icon

LIKO: LiDAR, Inertial, and Kinematic Odometry for Bipedal Robots

Add code
Apr 28, 2024
Figure 1 for LIKO: LiDAR, Inertial, and Kinematic Odometry for Bipedal Robots
Figure 2 for LIKO: LiDAR, Inertial, and Kinematic Odometry for Bipedal Robots
Figure 3 for LIKO: LiDAR, Inertial, and Kinematic Odometry for Bipedal Robots
Figure 4 for LIKO: LiDAR, Inertial, and Kinematic Odometry for Bipedal Robots
Viaarxiv icon