Yujie Wang

Data-Centric and Heterogeneity-Adaptive Sequence Parallelism for Efficient LLM Training

Dec 02, 2024

Absolute State-wise Constrained Policy Optimization: High-Probability State-wise Constraints Satisfaction

Oct 02, 2024

Efficient Multi-Task Large Model Training via Data Heterogeneity-aware Model Management

Sep 05, 2024

First Activations Matter: Training-Free Methods for Dynamic Activation in Large Language Models

Aug 21, 2024

MOYU: A Theoretical Study on Massive Over-activation Yielded Uplifts in LLMs

Jun 18, 2024

QQQ: Quality Quattuor-Bit Quantization for Large Language Models

Jun 14, 2024

DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Refocusing, Defocus Rendering and Blur Removal

May 27, 2024

LSEnet: Lorentz Structural Entropy Neural Network for Deep Graph Clustering

May 20, 2024

Dynamic Activation Pitfalls in LLaMA Models: An Empirical Study

May 15, 2024

Cross-domain Chinese Sentence Pattern Parsing

Feb 27, 2024