Picture for Dongsheng Li

Dongsheng Li

National University of Defense Technology, Changsha, China

Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training

Add code
Jan 13, 2025
Viaarxiv icon

Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference

Add code
Dec 25, 2024
Viaarxiv icon

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Add code
Dec 13, 2024
Viaarxiv icon

Oracle-guided Dynamic User Preference Modeling for Sequential Recommendation

Add code
Dec 01, 2024
Figure 1 for Oracle-guided Dynamic User Preference Modeling for Sequential Recommendation
Figure 2 for Oracle-guided Dynamic User Preference Modeling for Sequential Recommendation
Figure 3 for Oracle-guided Dynamic User Preference Modeling for Sequential Recommendation
Figure 4 for Oracle-guided Dynamic User Preference Modeling for Sequential Recommendation
Viaarxiv icon

Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey

Add code
Nov 08, 2024
Figure 1 for Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Figure 2 for Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Figure 3 for Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Figure 4 for Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Viaarxiv icon

Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation

Add code
Oct 07, 2024
Figure 1 for Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation
Figure 2 for Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation
Figure 3 for Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation
Figure 4 for Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation
Viaarxiv icon

How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension

Add code
Oct 04, 2024
Figure 1 for How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension
Figure 2 for How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension
Figure 3 for How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension
Figure 4 for How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension
Viaarxiv icon

Translating Mental Imaginations into Characters with Codebooks and Dynamics-Enhanced Decoding

Add code
Sep 25, 2024
Figure 1 for Translating Mental Imaginations into Characters with Codebooks and Dynamics-Enhanced Decoding
Figure 2 for Translating Mental Imaginations into Characters with Codebooks and Dynamics-Enhanced Decoding
Figure 3 for Translating Mental Imaginations into Characters with Codebooks and Dynamics-Enhanced Decoding
Figure 4 for Translating Mental Imaginations into Characters with Codebooks and Dynamics-Enhanced Decoding
Viaarxiv icon

Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling

Add code
Sep 18, 2024
Viaarxiv icon

Federated Prediction-Powered Inference from Decentralized Data

Add code
Sep 03, 2024
Figure 1 for Federated Prediction-Powered Inference from Decentralized Data
Figure 2 for Federated Prediction-Powered Inference from Decentralized Data
Figure 3 for Federated Prediction-Powered Inference from Decentralized Data
Figure 4 for Federated Prediction-Powered Inference from Decentralized Data
Viaarxiv icon