Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ding Zhang

Loss-Aware Curriculum Learning for Chinese Grammatical Error Correction

Dec 31, 2024

Ding Zhang, Yangning Li, Lichen Bai, Hao Zhang, Yinghui Li, Haiye Lin, Hai-Tao Zheng, Xin Su, Zifei Shan

Abstract:Chinese grammatical error correction (CGEC) aims to detect and correct errors in the input Chinese sentences. Recently, Pre-trained Language Models (PLMS) have been employed to improve the performance. However, current approaches ignore that correction difficulty varies across different instances and treat these samples equally, enhancing the challenge of model learning. To address this problem, we propose a multi-granularity Curriculum Learning (CL) framework. Specifically, we first calculate the correction difficulty of these samples and feed them into the model from easy to hard batch by batch. Then Instance-Level CL is employed to help the model optimize in the appropriate direction automatically by regulating the loss function. Extensive experimental results and comprehensive analyses of various datasets prove the effectiveness of our method.

* ICASSP 2025

Via

Access Paper or Ask Questions

Efficient and Robust Continual Graph Learning for Graph Classification in Biology

Nov 18, 2024

Ding Zhang, Jane Downer, Can Chen, Ren Wang

Figure 1 for Efficient and Robust Continual Graph Learning for Graph Classification in Biology

Figure 2 for Efficient and Robust Continual Graph Learning for Graph Classification in Biology

Figure 3 for Efficient and Robust Continual Graph Learning for Graph Classification in Biology

Figure 4 for Efficient and Robust Continual Graph Learning for Graph Classification in Biology

Abstract:Graph classification is essential for understanding complex biological systems, where molecular structures and interactions are naturally represented as graphs. Traditional graph neural networks (GNNs) perform well on static tasks but struggle in dynamic settings due to catastrophic forgetting. We present Perturbed and Sparsified Continual Graph Learning (PSCGL), a robust and efficient continual graph learning framework for graph data classification, specifically targeting biological datasets. We introduce a perturbed sampling strategy to identify critical data points that contribute to model learning and a motif-based graph sparsification technique to reduce storage needs while maintaining performance. Additionally, our PSCGL framework inherently defends against graph backdoor attacks, which is crucial for applications in sensitive biological contexts. Extensive experiments on biological datasets demonstrate that PSCGL not only retains knowledge across tasks but also enhances the efficiency and robustness of graph classification models in biology.

Via

Access Paper or Ask Questions

Online Learning for Adaptive Probing and Scheduling in Dense WLANs

Dec 27, 2022

Tianyi Xu, Ding Zhang, Zizhan Zheng

Abstract:Existing solutions to network scheduling typically assume that the instantaneous link rates are completely known before a scheduling decision is made or consider a bandit setting where the accurate link quality is discovered only after it has been used for data transmission. In practice, the decision maker can obtain (relatively accurate) channel information, e.g., through beamforming in mmWave networks, right before data transmission. However, frequent beamforming incurs a formidable overhead in densely deployed mmWave WLANs. In this paper, we consider the important problem of throughput optimization with joint link probing and scheduling. The problem is challenging even when the link rate distributions are pre-known (the offline setting) due to the necessity of balancing the information gains from probing and the cost of reducing the data transmission opportunity. We develop an approximation algorithm with guaranteed performance when the probing decision is non-adaptive, and a dynamic programming based solution for the more challenging adaptive setting. We further extend our solutions to the online setting with unknown link rate distributions and develop a contextual-bandit based algorithm and derive its regret bound. Numerical results using data traces collected from real-world mmWave deployments demonstrate the efficiency of our solutions.

Via

Access Paper or Ask Questions

Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction

Oct 19, 2022

Shirong Ma, Yinghui Li, Rongyi Sun, Qingyu Zhou, Shulin Huang, Ding Zhang, Li Yangning, Ruiyang Liu, Zhongli Li, Yunbo Cao(+2 more)

Figure 1 for Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction

Figure 2 for Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction

Figure 3 for Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction

Figure 4 for Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction

Abstract:Chinese Grammatical Error Correction (CGEC) is both a challenging NLP task and a common application in human daily life. Recently, many data-driven approaches are proposed for the development of CGEC research. However, there are two major limitations in the CGEC field: First, the lack of high-quality annotated training corpora prevents the performance of existing CGEC models from being significantly improved. Second, the grammatical errors in widely used test sets are not made by native Chinese speakers, resulting in a significant gap between the CGEC models and the real application. In this paper, we propose a linguistic rules-based approach to construct large-scale CGEC training corpora with automatically generated grammatical errors. Additionally, we present a challenging CGEC benchmark derived entirely from errors made by native Chinese speakers in real-world scenarios. Extensive experiments and detailed analyses not only demonstrate that the training data constructed by our method effectively improves the performance of CGEC models, but also reflect that our benchmark is an excellent resource for further development of the CGEC field.

* Long paper, accepted at the Findings of EMNLP 2022

Via

Access Paper or Ask Questions

Contextual Similarity is More Valuable than Character Similarity: Curriculum Learning for Chinese Spell Checking

Jul 17, 2022

Ding Zhang, Yinghui Li, Qingyu Zhou, Shirong Ma, Yangning Li, Yunbo Cao, Hai-Tao Zheng

Figure 1 for Contextual Similarity is More Valuable than Character Similarity: Curriculum Learning for Chinese Spell Checking

Figure 2 for Contextual Similarity is More Valuable than Character Similarity: Curriculum Learning for Chinese Spell Checking

Figure 3 for Contextual Similarity is More Valuable than Character Similarity: Curriculum Learning for Chinese Spell Checking

Figure 4 for Contextual Similarity is More Valuable than Character Similarity: Curriculum Learning for Chinese Spell Checking

Abstract:Chinese Spell Checking (CSC) task aims to detect and correct Chinese spelling errors. In recent years, related researches focus on introducing the character similarity from confusion set to enhance the CSC models, ignoring the context of characters that contain richer information. To make better use of contextual similarity, we propose a simple yet effective curriculum learning framework for the CSC task. With the help of our designed model-agnostic framework, existing CSC models will be trained from easy to difficult as humans learn Chinese characters and achieve further performance improvements. Extensive experiments and detailed analyses on widely used SIGHAN datasets show that our method outperforms previous state-of-the-art methods.

Via

Access Paper or Ask Questions

Joint AP Probing and Scheduling: A Contextual Bandit Approach

Aug 13, 2021

Tianyi Xu, Ding Zhang, Parth H. Pathak, Zizhan Zheng

Figure 1 for Joint AP Probing and Scheduling: A Contextual Bandit Approach

Abstract:We consider a set of APs with unknown data rates that cooperatively serve a mobile client. The data rate of each link is i.i.d. sampled from a distribution that is unknown a priori. In contrast to traditional link scheduling problems under uncertainty, we assume that in each time step, the device can probe a subset of links before deciding which one to use. We model this problem as a contextual bandit problem with probing (CBwP) and present an efficient algorithm. We further establish the regret of our algorithm for links with Bernoulli data rates. Our CBwP model is a novel extension of the classic contextual bandit model and can potentially be applied to a large class of sequential decision-making problems that involve joint probing and play under uncertainty.

Via

Access Paper or Ask Questions