Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shunsuke Kando

Syntactic Learnability of Echo State Neural Language Models at Scale

Mar 03, 2025

Ryo Ueda, Tatsuki Kuribayashi, Shunsuke Kando, Kentaro Inui

Abstract:What is a neural model with minimum architectural complexity that exhibits reasonable language learning capability? To explore such a simple but sufficient neural language model, we revisit a basic reservoir computing (RC) model, Echo State Network (ESN), a restricted class of simple Recurrent Neural Networks. Our experiments showed that ESN with a large hidden state is comparable or superior to Transformer in grammaticality judgment tasks when trained with about 100M words, suggesting that architectures as complex as that of Transformer may not always be necessary for syntactic learning.

* 10 pages

Via

Access Paper or Ask Questions

Textless Dependency Parsing by Labeled Sequence Prediction

Jul 14, 2024

Shunsuke Kando, Yusuke Miyao, Jason Naradowsky, Shinnosuke Takamichi

Abstract:Traditional spoken language processing involves cascading an automatic speech recognition (ASR) system into text processing models. In contrast, "textless" methods process speech representations without ASR systems, enabling the direct use of acoustic speech features. Although their effectiveness is shown in capturing acoustic features, it is unclear in capturing lexical knowledge. This paper proposes a textless method for dependency parsing, examining its effectiveness and limitations. Our proposed method predicts a dependency tree from a speech signal without transcribing, representing the tree as a labeled sequence. scading method outperforms the textless method in overall parsing accuracy, the latter excels in instances with important acoustic features. Our findings highlight the importance of fusing word-level representations and sentence-level prosody for enhanced parsing performance. The code and models are made publicly available: https://github.com/mynlp/SpeechParser.

* Accepted to Interspeech 2024

Via

Access Paper or Ask Questions

Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion

Apr 19, 2022

Shunsuke Kando, Hiroshi Noji, Yusuke Miyao

Figure 1 for Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion

Figure 2 for Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion

Figure 3 for Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion

Figure 4 for Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion

Abstract:Incorporating stronger syntactic biases into neural language models (LMs) is a long-standing goal, but research in this area often focuses on modeling English text, where constituent treebanks are readily available. Extending constituent tree-based LMs to the multilingual setting, where dependency treebanks are more common, is possible via dependency-to-constituency conversion methods. However, this raises the question of which tree formats are best for learning the model, and for which languages. We investigate this question by training recurrent neural network grammars (RNNGs) using various conversion methods, and evaluating them empirically in a multilingual setting. We examine the effect on LM performance across nine conversion methods and five languages through seven types of syntactic tests. On average, the performance of our best model represents a 19 \% increase in accuracy over the worst choice across all languages. Our best model shows the advantage over sequential/overparameterized LMs, suggesting the positive effect of syntax injection in a multilingual setting. Our experiments highlight the importance of choosing the right tree formalism, and provide insights into making an informed decision.

* To appear in the 6th ACL Workshop on Structured Prediction for NLP (SPNLP)

Via

Access Paper or Ask Questions