Yuval Krymolowski

Using the Distribution of Performance for Studying Statistical NLP Systems and Corpora

Jun 20, 2001

Yuval Krymolowski

Abstract: Statistical NLP systems are frequently evaluated and compared on the basis of their performance on a single split of training and test data. Results obtained using a single split are, however, subject to sampling noise. In this paper we argue in favour of reporting a distribution of performance figures, obtained by resampling the training data, rather than a single number. The additional information from distributions can be used to make statistically quantified statements about differences across parameter settings, systems, and corpora.

* To be presented at the ACL/EACL Workshop on Evaluation for Language and Dialogue Systems
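
As a rough illustration of reporting a distribution of performance figures rather than a single number, the sketch below resamples a training set and collects one score per resample. The abstract does not say which resampling scheme is used, so bootstrap sampling (with replacement) is an assumption here; the majority-label learner, the accuracy metric, the toy data, and all function names are hypothetical and chosen only to keep the example self-contained.

```python
import random
import statistics
from collections import Counter


def train_majority(train):
    """Hypothetical toy learner: always predict the most frequent training label."""
    label, _ = Counter(y for _, y in train).most_common(1)[0]
    return lambda x: label


def accuracy(model, test):
    """Fraction of test items whose predicted label matches the gold label."""
    return sum(model(x) == y for x, y in test) / len(test)


def performance_distribution(train, test, n_resamples=200, seed=0):
    """Train on bootstrap resamples of `train` (an assumed scheme) and
    return one test score per resample."""
    rng = random.Random(seed)
    scores = []
    for _ in range(n_resamples):
        # Draw len(train) items with replacement from the training data.
        sample = [rng.choice(train) for _ in train]
        scores.append(accuracy(train_majority(sample), test))
    return scores


def percentile_interval(scores, lower=0.05, upper=0.95):
    """Simple percentile interval over the collected scores."""
    ordered = sorted(scores)
    last = len(ordered) - 1
    return ordered[int(lower * last)], ordered[int(upper * last)]


# Invented toy data: (item id, label) pairs.
train = [(i, "NP") for i in range(55)] + [(i, "O") for i in range(45)]
test = [(i, "NP") for i in range(60)] + [(i, "O") for i in range(40)]

scores = performance_distribution(train, test)
low, high = percentile_interval(scores)
print(statistics.mean(scores), (low, high))
```

Two systems or parameter settings could then be compared by looking at how much their score distributions overlap, instead of comparing two single numbers.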

Applying System Combination to Base Noun Phrase Identification

Aug 17, 2000

Erik F. Tjong Kim Sang, Walter Daelemans, Herve Dejean, Rob Koeling, Yuval Krymolowski, Vasin Punyakanok, Dan Roth

Abstract: We use seven machine learning algorithms for one task: identifying base noun phrases. The results have been processed by different system combination methods and all of these outperformed the best individual result. We have applied the seven learners with the best combinator, a majority vote of the top five systems, to a standard data set and managed to improve the best published result for this data set.

* Proceedings of COLING 2000, Saarbruecken, Germany
* 7 pages
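
The "majority vote of the top five systems" combinator can be sketched as follows. This is only a minimal illustration under stated assumptions: the abstract does not specify what unit is voted on, so per-token chunk tags (B/I/O) are assumed; ties are broken in favour of the highest-ranked system, which is also an assumption. The system names, their ranking scores, and the tag sequences are invented for the example.

```python
from collections import Counter


def majority_vote(tag_sequences):
    """Per-token majority vote over equal-length tag sequences.

    Ties go to the tag proposed by the earliest sequence in the list,
    i.e. the highest-ranked system (an assumed tie-breaking rule).
    """
    voted = []
    for tags in zip(*tag_sequences):
        counts = Counter(tags)
        best = max(counts.values())
        voted.append(next(t for t in tags if counts[t] == best))
    return voted


def combine_top_k(outputs, scores, k=5):
    """Rank systems by a held-out score and let the top k vote."""
    ranked = sorted(outputs, key=lambda name: scores[name], reverse=True)[:k]
    return majority_vote([outputs[name] for name in ranked])


# Invented outputs of seven hypothetical systems on a four-token sentence.
outputs = {
    "s1": ["B", "I", "O", "B"],
    "s2": ["B", "I", "O", "O"],
    "s3": ["B", "O", "O", "B"],
    "s4": ["B", "I", "I", "B"],
    "s5": ["O", "I", "O", "B"],
    "s6": ["O", "O", "O", "O"],
    "s7": ["O", "O", "I", "O"],
}
# Invented ranking scores, used only to pick the top five systems.
scores = {"s1": 0.93, "s2": 0.92, "s3": 0.91, "s4": 0.90,
          "s5": 0.89, "s6": 0.80, "s7": 0.79}

combined = combine_top_k(outputs, scores)
print(combined)
```

Using an odd number of voters, as in the top-five setting, avoids ties when only two tags compete; with three possible tags a tie-breaking rule is still needed.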

A Memory-Based Approach to Learning Shallow Natural Language Patterns

Apr 15, 1999

Shlomo Argamon, Ido Dagan, Yuval Krymolowski

Abstract: Recognizing shallow linguistic patterns, such as basic syntactic relationships between words, is a common task in applied natural language and text processing. The common practice is to approach this task through tedious manual definition of possible pattern structures, often in the form of regular expressions or finite automata. This paper presents a novel memory-based learning method that recognizes shallow patterns in new text based on a bracketed training corpus. The training data are stored as-is, in efficient suffix-tree data structures. Generalization is performed on-line at recognition time by comparing subsequences of the new text to positive and negative evidence in the corpus. This way, no information in the training data is lost, as can happen in other learning systems that construct a single generalized model at the time of training. The paper presents experimental results for recognizing noun phrase, subject-verb and verb-object patterns in English. Since the learning approach enables easy porting to new domains, we plan to apply it to syntactic patterns in other languages and to sub-language patterns for information extraction.

* 27 pages. This is a revised and extended version of the paper presented at COLING-ACL '98. To appear in the Journal of Experimental and Theoretical AI (JETAI) special issue on memory-based learning
