Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dania Egedi

University of Pennsylvania

Status of the XTAG System

Nov 03, 1994

Christy Doran, Dania Egedi, Beth Ann Hockey, B. Srinivas

Abstract:XTAG is an ongoing project to develop a wide-coverage grammar for English, based on the Feature-based Lexicalized Tree Adjoining Grammar (FB-LTAG) formalism. The XTAG system integrates a morphological analyzer, an N-best part-of-speech tagger, an Early-style parser and an X-window interface, along with a wide-coverage grammar for English developed using the system. This system serves as a linguist's workbench for developing FB-LTAG specifications. This paper presents a description of and recent improvements to the various components of the XTAG system. It also presents the recent performance of the wide-coverage grammar on various corpora and compares it against the performance of other wide-coverage and domain-specific grammars.

* Proceedings of TAG+3, 1994
* uuencoded compressed ps file. 4 pages

Via

Access Paper or Ask Questions

Constraining Lexical Selection Across Languages Using TAGs

Nov 03, 1994

Dania Egedi, Martha Palmer

Abstract:Lexical selection in Machine Translation consists of several related components. Two that have received a lot of attention are lexical mapping from an underlying concept or lexical item, and choosing the correct subcategorization frame based on argument structure. Because most MT applications are small or relatively domain specific, a third component of lexical selection is generally overlooked - distinguishing between lexical items that are closely related conceptually. While some MT systems have proposed using a 'world knowledge' module to decide which word is more appropriate based on various pragmatic or stylistic constraints, we are interested in seeing how much we can accomplish using a combination of syntax and lexical semantics. By using separate ontologies for each language implemented in FB-LTAGs, we are able to elegantly model the more specific and language dependent syntactic and semantic distinctions necessary to further filter the choice of the lexical item.

* ps file. 4 pages, Proceedings of TAG+3, 1994

Via

Access Paper or Ask Questions

Determining Determiner Sequencing: A Syntactic Analysis for English

Nov 03, 1994

Beth Ann Hockey, Dania Egedi

Abstract:Previous work on English determiners has primarily concentrated on their semantics or scoping properties rather than their complex ordering behavior. The little work that has been done on determiner ordering generally splits determiners into three subcategories. However, this small number of categories does not capture the finer distinctions necessary to correctly order determiners. This paper presents a syntactic account of determiner sequencing based on eight independently identified semantic features. Complex determiners, such as genitives, partitives, and determiner modifying adverbials, are also presented. This work has been implemented as part of XTAG, a wide-coverage grammar for English based in the Feature-Based, Lexicalized Tree Adjoining Grammar (FB-LTAG) formalism.

* ps file. 4 pages. Proceedings of TAG+3, 1994

Via

Access Paper or Ask Questions

A Freely Available Wide Coverage Morphological Analyzer for English

Oct 24, 1994

Daniel Karp, Yves Schabes, Martin Zaidel, Dania Egedi

Figure 1 for A Freely Available Wide Coverage Morphological Analyzer for English

Figure 2 for A Freely Available Wide Coverage Morphological Analyzer for English

Figure 3 for A Freely Available Wide Coverage Morphological Analyzer for English

Figure 4 for A Freely Available Wide Coverage Morphological Analyzer for English

Abstract:This paper presents a morphological lexicon for English that handles more than 317000 inflected forms derived from over 90000 stems. The lexicon is available in two formats. The first can be used by an implementation of a two-level processor for morphological analysis. The second, derived from the first one for efficiency reasons, consists of a disk-based database using a UNIX hash table facility. We also built an X Window tool to facilitate the maintenance and browsing of the lexicon. The package is ready to be integrated into an natural language application such as a parser through hooks written in Lisp and C.

* Proceedings of Coling 92
* uuencoded compressed ps file. 5 pages. Contact info has been upated from Coling '92 version

Via

Access Paper or Ask Questions

Korean to English Translation Using Synchronous TAGs

Oct 24, 1994

Dania Egedi, Martha Palmer, Hyun S. Park, Aravind K. Joshi

Figure 1 for Korean to English Translation Using Synchronous TAGs

Figure 2 for Korean to English Translation Using Synchronous TAGs

Figure 3 for Korean to English Translation Using Synchronous TAGs

Figure 4 for Korean to English Translation Using Synchronous TAGs

Abstract:It is often argued that accurate machine translation requires reference to contextual knowledge for the correct treatment of linguistic phenomena such as dropped arguments and accurate lexical selection. One of the historical arguments in favor of the interlingua approach has been that, since it revolves around a deep semantic representation, it is better able to handle the types of linguistic phenomena that are seen as requiring a knowledge-based approach. In this paper we present an alternative approach, exemplified by a prototype system for machine translation of English and Korean which is implemented in Synchronous TAGs. This approach is essentially transfer based, and uses semantic feature unification for accurate lexical selection of polysemous verbs. The same semantic features, when combined with a discourse model which stores previously mentioned entities, can also be used for the recovery of topicalized arguments. In this paper we concentrate on the translation of Korean to English.

* Proceedings of AMTA 94
* ps file. 8 pages

Via

Access Paper or Ask Questions

Lexicalization and Grammar Development

Oct 21, 1994

B. Srinivas, Dania Egedi, Christy Doran, Tilman Becker

Figure 1 for Lexicalization and Grammar Development

Figure 2 for Lexicalization and Grammar Development

Figure 3 for Lexicalization and Grammar Development

Figure 4 for Lexicalization and Grammar Development

Abstract:In this paper we present a fully lexicalized grammar formalism as a particularly attractive framework for the specification of natural language grammars. We discuss in detail Feature-based, Lexicalized Tree Adjoining Grammars (FB-LTAGs), a representative of the class of lexicalized grammars. We illustrate the advantages of lexicalized grammars in various contexts of natural language processing, ranging from wide-coverage grammar development to parsing and machine translation. We also present a method for compact and efficient representation of lexicalized trees.

* Proceedings of KONVENS 94, Vienna, Austria, September 1994
* ps file. English w/ German abstract. 10 pages

Via

Access Paper or Ask Questions

A Freely Available Syntactic Lexicon for English

Oct 21, 1994

Dania Egedi, Patrick Martin

Figure 1 for A Freely Available Syntactic Lexicon for English

Figure 2 for A Freely Available Syntactic Lexicon for English

Figure 3 for A Freely Available Syntactic Lexicon for English

Figure 4 for A Freely Available Syntactic Lexicon for English

Abstract:This paper presents a syntactic lexicon for English that was originally derived from the Oxford Advanced Learner's Dictionary and the Oxford Dictionary of Current Idiomatic English, and then modified and augmented by hand. There are more than 37,000 syntactic entries from all 8 parts of speech. An X-windows based tool is available for maintaining the lexicon and performing searches. C and Lisp hooks are also available so that the lexicon can be easily utilized by parsers and other programs.

* Proceedings of the International Workshop on Sharable Natural Language Resources, Nara, Japan, August 1994
* Latex file with .eps figure. 8 pages

Via

Access Paper or Ask Questions

XTAG system - A Wide Coverage Grammar for English

Oct 20, 1994

Christy Doran, Dania Egedi, Beth Ann Hockey, B. Srinivas, Martin Zaidel

Figure 1 for XTAG system - A Wide Coverage Grammar for English

Figure 2 for XTAG system - A Wide Coverage Grammar for English

Figure 3 for XTAG system - A Wide Coverage Grammar for English

Figure 4 for XTAG system - A Wide Coverage Grammar for English

Abstract:This paper presents the XTAG system, a grammar development tool based on the Tree Adjoining Grammar (TAG) formalism that includes a wide-coverage syntactic grammar for English. The various components of the system are discussed and preliminary evaluation results from the parsing of various corpora are given. Results from the comparison of XTAG against the IBM statistical parser and the Alvey Natural Language Tool parser are also given.

* Proceedings of the 15th International Conference on Computational Linguistics (COLING 94), Kyoto, Japan, August 1994, pp. 922-928
* ps file. 7 pages

Via

Access Paper or Ask Questions