B. Srinivas

Department of Computer and Information Science, University of Pennsylvania

Heuristics and Parse Ranking

Aug 28, 1995

B. Srinivas, Christine Doran, Seth Kulick

Abstract: There are currently two philosophies for building grammars and parsers: statistically induced grammars and wide-coverage grammars. One way to combine the strengths of both approaches is to have a wide-coverage grammar with a heuristic component which is domain-independent but whose contribution is tuned to particular domains. In this paper, we discuss a three-stage approach to disambiguation in the context of a lexicalized grammar, using a variety of domain-independent heuristic techniques. We present a training algorithm which uses hand-bracketed treebank parses to set the weights of these heuristics. We compare the performance of our grammar against the performance of the IBM statistical grammar, using both untrained and trained weights for the heuristics.

* International Workshop on Parsing Technologies (IWPT 95)
* uuencoded compressed ps file. A4 format. 10 pages

Some Novel Applications of Explanation-Based Learning to Parsing Lexicalized Tree-Adjoining Grammars

May 10, 1995

B. Srinivas, Aravind Joshi

Abstract: In this paper we present some novel applications of the Explanation-Based Learning (EBL) technique to parsing Lexicalized Tree-Adjoining Grammars. The novel aspects are (a) immediate generalization of parses in the training set, (b) generalization over recursive structures and (c) representation of generalized parses as Finite State Transducers. A highly impoverished parser called a "stapler" has also been introduced. We present experimental results using EBL for different corpora and architectures to show the effectiveness of our approach.

* ACL 1995
* uuencoded postscript file

Bootstrapping A Wide-Coverage CCG from FB-LTAG

Nov 03, 1994

Christine Doran, B. Srinivas

Abstract: A number of researchers have noted the similarities between LTAGs and CCGs. Observing this resemblance, we felt that we could make use of the wide-coverage grammar developed in the XTAG project to build a wide-coverage CCG. To our knowledge there have been no attempts to construct a large-scale CCG parser with the lexicon to support it. In this paper, we describe such a system, built by adapting various XTAG components to CCG. We find that, despite the similarities between the formalisms, certain parts of the grammatical workload are distributed differently. In addition, the flexibility of CCG derivations allows the translated grammar to handle a number of "non-constituent" constructions which the XTAG grammar cannot.

* ps file. 4 pages, Proceedings of TAG+3, 1994

Status of the XTAG System

Nov 03, 1994

Christy Doran, Dania Egedi, Beth Ann Hockey, B. Srinivas

Abstract: XTAG is an ongoing project to develop a wide-coverage grammar for English, based on the Feature-based Lexicalized Tree Adjoining Grammar (FB-LTAG) formalism. The XTAG system integrates a morphological analyzer, an N-best part-of-speech tagger, an Earley-style parser and an X-window interface, along with a wide-coverage grammar for English developed using the system. This system serves as a linguist's workbench for developing FB-LTAG specifications. This paper presents a description of and recent improvements to the various components of the XTAG system. It also presents the recent performance of the wide-coverage grammar on various corpora and compares it against the performance of other wide-coverage and domain-specific grammars.

* Proceedings of TAG+3, 1994
* uuencoded compressed ps file. 4 pages

Feature-Based TAG in place of multi-component adjunction: Computational Implications

Oct 26, 1994

B. A. Hockey, B. Srinivas

Abstract: Using feature-based Tree Adjoining Grammar (TAG), this paper presents linguistically motivated analyses of constructions claimed to require multi-component adjunction. These feature-based TAG analyses permit parsing of these constructions using an existing unification-based Earley-style TAG parser, thus obviating the need for a multi-component TAG parser without sacrificing linguistic coverage for English.

* Natural Language Processing Pacific Rim Symposium (NLPRS 93)
* ps file. 9 pages

Disambiguation of Super Parts of Speech (or Supertags): Almost Parsing

Oct 26, 1994

Aravind K. Joshi, B. Srinivas

Abstract: In a lexicalized grammar formalism such as Lexicalized Tree-Adjoining Grammar (LTAG), each lexical item is associated with at least one elementary structure (supertag) that localizes syntactic and semantic dependencies. Thus a parser for a lexicalized grammar must search a large set of supertags to choose the right ones to combine for the parse of the sentence. We present techniques for disambiguating supertags using local information such as lexical preference and local lexical dependencies. The similarity between LTAG and dependency grammars is exploited in the dependency model of supertag disambiguation. The performance results for various models of supertag disambiguation, such as unigram, trigram and dependency-based models, are presented.

* Proceedings of the 15th International Conference on Computational Linguistics (COLING 94), Kyoto, Japan, August 1994
* ps file. 8 pages

Lexicalization and Grammar Development

Oct 21, 1994

B. Srinivas, Dania Egedi, Christy Doran, Tilman Becker

Abstract: In this paper we present a fully lexicalized grammar formalism as a particularly attractive framework for the specification of natural language grammars. We discuss in detail Feature-based, Lexicalized Tree Adjoining Grammars (FB-LTAGs), a representative of the class of lexicalized grammars. We illustrate the advantages of lexicalized grammars in various contexts of natural language processing, ranging from wide-coverage grammar development to parsing and machine translation. We also present a method for compact and efficient representation of lexicalized trees.

* Proceedings of KONVENS 94, Vienna, Austria, September 1994
* ps file. English w/ German abstract. 10 pages

XTAG system - A Wide Coverage Grammar for English

Oct 20, 1994

Christy Doran, Dania Egedi, Beth Ann Hockey, B. Srinivas, Martin Zaidel

Abstract: This paper presents the XTAG system, a grammar development tool based on the Tree Adjoining Grammar (TAG) formalism that includes a wide-coverage syntactic grammar for English. The various components of the system are discussed and preliminary evaluation results from the parsing of various corpora are given. Results from the comparison of XTAG against the IBM statistical parser and the Alvey Natural Language Tool parser are also given.

* Proceedings of the 15th International Conference on Computational Linguistics (COLING 94), Kyoto, Japan, August 1994, pp. 922-928
* ps file. 7 pages
