Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Leonid Peshkin

Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Jul 15, 2017

Nicholas Harvey, Vahab Mirrokni, David Karger, Virginia Savova, Leonid Peshkin

Figure 1 for Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Figure 2 for Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Figure 3 for Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Figure 4 for Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

Abstract:This paper formulates a novel problem on graphs: find the minimal subset of edges in a fully connected graph, such that the resulting graph contains all spanning trees for a set of specifed sub-graphs. This formulation is motivated by an un-supervised grammar induction problem from computational linguistics. We present a reduction to some known problems and algorithms from graph theory, provide computational complexity results, and describe an approximation algorithm.

* 11 pages 4 figures

Via

Access Paper or Ask Questions

Learning to Cooperate via Policy Search

Aug 07, 2014

Leonid Peshkin, Kee-Eung Kim, Nicolas Meuleau, Leslie Pack Kaelbling

Figure 1 for Learning to Cooperate via Policy Search

Figure 2 for Learning to Cooperate via Policy Search

Figure 3 for Learning to Cooperate via Policy Search

Figure 4 for Learning to Cooperate via Policy Search

Abstract:Cooperative games are those in which both agents share the same payoff structure. Value-based reinforcement-learning algorithms, such as variants of Q-learning, have been applied to learning cooperative games, but they only apply when the game state is completely observable to both agents. Policy search methods are a reasonable alternative to value-based methods for partially observable environments. In this paper, we provide a gradient-based distributed policy-search method for cooperative games and compare the notion of local optimum to that of Nash equilibrium. We demonstrate the effectiveness of this method experimentally in a small, partially observable simulated soccer domain.

* Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

Via

Access Paper or Ask Questions

Learning Finite-State Controllers for Partially Observable Environments

Jan 23, 2013

Nicolas Meuleau, Leonid Peshkin, Kee-Eung Kim, Leslie Pack Kaelbling

Figure 1 for Learning Finite-State Controllers for Partially Observable Environments

Figure 2 for Learning Finite-State Controllers for Partially Observable Environments

Figure 3 for Learning Finite-State Controllers for Partially Observable Environments

Figure 4 for Learning Finite-State Controllers for Partially Observable Environments

Abstract:Reactive (memoryless) policies are sufficient in completely observable Markov decision processes (MDPs), but some kind of memory is usually necessary for optimal control of a partially observable MDP. Policies with finite memory can be represented as finite-state automata. In this paper, we extend Baird and Moore's VAPS algorithm to the problem of learning general finite-state automata. Because it performs stochastic gradient descent, this algorithm can be shown to converge to a locally optimal finite-state controller. We provide the details of the algorithm and then consider the question of under what conditions stochastic gradient descent will outperform exact gradient descent. We conclude with empirical results comparing the performance of stochastic and exact gradient descent, and showing the ability of our algorithm to extract the useful information contained in the sequence of past observations to compensate for the lack of observability at each time-step.

* Appears in Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI1999)

Via

Access Paper or Ask Questions

Factored Particles for Scalable Monitoring

Dec 12, 2012

Brenda Ng, Leonid Peshkin, Avi Pfeffer

Figure 1 for Factored Particles for Scalable Monitoring

Figure 2 for Factored Particles for Scalable Monitoring

Figure 3 for Factored Particles for Scalable Monitoring

Figure 4 for Factored Particles for Scalable Monitoring

Abstract:Exact monitoring in dynamic Bayesian networks is intractable, so approximate algorithms are necessary. This paper presents a new family of approximate monitoring algorithms that combine the best qualities of the particle filtering and Boyen-Koller methods. Our algorithms maintain an approximate representation the belief state in the form of sets of factored particles, that correspond to samples of clusters of state variables. Empirical results show that our algorithms outperform both ordinary particle filtering and the Boyen-Koller algorithm on large systems.

* Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

Via

Access Paper or Ask Questions

Reinforcement Learning for Adaptive Routing

Mar 28, 2007

Leonid Peshkin, Virginia Savova

Figure 1 for Reinforcement Learning for Adaptive Routing

Figure 2 for Reinforcement Learning for Adaptive Routing

Abstract:Reinforcement learning means learning a policy--a mapping of observations into actions--based on feedback from the environment. The learning can be viewed as browsing a set of policies while evaluating them by trial through interaction with the environment. We present an application of gradient ascent algorithm for reinforcement learning to a complex domain of packet routing in network communication and compare the performance of this algorithm to other routing methods on a benchmark problem.

* In Proceedings of the Intnl Joint Conf on Neural Networks (IJCNN), 2002

Via

Access Paper or Ask Questions

Dependency Parsing with Dynamic Bayesian Network

Mar 27, 2007

Virginia Savova, Leonid Peshkin

Figure 1 for Dependency Parsing with Dynamic Bayesian Network

Figure 2 for Dependency Parsing with Dynamic Bayesian Network

Figure 3 for Dependency Parsing with Dynamic Bayesian Network

Figure 4 for Dependency Parsing with Dynamic Bayesian Network

Abstract:Exact parsing with finite state automata is deemed inappropriate because of the unbounded non-locality languages overwhelmingly exhibit. We propose a way to structure the parsing task in order to make it amenable to local classification methods. This allows us to build a Dynamic Bayesian Network which uncovers the syntactic dependency structure of English sentences. Experiments with the Wall Street Journal demonstrate that the model successfully learns from labeled data.

* In proceedings of American Association for Artificial Intelligence AAAI 2005
* 6 pages

Via

Access Paper or Ask Questions

Structure induction by lossless graph compression

Mar 27, 2007

Leonid Peshkin

Figure 1 for Structure induction by lossless graph compression

Figure 2 for Structure induction by lossless graph compression

Figure 3 for Structure induction by lossless graph compression

Figure 4 for Structure induction by lossless graph compression

Abstract:This work is motivated by the necessity to automate the discovery of structure in vast and evergrowing collection of relational data commonly represented as graphs, for example genomic networks. A novel algorithm, dubbed Graphitour, for structure induction by lossless graph compression is presented and illustrated by a clear and broadly known case of nested structure in a DNA molecule. This work extends to graphs some well established approaches to grammatical inference previously applied only to strings. The bottom-up graph compression problem is related to the maximum cardinality (non-bipartite) maximum cardinality matching problem. The algorithm accepts a variety of graph types including directed graphs and graphs with labeled nodes and arcs. The resulting structure could be used for representation and classification of graphs.

* In proceedings of the Data Compression Conference, 2007, pp 53-62, published by the IEEE Computer Society Press
* 10 pages, 7 figures, 2 tables published in Proceedings of the Data Compression Conference, 2007

Via

Access Paper or Ask Questions

Part-of-Speech Tagging with Minimal Lexicalization

Dec 27, 2003

Virginia Savova, Leonid Peshkin

Figure 1 for Part-of-Speech Tagging with Minimal Lexicalization

Figure 2 for Part-of-Speech Tagging with Minimal Lexicalization

Figure 3 for Part-of-Speech Tagging with Minimal Lexicalization

Figure 4 for Part-of-Speech Tagging with Minimal Lexicalization

Abstract:We use a Dynamic Bayesian Network to represent compactly a variety of sublexical and contextual features relevant to Part-of-Speech (PoS) tagging. The outcome is a flexible tagger (LegoTag) with state-of-the-art performance (3.6% error on a benchmark corpus). We explore the effect of eliminating redundancy and radically reducing the size of feature vocabularies. We find that a small but linguistically motivated set of suffixes results in improved cross-corpora generalization. We also show that a minimal lexicon limited to function words is sufficient to ensure reasonable performance.

* 10 pages text; 1 figure. To appear in "Current Issues in Linguistic Theory: Recent Advances in Natural Language Processing";John Benjamins Publishers, Amsterdam

Via

Access Paper or Ask Questions

Bayesian Information Extraction Network

Jun 10, 2003

Leonid Peshkin, Avi Pfeffer

Figure 1 for Bayesian Information Extraction Network

Figure 2 for Bayesian Information Extraction Network

Figure 3 for Bayesian Information Extraction Network

Figure 4 for Bayesian Information Extraction Network

Abstract:Dynamic Bayesian networks (DBNs) offer an elegant way to integrate various aspects of language in one model. Many existing algorithms developed for learning and inference in DBNs are applicable to probabilistic language modeling. To demonstrate the potential of DBNs for natural language processing, we employ a DBN in an information extraction task. We show how to assemble wealth of emerging linguistic instruments for shallow parsing, syntactic and semantic tagging, morphological decomposition, named entity recognition etc. in order to incrementally build a robust information extraction system. Our method outperforms previously published results on an established benchmark domain.

* Intl. Joint Conference on Artificial Intelligence, 2003
* 6 pages

Via

Access Paper or Ask Questions

Learning from Scarce Experience

Apr 20, 2002

Leonid Peshkin, Christian R. Shelton

Figure 1 for Learning from Scarce Experience

Figure 2 for Learning from Scarce Experience

Figure 3 for Learning from Scarce Experience

Figure 4 for Learning from Scarce Experience

Abstract:Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each change of the target policy, its value is estimated from the results of following that very policy. This requires a large number of interactions with the environment as different polices are considered. We present a family of algorithms based on likelihood ratio estimation that use data gathered when executing one policy (or collection of policies) to estimate the value of a different policy. The algorithms combine estimation and optimization stages. The former utilizes experience to build a non-parametric representation of an optimized function. The latter performs optimization on this estimate. We show positive empirical results and provide the sample complexity bound.

* 8 pages 4 figures

Via

Access Paper or Ask Questions