Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Panagiotis Stamatopoulos

Summarizing Reports on Evolving Events; Part I: Linear Evolution

Aug 22, 2005

Stergos D. Afantenos, Vangelis Karkaletsis, Panagiotis Stamatopoulos

Figure 1 for Summarizing Reports on Evolving Events; Part I: Linear Evolution

Figure 2 for Summarizing Reports on Evolving Events; Part I: Linear Evolution

Figure 3 for Summarizing Reports on Evolving Events; Part I: Linear Evolution

Figure 4 for Summarizing Reports on Evolving Events; Part I: Linear Evolution

Abstract:We present an approach for summarization from multiple documents which report on events that evolve through time, taking into account the different document sources. We distinguish the evolution of an event into linear and non-linear. According to our approach, each document is represented by a collection of messages which are then used in order to instantiate the cross-document relations that determine the summary content. The paper presents the summarization system that implements this approach through a case study on linear evolution.

* Edited by Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Nicolas Nicolov, and Nikolai Nikolov, Recent Advances in Natural Language Processing (RANLP 2005). Borovets, Bulgaria: INCOMA, 18-24.
* 7 pages. Published on the Conference Recent Advances in Natural Language Processing (RANLP, 2005)

Via

Access Paper or Ask Questions

Summarization from Medical Documents: A Survey

Apr 13, 2005

Stergos D. Afantenos, Vangelis Karkaletsis, Panagiotis Stamatopoulos

Figure 1 for Summarization from Medical Documents: A Survey

Abstract:Objective: The aim of this paper is to survey the recent work in medical documents summarization. Background: During the last decade, documents summarization got increasing attention by the AI research community. More recently it also attracted the interest of the medical research community as well, due to the enormous growth of information that is available to the physicians and researchers in medicine, through the large and growing number of published journals, conference proceedings, medical sites and portals on the World Wide Web, electronic medical records, etc. Methodology: This survey gives first a general background on documents summarization, presenting the factors that summarization depends upon, discussing evaluation issues and describing briefly the various types of summarization techniques. It then examines the characteristics of the medical domain through the different types of medical documents. Finally, it presents and discusses the summarization techniques used so far in the medical domain, referring to the corresponding systems and their characteristics. Discussion and conclusions: The paper discusses thoroughly the promising paths for future research in medical documents summarization. It mainly focuses on the issue of scaling to large collections of documents in various languages and from different media, on personalization issues, on portability to new sub-domains, and on the integration of summarization technology in practical applications

* Artificial Intelligence in Medicine, Volume 33, Issue 2, February 2005, Pages 157-177
* 21 pages, 4 tables

Via

Access Paper or Ask Questions

Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach

Sep 18, 2000

Ion Androutsopoulos, Georgios Paliouras, Vangelis Karkaletsis, Georgios Sakkis, Constantine D. Spyropoulos, Panagiotis Stamatopoulos

Figure 1 for Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach

Figure 2 for Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach

Figure 3 for Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach

Figure 4 for Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach

Abstract:We investigate the performance of two machine learning algorithms in the context of anti-spam filtering. The increasing volume of unsolicited bulk e-mail (spam) has generated a need for reliable anti-spam filters. Filters of this type have so far been based mostly on keyword patterns that are constructed by hand and perform poorly. The Naive Bayesian classifier has recently been suggested as an effective method to construct automatically anti-spam filters with superior performance. We investigate thoroughly the performance of the Naive Bayesian filter on a publicly available corpus, contributing towards standard benchmarks. At the same time, we compare the performance of the Naive Bayesian filter to an alternative memory-based learning approach, after introducing suitable cost-sensitive evaluation measures. Both methods achieve very accurate spam filtering, outperforming clearly the keyword-based filter of a widely used e-mail reader.

* Proceedings of the workshop "Machine Learning and Textual Information Access", 4th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD-2000), H. Zaragoza, P. Gallinari and M. Rajman (Eds.), Lyon, France, September 2000, pp. 1-13

Via

Access Paper or Ask Questions