Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

René Quiniou

LACODAM

NegPSpan: efficient extraction of negative sequential patterns with embedding constraints

Jul 25, 2018

Thomas Guyet, René Quiniou

Figure 1 for NegPSpan: efficient extraction of negative sequential patterns with embedding constraints

Figure 2 for NegPSpan: efficient extraction of negative sequential patterns with embedding constraints

Figure 3 for NegPSpan: efficient extraction of negative sequential patterns with embedding constraints

Figure 4 for NegPSpan: efficient extraction of negative sequential patterns with embedding constraints

Abstract:Mining frequent sequential patterns consists in extracting recurrent behaviors, modeled as patterns, in a big sequence dataset. Such patterns inform about which events are frequently observed in sequences, i.e. what does really happen. Sometimes, knowing that some specific event does not happen is more informative than extracting a lot of observed events. Negative sequential patterns (NSP) formulate recurrent behaviors by patterns containing both observed events and absent events. Few approaches have been proposed to mine such NSPs. In addition, the syntax and semantics of NSPs differ in the different methods which makes it difficult to compare them. This article provides a unified framework for the formulation of the syntax and the semantics of NSPs. Then, we introduce a new algorithm, NegPSpan, that extracts NSPs using a PrefixSpan depth-first scheme and enabling maxgap constraints that other approaches do not take into account. The formal framework allows for highlighting the differences between the proposed approach wrt to the methods from the literature, especially wrt the state of the art approach eNSP. Intensive experiments on synthetic and real datasets show that NegPSpan can extract meaningful NSPs and that it can process bigger datasets than eNSP thanks to significantly lower memory requirements and better computation times.

Via

Access Paper or Ask Questions

Efficiency Analysis of ASP Encodings for Sequential Pattern Mining Tasks

Nov 14, 2017

Thomas Guyet, Yves Moinard, René Quiniou, Torsten Schaub

Figure 1 for Efficiency Analysis of ASP Encodings for Sequential Pattern Mining Tasks

Figure 2 for Efficiency Analysis of ASP Encodings for Sequential Pattern Mining Tasks

Figure 3 for Efficiency Analysis of ASP Encodings for Sequential Pattern Mining Tasks

Figure 4 for Efficiency Analysis of ASP Encodings for Sequential Pattern Mining Tasks

Abstract:This article presents the use of Answer Set Programming (ASP) to mine sequential patterns. ASP is a high-level declarative logic programming paradigm for high level encoding combinatorial and optimization problem solving as well as knowledge representation and reasoning. Thus, ASP is a good candidate for implementing pattern mining with background knowledge, which has been a data mining issue for a long time. We propose encodings of the classical sequential pattern mining tasks within two representations of embeddings (fill-gaps vs skip-gaps) and for various kinds of patterns: frequent, constrained and condensed. We compare the computational performance of these encodings with each other to get a good insight into the efficiency of ASP encodings. The results show that the fill-gaps strategy is better on real problems due to lower memory consumption. Finally, compared to a constraint programming approach (CPSM), another declarative programming paradigm, our proposal showed comparable performance.

* Bruno Pinaud; Fabrice Guillet; Bruno Cremilleux; Cyril de Runz. Advances in Knowledge Discovery and Management, 7, Springer, pp.41--81, 2017, 978-3-319-65405-8

Via

Access Paper or Ask Questions

Mining relevant interval rules

Sep 11, 2017

Thomas Guyet, René Quiniou, Véronique Masson

Figure 1 for Mining relevant interval rules

Figure 2 for Mining relevant interval rules

Abstract:This article extends the method of Garriga et al. for mining relevant rules to numerical attributes by extracting interval-based pattern rules. We propose an algorithm that extracts such rules from numerical datasets using the interval-pattern approach from Kaytoue et al. This algorithm has been implemented and evaluated on real datasets.

* International Conference on Formal Concept Analysis, Jun 2017, Rennes, France. Supplementary proceedings of International Conference on Formal Concept Analysis (ICFCA)

Via

Access Paper or Ask Questions

Using Answer Set Programming for pattern mining

Sep 27, 2014

Thomas Guyet, Yves Moinard, René Quiniou

Figure 1 for Using Answer Set Programming for pattern mining

Figure 2 for Using Answer Set Programming for pattern mining

Abstract:Serial pattern mining consists in extracting the frequent sequential patterns from a unique sequence of itemsets. This paper explores the ability of a declarative language, such as Answer Set Programming (ASP), to solve this issue efficiently. We propose several ASP implementations of the frequent sequential pattern mining task: a non-incremental and an incremental resolution. The results show that the incremental resolution is more efficient than the non-incremental one, but both ASP programs are less efficient than dedicated algorithms. Nonetheless, this approach can be seen as a first step toward a generic framework for sequential pattern mining with constraints.

* Intelligence Artificielle Fondamentale (2014)

Via

Access Paper or Ask Questions

Self-adaptive web intrusion detection system

Jul 22, 2009

Thomas Guyet, René Quiniou, Wei Wang, Marie-Odile Cordier

Abstract:The evolution of the web server contents and the emergence of new kinds of intrusions make necessary the adaptation of the intrusion detection systems (IDS). Nowadays, the adaptation of the IDS requires manual -- tedious and unreactive -- actions from system administrators. In this paper, we present a self-adaptive intrusion detection system which relies on a set of local model-based diagnosers. The redundancy of diagnoses is exploited, online, by a meta-diagnoser to check the consistency of computed partial diagnoses, and to trigger the adaptation of defective diagnoser models (or signatures) in case of inconsistency. This system is applied to the intrusion detection from a stream of HTTP requests. Our results show that our system 1) detects intrusion occurrences sensitively and precisely, 2) accurately self-adapts diagnoser model, thus improving its detection accuracy.

Via

Access Paper or Ask Questions

Learning rules from multisource data for cardiac monitoring

Feb 19, 2009

Marie-Odile Cordier, Elisa Fromont, René Quiniou

Figure 1 for Learning rules from multisource data for cardiac monitoring

Figure 2 for Learning rules from multisource data for cardiac monitoring

Figure 3 for Learning rules from multisource data for cardiac monitoring

Figure 4 for Learning rules from multisource data for cardiac monitoring

Abstract:This paper formalises the concept of learning symbolic rules from multisource data in a cardiac monitoring context. Our sources, electrocardiograms and arterial blood pressure measures, describe cardiac behaviours from different viewpoints. To learn interpretable rules, we use an Inductive Logic Programming (ILP) method. We develop an original strategy to cope with the dimensionality issues caused by using this ILP technique on a rich multisource language. The results show that our method greatly improves the feasibility and the efficiency of the process while staying accurate. They also confirm the benefits of using multiple sources to improve the diagnosis of cardiac arrhythmias.

Via

Access Paper or Ask Questions