Haiming Chen

Grammar construction methods for extended deterministic expressions

Jan 24, 2023

Xiaoying Mou, Haiming Chen

Abstract: Extended regular expressions with counting and interleaving are widely used in practice. However, the theoretical study of these expressions does not yet meet practical needs. This paper develops syntax definitions for extended deterministic expressions and their subclasses, aiming to fully resolve the long-standing lack of syntax definitions for such expressions, which has been an important factor restricting their use.

* in Chinese language

An Effective Algorithm for Learning Single Occurrence Regular Expressions with Interleaving

Jun 05, 2019

Yeting Li, Haiming Chen, Xiaolan Zhang, Lingqi Zhang

Abstract: The advantages offered by the presence of a schema are numerous. However, many XML documents in practice are not accompanied by a (valid) schema, making schema inference an attractive research problem. The fundamental task in XML schema learning is inferring restricted subclasses of regular expressions. Most previous work either lacks support for interleaving or supports it only in limited form. In this paper, we first propose a new subclass, Single Occurrence Regular Expressions with Interleaving (SOIRE), which has unrestricted support for interleaving. Then, based on the single occurrence automaton and the maximum independent set, we propose an algorithm, iSOIRE, to infer SOIREs. Finally, we conduct a series of experiments on real datasets to evaluate the effectiveness of our work, comparing it with both current learning algorithms from academia and industrial tools used in practice. The results demonstrate the practicality of SOIRE and the effectiveness of iSOIRE, showing that the inferred expressions are both precise and concise.
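
For readers unfamiliar with the interleaving operator mentioned in the abstract, the following is a minimal illustrative sketch and not the iSOIRE algorithm from the paper. It assumes the usual word-level semantics of interleaving (a word is an interleaving of u and v if it merges their symbols while preserving the relative order within each) and the usual reading of "single occurrence" (each alphabet symbol appears at most once in the expression). The function names `in_interleaving` and `is_single_occurrence` are hypothetical.

```python
from functools import lru_cache


def in_interleaving(w: str, u: str, v: str) -> bool:
    """Return True if w is an interleaving (shuffle) of the words u and v."""
    if len(w) != len(u) + len(v):
        return False

    @lru_cache(maxsize=None)
    def go(i: int, j: int) -> bool:
        # i symbols of u and j symbols of v have been consumed so far.
        if i == len(u) and j == len(v):
            return True
        k = i + j  # position of the next symbol of w to explain
        if i < len(u) and u[i] == w[k] and go(i + 1, j):
            return True
        return j < len(v) and v[j] == w[k] and go(i, j + 1)

    return go(0, 0)


def is_single_occurrence(symbols) -> bool:
    """Assumed reading of 'single occurrence': no alphabet symbol repeats
    among the symbols used in an expression."""
    symbols = list(symbols)
    return len(symbols) == len(set(symbols))
```

Under these assumptions, `in_interleaving("acbd", "ab", "cd")` holds because the order a before b and c before d is preserved, whereas `in_interleaving("bacd", "ab", "cd")` does not.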

Learning Restricted Regular Expressions with Interleaving

Apr 30, 2019

Chunmei Dong, Yeting Li, Haiming Chen

Abstract: The advantages of having an XML schema for XML documents are numerous. However, many XML documents in practice are not accompanied by a schema, or by a valid one. Relax NG is a popular and powerful schema language that supports the unconstrained interleaving operator. Focusing on the inference of Relax NG, we propose a new subclass of regular expressions with interleaving and design a polynomial inference algorithm. We then conduct a series of experiments on large-scale real data and on three XML data corpora. The experimental results show that our subclass is more practical than previous ones and that the regular expressions inferred by our algorithm are more precise.
