URI | http://purl.tuc.gr/dl/dias/B103115A-82D8-4F23-BF34-145A4765DDA5 | - |
Identifier | http://www.vldb.org/conf/1999/P22.pdf | - |
Language | en | - |
Extent | 12 pages | en |
Title | SPIRIT: Sequential pattern mining with regular expression constraints | en |
Creator | Garofalakis Minos | en |
Creator | Γαροφαλακης Μινως | el |
Creator | Rastogi Rajeev | en |
Creator | Shim Kyuseok | en |
Content Summary | Discovering sequential patterns is an important problem in data mining with a host of application domains including medicine, telecommunications, and the World Wide Web. Conventional mining systems provide users with only a very restricted mechanism (based on minimum support) for specifying patterns of interest. In this paper, we propose the use of Regular Expressions (REs) as a flexible constraint specification tool that enables user-controlled focus to be incorporated into the pattern mining process. We develop a family of novel algorithms (termed SPIRIT – Sequential Pattern mIning with Regular expressIon consTraints) for mining frequent sequential patterns that also satisfy user-specified RE constraints. The main distinguishing factor among the proposed schemes is the degree to which the RE constraints are enforced to prune the search space of patterns during computation. Our solutions provide valuable insights into the tradeoffs that arise when constraints that do not subscribe to nice properties (like anti-monotonicity) are integrated into the mining process. A quantitative exploration of these tradeoffs is conducted through an extensive experimental study on synthetic and real-life data sets. | en |
Type of Item | Δημοσίευση σε Συνέδριο | el |
Type of Item | Conference Publication | en |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date of Item | 2015-12-01 | - |
Date of Publication | 1999 | - |
Subject | Databases | en |
Subject | Data mining | en |
Bibliographic Citation | M.N. Garofalakis, R. Rastogi and K. Shim, "SPIRIT: Sequential pattern mining with regular expression constraints", in 25th VLDB Conference, September 1999, pp. 223-234. | en |