URI | http://purl.tuc.gr/dl/dias/BA5E9D2B-D9DD-46CC-ABA2-A38BC3AFF5B9 | - |
Identifier | https://doi.org/10.1145/27641.28057 | - |
Language | en | - |
Size | 20 pages | en |
Title | Description and performance analysis of signature file methods | en |
Creator | Christodoulakis Stavros | en |
Creator | Χριστοδουλακης Σταυρος | el |
Creator | Christos Faloutsos | en |
Abstract | Signature files have attracted a lot of interest as an access method for text and specifically for messages in the office environment. Messages are stored sequentially in the message file, whereas their hash-coded abstractions (signatures) are stored sequentially in the signature file. To answer a query, the signature file is examined first, and many nonqualifying messages are immediately rejected. In this paper we examine the problem of designing signature extraction methods and studying their performance. We describe two old methods, generalize another one, and propose a new method and its variation. We provide exact and approximate formulas for the dependency between the false drop probability and the signature size for all the methods, and we show that the proposed method (VBC) achieves approximately ten times smaller false drop probability than the old methods, whereas it is well suited for collections of documents with variable document sizes. | en |
Type | Peer-Reviewed Journal Publication | en |
Type | Δημοσίευση σε Περιοδικό με Κριτές | el |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date | 2015-10-05 | - |
Publication Date | 1987 | - |
Bibliographic Citation | S. Christodoulakis and C. Faloutsos, "Description and performance analysis of signature file methods," ACM Trans. on Inf. Syst., vol. 5, no. 3, pp. 237-257, 1987. doi: 10.1145/27641.28057 | en |
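
The access method summarized in the abstract (scan the signature file first, reject most nonqualifying messages, then check the survivors against the message file) can be illustrated with a minimal Python sketch. It assumes a simple superimposed-coding scheme, chosen here only for illustration; it is not claimed to be any of the specific methods the paper analyzes, and it is not VBC. The names `SIG_BITS`, `BITS_PER_WORD`, `word_mask`, `signature`, and `search`, as well as the sample messages, are hypothetical.

```python
import hashlib

SIG_BITS = 64       # assumed signature width in bits
BITS_PER_WORD = 3   # assumed number of bit positions hashed per word


def word_mask(word):
    """Hash a word to a bit mask with up to BITS_PER_WORD bits set."""
    mask = 0
    for i in range(BITS_PER_WORD):
        digest = hashlib.sha256(f"{i}:{word.lower()}".encode("utf-8")).digest()
        pos = int.from_bytes(digest[:4], "big") % SIG_BITS
        mask |= 1 << pos  # positions may collide, leaving fewer bits set
    return mask


def signature(text):
    """Superimpose (bitwise OR) the masks of all words in a message."""
    sig = 0
    for word in text.split():
        sig |= word_mask(word)
    return sig


def search(query_word, messages, signatures):
    """Scan the signatures first, then verify the surviving candidates.

    Returns the qualifying messages and the number of false drops, i.e.
    messages whose signature matched although the word is absent.
    """
    q = word_mask(query_word)
    hits, false_drops = [], 0
    for msg, sig in zip(messages, signatures):
        if sig & q != q:
            continue  # some query bit is missing: the message cannot qualify
        if query_word.lower() in msg.lower().split():
            hits.append(msg)
        else:
            false_drops += 1
    return hits, false_drops


# Hypothetical message file and its sequential signature file.
messages = [
    "meeting moved to friday",
    "budget report attached",
    "friday budget review",
]
signatures = [signature(m) for m in messages]

hits, false_drops = search("budget", messages, signatures)
print(hits, false_drops)
```

A signature test of this kind never rejects a message that actually contains the query word, so the only errors are false drops. In this sketch, widening `SIG_BITS` lowers the chance of a false drop at the cost of a larger signature file, which is the kind of trade-off between false drop probability and signature size that the abstract refers to.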