| Using syntactic analysis in a document retrieval system that uses signature files |
| Full text |
Pdf
(1.15 MB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Brussels, Belgium
Pages: 179 - 192
Year of Publication: 1989
ISBN:0-89791-408-2
|
|
Authors
|
|
R. Sacks-Davis
|
Department of Computer Science, Royal Melbourne Institute of Technology, GPO Box 2476V, Melbourne, VIC 3001, Australia
|
|
P. Wallis
|
Department of Computer Science, Royal Melbourne Institute of Technology, GPO Box 2476V, Melbourne, VIC 3001, Australia
|
|
R. Wilkinson
|
Department of Computer Science, Royal Melbourne Institute of Technology, GPO Box 2476V, Melbourne, VIC 3001, Australia
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 3, Downloads (12 Months): 20, Citation Count: 2
|
|
|
ABSTRACT
Our work involves the study of the extent to which natural language processing techniques aid the automatic indexing and retrieval of documents. In this paper we describe the use of signature files in large text retrieval systems. We show that good performance can be obtained without requiring the significant overheads required for the inverted file technique. We examine the use of syntactic analysis of the text in all stages of retrieval and argue that an initial Boolean query should be performed that provides a subset of documents, which are then ranked. We then give an algorithm for generating such queries, taking into account the syntactic structure of the queries.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
Croft 88
|
|
| |
Dillon 83
|
M- Dillon, A. Gray Fully Automatic Syntaz-based Indezing J. of the American Society for Information Science, Vol. 34, No. 2. March 1983, pp. 99-108
|
| |
Fagan 87
|
|
 |
Faloutsos 85
|
|
| |
Kent 88
|
|
| |
Kent 89
|
A.J. Kent, Ft. Sacks-Davis, K. Ramamohanarao A Signature File Scheme Based on Multiple Organisations for Indezing Very Large Databases To appear: Journal of American Society for Information Science.
|
| |
Metzer 89
|
D.P. Metzler, S. W. Haas, C. L. C#ic, L. H. Wheeler Constituent Object Parsing for Information Retrieval and Similar Tezt Processing Problems Journal of the American Society for Information Science, Vol. 40, No. 6, 1989, pp. 398-423
|
| |
Palmer 85
|
P. Palmer, C. Berrut Definifion of a surface syntactical parser for naferal language. In Proceedings of ACSI, Montreal, 1985
|
| |
Roberts 79
|
.C.S. Roberts Partial Match Retrieval via the Method of Superimposed Codes Proceedings of the IEEE, Vol. 67, No. 12, 1979, pp. 1624-1642
|
 |
Sacks-Davis 87
|
|
| |
Salton 85
|
|
| |
Salton 89
|
|
| |
Smeaton 88
|
A.F. Smeaton Using Parsing of Natural Language Documents as part of Document Retrieval. Ph. D. Thesis, National University of Ireland, 1988
|
Peer to Peer - Readers of this Article have also read:
-
M4: a metamodel for data preprocessing
Proceedings of the 4th ACM international workshop on Data warehousing and OLAP
Anca Vaduva
, Jörg-Uwe Kietz
, Regina Zücker
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
|