loading...
A Novel Method for Detecting Similar Documents
Big Island, Hawaii January 07-January 10
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/HICSS.2002.99403735th Annual Hawaii International Conf ...
 This Article 
 
PURCHASE ARTICLE: $0
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
DDUAC03We describe a system for rapidly determining document similarity among a set of documents obtained from an information retrieval (IR) system. We obtain a ranked list of the most important terms in each document using a rapid phrase recognizer system. We store these in a database and compute document similarity using a simple database query. If the number of terms found to not be contained in both documents is less than some predetermined threshold compared to the total number of terms in the document, these documents are determined to be very similar.
Citation:
J. Cooper, A. Coden, E. Brown, "A Novel Method for Detecting Similar Documents," hicss, vol. 4, pp.101b, 35th Annual Hawaii International Conference on System Sciences (HICSS'02)-Volume 4, 2002
Usage of this product signifies your acceptance of the Terms of Use.