Overview of fingerprinting methods for local text reuse detection

Leena Lulu, Boumediene Belkhouche, Saad Harous

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

3 Citations (Scopus)

Abstract

We overview several local text reuse detection methods based on fingerprinting techniques. We first define the context of local text reuse and situate it within the general spectrum of information retrieval in order to pinpoint its particular applicability and challenges. After a brief description of the major text reuse detection approaches, we introduce the general principles of fingerprinting algorithms from an information retrieval perspective. Three classes of fingerprinting methods (overlap, non-overlap, and randomized) are surveyed. Specific algorithms, such as k-gram, winnowing, hailstorm, DCT and hash-breaking, are described. The performance and characteristics of these algorithms are summarized based on data from the literature.
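
As an illustration of the overlap class named in the abstract, the following is a minimal sketch of k-gram hashing followed by winnowing. It is not taken from the paper: the function names, the CRC32 hash, the normalization step, and the default values of `k` and `w` are all assumptions chosen for illustration. Hailstorm, DCT, and hash-breaking are not shown.

```python
import zlib


def kgram_hashes(text, k=5):
    """Hash every overlapping k-gram of the normalized text.

    Normalization (lowercasing, dropping non-alphanumerics) and the
    CRC32 hash are illustrative assumptions, not the paper's choices.
    """
    normalized = "".join(ch.lower() for ch in text if ch.isalnum())
    return [
        zlib.crc32(normalized[i:i + k].encode("utf-8"))
        for i in range(len(normalized) - k + 1)
    ]


def winnow(hashes, w=4):
    """Select the minimum hash of each window of w consecutive hashes.

    Ties are broken by taking the rightmost minimum. Each selection is
    stored as (hash, position), so a repeated pick is recorded once.
    """
    selected = set()
    for start in range(len(hashes) - w + 1):
        window = hashes[start:start + w]
        smallest = min(window)
        # Index of the rightmost occurrence of the minimum in the window.
        offset = w - 1 - window[::-1].index(smallest)
        selected.add((smallest, start + offset))
    return selected


def fingerprint(text, k=5, w=4):
    """Return the set of winnowed hash values for a text."""
    return {h for h, _ in winnow(kgram_hashes(text, k), w)}


def containment(a, b, k=5, w=4):
    """Fraction of a's fingerprint that also appears in b's fingerprint."""
    fa, fb = fingerprint(a, k, w), fingerprint(b, k, w)
    return len(fa & fb) / len(fa) if fa else 0.0
```

Under these assumptions, two texts that share a normalized substring of at least `w + k - 1` characters have at least one selected hash in common, because the same window of `w` hashes occurs in both. A nonzero `containment(a, b)` therefore flags a candidate locally reused passage, although hash collisions can also produce matches.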

Original language: English
Title of host publication: Proceedings of the 2016 12th International Conference on Innovations in Information Technology, IIT 2016
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781509053438
Publication status: Published - Mar 16 2017
Event: 12th International Conference on Innovations in Information Technology, IIT 2016 - Al Ain, United Arab Emirates
Duration: Nov 28 2016 to Nov 29 2016

Publication series

Name: Proceedings of the 2016 12th International Conference on Innovations in Information Technology, IIT 2016

Other

Other: 12th International Conference on Innovations in Information Technology, IIT 2016
Country/Territory: United Arab Emirates
City: Al Ain
Period: 11/28/16 to 11/29/16

Keywords

  • Fingerprinting
  • Information Retrieval
  • Plagiarism Detection
  • Text Reuse

ASJC Scopus subject areas

  • Computer Science Applications
  • Hardware and Architecture
  • Information Systems
  • Computer Networks and Communications
  • Instrumentation
