To request access this dataset you will need to login with an IMPACT account. Accounts are free. If you don't have one please register.

Summary

DS-1382
content-reuse-detection
Tool
University of Southern California-Information Sciences Institute
University of Southern California-Information Sciences Institute
10/17/2018
Unknown
49 (lowest rank is 49)

Category & Restrictions

Other
Unrestricted
true

Description


source code for content reuse detection paper

This repository contains the code and pointers to datasets used in the paper "Precise Detection of Content Reuse in the Web" by Calvin Ardi and John Heidemann.

Additional Details

N/A
true
false
university of southern california-information sciences institute, content-reuse-detection, hashing, source code, archive, 1382, text, duplicate detection
hashing,archive,duplicate-detection