To request access this dataset you will need to login with an IMPACT account. Accounts are free. If you don't have one please register.

Summary

DS-1382
content-reuse-detection
Tool
University of Southern California-Information Sciences Institute
University of Southern California-Information Sciences Institute
10/17/2018
Unknown
56 (lowest rank is 56)

Category & Restrictions

Other
intrusion detection, penetration testing, dns data
Unrestricted
true

Description


source code for content reuse detection paper

This repository contains the code and pointers to datasets used in the paper "Precise Detection of Content Reuse in the Web" by Calvin Ardi and John Heidemann.

Additional Details

N/A
true
false
detection, california, sciences, southern, institute, reuse, content, 1382, content-reuse-detection, archive, duplicate, hashing, duplicate-detection, university of southern california-information sciences institute, 2018, anonymized, code, paper, source, calvin, pointers, ardi, precise, heidemann, web, repository, datasets, other, john
hashing,archive,duplicate-detection