This is a non-IMPACT record, meaning that access to the data is not controlled by IMPACT. For access, see the directions below.

Disclaimer:
This Resource is offered and provided outside of the IMPACT mediation framework. IMPACT and the IMPACT Coordination Council/Blackfire Technology, Inc. expressly disclaim all conditions, representations and warranties including but not limited to Resource availability, quality, accuracy, non-infringement, and non-interference. All Resource information and access is controlled by entities and under terms that are external to the IMPACT legal framework.

Summary

DS-1097
Popular Website Crawl
External Dataset
External Data Source
Internet-Wide Scan Data Repository
Unknown
Unknown
55 (lowest rank is 55)

Category & Restrictions

Other
application layer security
Unrestricted
Unknown

Description


HAR files resulting from automatically visiting 35,000 popular Web sites with Google Chrome.

This dataset is a set of HAR files resulting from a crawl of 35,000 popular Web sites. The list of Web sites was provided by SimilarWeb (similar to Alexa rank). Each of the 35,000 Web sites was visited 5 times using Google Chrome and, for each visit, we built the corresponding HAR file (see the spec at http://www.softwareishard.com/blog/har-12-spec/), containing details of all the HTTP transactions performed to render the page. The dataset is divided into European and extra-European archives. The file 'eu.zip' includes HARs of Web sites popular in Europe; the file 'extra_eu.zip' includes HARs of Web sites popular in the U.S.A., Brazil, Russia and Australia. Interesting information can be derived by analyzing the 'Cookie' and 'Set-Cookie' headers. Crawling was performed during spring 2017. Contact: martino.trevisan@politico.it
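As a rough illustration of the cookie analysis mentioned above, the following Python sketch pulls the 'Cookie' and 'Set-Cookie' headers out of one parsed HAR document. It relies only on the HAR 1.2 layout (log, entries, request/response, headers as name/value pairs). The function name cookie_headers and the sample record are hypothetical and are not part of the dataset or its tooling.

```python
from urllib.parse import urlsplit


def cookie_headers(har):
    """Yield (host, direction, value) for each Cookie / Set-Cookie header.

    `har` is a HAR 1.2 document already parsed into a dict, for example
    with json.load(). Header names are matched case-insensitively.
    """
    for entry in har.get("log", {}).get("entries", []):
        request = entry.get("request", {})
        response = entry.get("response", {})
        host = urlsplit(request.get("url", "")).hostname or ""
        for header in request.get("headers", []):
            if header.get("name", "").lower() == "cookie":
                yield host, "sent", header.get("value", "")
        for header in response.get("headers", []):
            if header.get("name", "").lower() == "set-cookie":
                yield host, "set", header.get("value", "")


# Hand-written HAR fragment for illustration only; not taken from the dataset.
sample_har = {
    "log": {
        "version": "1.2",
        "entries": [
            {
                "request": {
                    "url": "http://example.com/",
                    "headers": [{"name": "Cookie", "value": "id=1"}],
                },
                "response": {
                    "headers": [
                        {"name": "Set-Cookie", "value": "id=2; Path=/"},
                        {"name": "Content-Type", "value": "text/html"},
                    ]
                },
            }
        ],
    }
}

rows = list(cookie_headers(sample_har))
for host, direction, value in rows:
    print(host, direction, value)
```

To apply this to the archives, each HAR file would first have to be read from 'eu.zip' or 'extra_eu.zip' and parsed as JSON. The internal file layout of the archives is not described in this record, so that step is left out here.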

Additional Details

N/A
false
false
alexa internet, interactive media, communication protocol, cryptographic protocol, world wide web, application layer protocols, blog, macos web browsers, google software, transport layer security, external data source, google chrome, windows web browsers, internet protocol, uniform resource identifier, website, hypertext transfer protocol, inferlink corporation, 1097, secure communication, history of computing, digital media, cross platform web browsers, similarweb, popular website crawl, politico