To request access this dataset you will need to login with an IMPACT account. Accounts are free. If you don't have one please register.
The cookies in this data set were gathered from crawls of the top 100K Alexa web sites conducted in November, 2013 and April, 2015. Due to page request timeouts, our Crawler successfully visited 95,220 (95,311) web sites. Note, the set of web sites that caused a timeout is likely an artifact of our crawler, however even among the top 100K Alexa web sites, downtime is not uncommon. The data set is described in detail in the paper "An Empirical Study of Web Cookies", by Cahn et al., which appeared in WWW '16.
http cookie, web security exploits, alexa internet, web crawler, world wide web, alexa, www conference, scott alfeld, university of wisconsin, cooky, www, 658, request timeout, wisconsin, international conference on world wide web, paul barford, hypertext transfer protocol, web site, track user, aaron cahn, web cookie data, site, crawler, http, user behavior