How Bad is Similarweb Data?

A genuine question

Jack Bandy
2 min read · Jul 8, 2022
Photo by Etienne Girardet on Unsplash

I recently tried out Similarweb as a potential data source for my dissertation. My experience was troubling.

The title of this post is a genuine question: just how bad is Similarweb data? Who trusts data from Similarweb, and why?

As a brief example to illustrate my concern, consider the “top 20” list. With any kind of representative sample of data, it would be fairly simple to generate an accurate list of the most popular websites in the United States.

According to Similarweb, the most popular websites in the United States in March 2022 were:

  1. google.com
  2. youtube.com
  3. facebook.com
  4. yahoo.com
  5. amazon.com

According to Comscore, which uses web browsing data from a random panel of internet users in the U.S., the most popular websites in March 2022 were:

  1. youtube.com
  2. google.com
  3. facebook.com
  4. amazon.com
  5. yahoo.com

So far, not so bad: Similarweb and Comscore report the same set of five websites, just in a different order.
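To make "same set, different order" concrete, here is a minimal Python sketch that compares the two top-five lists above. The function names (`rank_map`, `compare_rankings`) and the choice of metrics (per-site rank shift and a count of swapped pairs) are my own illustration, not something either company publishes.

```python
from itertools import combinations

# Top-five lists for March 2022, copied from the two rankings above.
similarweb = ["google.com", "youtube.com", "facebook.com", "yahoo.com", "amazon.com"]
comscore = ["youtube.com", "google.com", "facebook.com", "amazon.com", "yahoo.com"]


def rank_map(sites):
    """Map each site to its 1-based position in the list."""
    return {site: position for position, site in enumerate(sites, start=1)}


def compare_rankings(a, b):
    """Compare two ranked lists (hypothetical helper, for illustration only)."""
    ranks_a, ranks_b = rank_map(a), rank_map(b)
    # Only sites present in both lists can be compared by rank.
    shared = sorted(set(ranks_a) & set(ranks_b), key=ranks_a.get)
    # Positive shift: the site sits lower in list b than in list a.
    shifts = {site: ranks_b[site] - ranks_a[site] for site in shared}
    # A pair is discordant when the two lists disagree on which site ranks higher.
    discordant = sum(
        1
        for x, y in combinations(shared, 2)
        if (ranks_a[x] - ranks_a[y]) * (ranks_b[x] - ranks_b[y]) < 0
    )
    return {
        "same_set": set(a) == set(b),
        "shifts": shifts,
        "discordant_pairs": discordant,
    }


result = compare_rankings(similarweb, comscore)
print(result)
```

For these two lists, the sets match, no site moves by more than one position, and only two of the ten possible pairs (google/youtube and yahoo/amazon) are ordered differently.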

Jack Bandy

PhD student studying AI, ethics, and media. Trying to share things I learn in plain English. 🐦 @jackbandy