
We already talked about the effect of Digg's January algo change. The summary of that analysis is that the chances of a story getting promoted after getting 25 diggs was cut by 38% after the algo change.
Today we visit the algo change again and, let me cut the story short, the algo change had exactly the opposite of its intended effect. Let's dive in.
Let's visit Digg's original post again. This quote summarizes the reasoning:
One of the keys to getting a story promoted is diversity in Digging activity.
Basically, Digg was worried that a small group of people were getting their stories promoted too many times. That is, Digg's promotion was not as democratic as it should be. The diversity algo changes were aimed at ensuring that a diverse group of users promotes a story instead of a power user and his/her friends.
The data is from Social Alerter's database as collected via Digg's API. The data is all popular stories, divided into two groups:
The gap is meant to flank the algo change to make sure the data is collected before and after the algo's roll out.
The data looked at who submitted the popular stories, and how many popular stories each user submitted. The Before data contained 8673 submissions from 4294 users. The After data contained 5590 submissions from 1810 users.
You can download the full data set. Note that the usernames are encoded so as not to "out" anyone. That's not the point of this analysis.
As you can see, the top user in Before and After was the same person, called B1 in the data set. In the Before set, B1 submitted 214 stories, i.e. ~2.5% of all stories. In the After set, B1 submitted ~5% of all stories. This was very surprising and actually holds true throughout the two data sets.
Let's take another metric: the top 10 users. As you can see, seven of the ten top users in the Before set remained in the top 10 in the After set. So much for introducing new diversity.
But here is the bombshell: after the algo change, the top users submitted as many popular stories as they used to. In the Before set, the top ten users submitted ~12.5% of the popular stories. In the After set, the top ten users submitted ~20% of the stories.
Actually, it gets worse: the top 3 users in the Before set submitted ~6% of the popular stories, but submitted ~10.5% of the popular stories in the After set. Also, the first new user that made it into the After rankings, clocked in at rank 21; the first 20 users were already fairly-to-very active Digg users. Pray, do tell me, isn't this the opposite of what Digg wanted to achieve?
Here is another metric: if we compare the first 20 users in the Before and After list, i.e. the league table up to (but excluding) the first new user on the After list, we see another disturbing pattern. For each rank position, the average increase in popular story share is 166%. The first three ranks really broke the records: the first doubled their share of popular stories (i.e. 203% of Before), the second clocked in at 176% of Before, and the third clocked in at 170% of Before.
As you can see, there are many different metrics that point in the same direction: the algo change actually concentrated power into the hands of the top users. This is exactly the opposite of the intention of the algo change.
Coupled with the previous results about the algo change, we have a disturbing conclusion. The top users are able to get their stories promoted better but the chances of promotion are cut. This means that the 'common man', us lowly not power Diggers, are the ones who got hit the worst.
Photo above a crop from brooklyn.
Interesting...
Nice post, Pierre. Very interesting little assembly of statistics...so, the powerful became more powerful, and the unknown millions remain unknown.
Sounds very familiar, looking at it in the context of recent US politics!
Even more interesting...
Were you aware that if you're not logged in, you're not even given an option to claim an identity? You're forced into commenting anonymously. That might stand a brief review...
Joe Dolson, Accessible Web Design
Yes and I need to figure out
Yes and I need to figure out how to fix it. Drupal can be oh so annoying sometimes.
I guess Digg needs to go a
I guess Digg needs to go a long way before measuring their strategies and going for the community. It sounds like they just thought 'hey, let's do this' and did it, without even analyzing the data that you have so precisely described.
WOW thanks
Great metrics, thanks for gathering this info with something that probably would have gone under the radar for most of us.
Kyle
www.KyleHealey.com
Digg Needs A Real Competitor, Stumble That
If this is true then Digg will end up digging its own grave. Quality will suffer. My trips to Digg for browsing are declining, maybe this is the reason why. You don't need statistics to tell your brain that the content of a site is irrelevant to your life.
Dave Foreman
Thanks for the comments
Thanks for the comments everyone. I'm happy you find this stuff interesting - I thought I was being too much of a sad geek! Of course, subscribe to the RSS feed as more such posts are coming :)
Does Digg need a competitor? Everyone and everything does, yes.
Is the quality of promoted pages lower? Only you can answer that.
Are power users bad and evil? No way. Every community gets leaders or power users.
The point here is to point out that Digg's data tells a story different from the official Digg story. That's bad on many levels, which as your comments suggest, I need to blog about that too soon :)
Wow! Kickass post pic!
Nice job on that, btw. Where do you ever find such quality, appropriate images to go with your posts...? That must take some skill. Do you find that posts with accompanying images like that tend to do better?
A friend of mine found it
A friend of mine found it for me. I'm ever so grateful for his help :)
Pierre
Solid Data - Great post
Really cool that you pulled this data. Looking forward to seeing the comments roll in - maybe this is just the story your "lowly-not-power-Digg-account" needed to get over the hump (if you submit it yourself, of course ;).
Abhilash
Boutique SEO Consulting
Thanks! The data is pulled
Thanks!
The data is pulled in real-time for Social Alerter to work. I just analyze the data once in a while looking to answer interesting questions like this one. These get posted on the blog.
Check out the rest of Social Alerter. You'll see where the data is also used :)
Intriguing Study
This was quite an interesting study. I personally have concluded that the entire algo change and ensuing 'revolt' were little more than an elaborate publicity stunt geared towards encouraging more small-time Diggers to spend more time on the site, thus boosting Alexa and Compete rankings - and hopefully (for Digg) bolster their bargaining position by justifying the rumored asking price in the ongoing negotiations with Google and other prospective suitors.
Peter Egan
Post new comment