Introduction to Google PageRank: Myths & Facts
Clearing up the confusion about what Google's PageRank algorithm is, how it works, how important it is, and how we can use data based on it.
Clearing up the confusion about what Google's PageRank algorithm is, how it works, how important it is, and how we can use data based on it.
Google’s PageRank algorithm has been around since, well, as long as Google itself. Yet there still seems to be a lot of confusion about what it is, how it works, how important it is and how we can use data based on it.
While all of those things have changed to a degree over the years, as Google as refined and developed the way it does things, the basic facts have remained constant.
Before we begin diving into PageRank, however, we need to make an important distinction between two things:
We’ll look at these two things separately.
PageRank: What is it, Exactly?
PageRank is what made Google what it is today: a heuristic analysis of the web’s link graph, or, in simpler terms, a concrete, mathematical way to ascribe importance to a web page based on the links that point to it from other web pages.
This is what initially made Google’s results so successful compared to the nine other search engines that had come before it. Although they weren’t the first to look at links, they were the first to do it in such a meaningful way.
I won’t go into great detail on exactly how it works, but if you’re interested, Matt Cutts has a decent bit of information about it as the intro to his post about why PageRank sculpting no longer works.
The algorithm itself is described in Brin and Page’s seminal paper, “Anatomy of a Large-Scale Hypertextual Search Engine.” In other words, it’s public knowledge.
However, it is also clear that PageRank has evolved a lot over the last 10 years. The basic structure is probably pretty much the same as it has ever been, but it has become much more sophisticated in many respects (for example, introducing an additional factor for detecting and valuing links based on where they are on a page, as well as on the strength of the page).
For this reason, PageRank as presented in the original paper is usually referred to as pure or classic PageRank, as opposed to its modern counterpart.
It’s also clear that Google now uses many other factors to produce the search results that you see every day. Some of them, as with PageRank, will be based solely on links. Others will be based on all sorts of other factors.
Google doesn’t reveal how important PageRank is as a part of this total mix, but while its importance has clearly diminished over time, it is still an important part of the total ranking algorithm.
But now that we’ve discussed the PageRank algorithm itself, what about the number that is derived from it?
Google is also suitably vague about this (and, remember, they don’t reveal it either: they only reveal the Toolbar PageRank), but, in essence, it is a logarithmic scale (like the Richter scale for measuring earthquakes) with a base of about 16.
So, that means that a PageRank of 2 is 16 times bigger than a PageRank of 1; a PageRank of 3 is 16 times better than a PageRank of 2, and so on. Just to give you an idea of some scales here:
So, hidden within that deceptively simple scale from zero to 10 is actually a massive range of importance. And this brings us nicely onto Toolbar PageRank.
Is Toolbar PageRank Just the Number From the PageRank Algorithm?
No. Google would like you to believe that this is the case, but most SEO professionals believe that there are other ranking factors from different algorithms mixed in with this number.
However, the main problem isn’t whether it is or isn’t just PageRank, but the fact that there is precious little information about whether it is or isn’t. There are also a number of other problems with it:
It should be clear by now that although the PageRank algorithm itself is important, Toolbar PageRank is next to useless as an SEO metric. I would recommend not using it at all as a measure of an SEO campaign’s performance (what you should be measuring we can discuss another time!).
It does have one worthwhile use, however, which is looking for manual penalties (although even in this context it should be used with caution). If you are monitoring the rank of a page over time and it suddenly drops, that is a good indication that a penalty has been applied.
Likewise, if you’re comparing a page’s PageRank with its equivalent mozRank (an indepedent figure that guesses at what a page’s PageRank really is) and the PageRank is markedly lower than the mozRank, that may also indicate a manual penalty. This can be useful when selecting link building partners.
It can also have some use for analyzing how Google perceives the importance of pages within a single site by comparing PageRanks across the domain and for looking for missed opportunities (i.e., if you have a page with a high PageRank but no traffic, it probably means that you need to work on incoming anchor text and on-page optimization for your chosen keywords). However, I wouldn’t recommend using it to compare page across different domains.
Busting Some Myths
And lastly, lets finish off by dispelling some common myths and misconceptions around PageRank:
Summary
Having at least a decent working knowledge of PageRank is vital to the working SEO, both in terms of helping with link building and understanding the distribution of PageRank scores around the web. However, as an SEO metric, outside of some very narrow circumstances, Toolbar PageRank is almost completely worthless and should be approached with extreme caution.