Are Search Engines Biased towards Large Established Sites?
http://spectrum.ieee.org/print/2787
excerpt
In the midst of this debate, one kind of bias that has received much attention among technologists, as well as social and political scientists, is that in favor of “popular” sites. This stems from the PageRank algorithm, introduced by the Google founders in 1998. All major search engines today use similar techniques to identify important or prestigious pages and bubble them to the top of the results. To a first approximation, PageRank attributes importance in proportion to the number of links that a page receives from other sites. The algorithm is a bit more sophisticated than that, but this approximation turns out to be pretty good on average (cf., http://arXiv.org/cs.IR/0511016).
The notion of prestige based on link popularity is a proxy for other possible importance measures, such as traffic, expert judgment, and so on. Most people would agree that the use of prestige measures in ranking search results is a very good thing—indeed, it’s the main reason why search engines work so well and have become so popular. Moreover, PageRank is designed to mimic the browsing behavior of Web users. In the absence of better assumptions, we imagine that people follow links at random. PageRank then estimates the traffic through each site. It seems, therefore, to be just the right criterion to rank sites. Why worry then?