nofollow

nofollow is a non-standard HTML attribute value used to instruct search engines that a hyperlink should not influence the link target's ranking in the search engine's index. It is intended to reduce the effectiveness of certain types of search engine spam, thereby improving the quality of search engine results and preventing spamdexing from occurring in the first place.

1 Concept and specification
2 What nofollow is not for
3 Introduction and support
4 Interpretation by the individual search engines
5 Usage by weblog software
6 Usage on other websites
7 Repurpose for paid links
8 Nofollow to control internal PageRank flow
9 Criticism
10 See also
- 10.1 Blocking and excluding content from search engines
11 References

Concept and specification

The concept for the specification of the attribute value nofollow was designed by Google’s head of webspam team Matt Cutts and Jason Shellen from Blogger.com in 2005.^[1]

The specification for nofollow is copyrighted 2005-2007 by the authors and subject to a royalty free patent policy, e.g. per the W3C Patent Policy 20040205,^[2] and IETF RFC 3667 & RFC 3668. The authors intend to submit this specification to a standards body with a liberal copyright/licensing policy such as the GMPG, IETF, and/or W3C.^[1]

What nofollow is not for

The nofollow attribute value is not meant for blocking access to content or preventing content to be indexed by search engines. The proper methods for blocking search engine spiders to access content on a website or for preventing them to include the content of a page in their index are the Robots Exclusion Standard (robots.txt) for blocking access and on page Meta Elements that are designed to specify on an individual page level, what search engine spider should or should not do with the content of the crawled page.

Introduction and support

Google announced in early 2005 that hyperlinks with rel="nofollow" attribute^[3] would not influence the link target's PageRank. In addition, the Yahoo and MSN search engines also respect this tag.^[4]

How the attribute is being interpreted differs between the search engines. While some take it literally and do not follow the link to the page being linked to, others still "follow" the link to find new web pages for indexing. In the latter case rel="nofollow" actually tells a search engine "Don't score this link" rather than "Don't follow this link." This differs from the meaning of nofollow as used within a robots meta tag, which does tell a search engine: "Do not follow any of the hyperlinks in the body of this document.".

Interpretation by the individual search engines

While all engines that support the attribute exclude links that use the attribute from their ranking calculation, the details about the exact interpretation of the attribute vary from search engine to search engine.^[5]^[6]

Google takes "nofollow" literally and does not "follow" the link at all. That is supposedly their official statement, but experiments conducted by SEOs show conflicting results. They show instead that Google does follow the link, but does not index the linked-to page, unless it was in Google's index already for other reasons (such as other, non-nofollow links that point to the page).^[6] Links with "nofollow" are included in the backlinks reporting data at Google's Webmaster Central.^[7]
Yahoo! "follows it", but excludes it from their ranking calculation.
MSN Search respects "nofollow" as regards not counting the link in their ranking, but it is not proven whether or not MSN follows the link.
Ask.com does not use the attribute for anything.

rel="nofollow" Action	Google	Yahoo!	MSN Search	Ask.com
Follows the link	Yes	Yes	Not proven	Yes
Indexes the "linked to" page	No	Yes	No	Yes
Shows the existence of the link	Only for a previously indexed page	Yes	No	Yes
In SERPs for anchor text	Only for a previously indexed page	Yes	No	Yes

Usage by weblog software

Most weblog software marks reader-submitted links this way by default (with no option to disable it without code modification). A more sophisticated server software could spare the nofollow for links submitted by trusted users like those registered for a long time, on a whitelist, or with a high karma. Some server software adds rel="nofollow" to pages that have been recently edited but omits it from stable pages, under the theory that stable pages will have had offending links removed by human editors.

The widely used blogging platform Wordpress version 1.5 and above automatically assigns the nofollow attribute to all user-submitted links (comment data, commenter URI, etc).^[8]

Usage on other websites

MediaWiki software, which powers Wikipedia, was equipped with nofollow support soon after initial announcement in 2005. The option was enabled on most international Wikipedias. One of the prominent exceptions was the English language one. Initially, after a discussion, it was decided not to use rel="nofollow" in articles and to use a URL blacklist instead. In this way, English Wikipedia contributed to the scores of the pages it linked to, and expected editors to link to relevant pages.

In May 2006, a patch to MediaWiki software allowed to enable nofollow selectively in namespaces. This functionality was used on pages that are not considered to be part of the actual encyclopedia, such as discussion pages and resources for editors.^[9] Following increasing spam problems and a within-Foundation order from Jimmy Wales, rel="nofollow" was added to article-space links in January 2007;^[10].^[11] However, the various interwiki templates and shortcuts that link to other Wikimedia Foundation projects and many external wikis such as Wikia are not affected by this policy.

Other websites like Slashdot, with high user participation, use improvised nofollow implementations like adding rel="nofollow" only for potentially misbehaving users. Potential spammers posing as users can be determined through various heuristics like age of registered account and other factors. Slashdot also uses the poster's karma as a determinant in attaching a nofollow tag to user submitted links.

A number of social bookmarking and photo sharing websites also use the rel="nofollow" tag for their outgoing links, including: Digg.com, Furl, Propeller.com (formerly Netscape.com), Yahoo! My Web 2.0, YouTube, Searchles, Listible.com and Technorati Favs.^[12]

Repurpose for paid links

While the effectiveness of the nofollow attribute to prevent comment spam is in doubt and raises other issues instead,^[13] search engines have moved ahead and attempted to repurpose the attribute for something different. Google began suggesting the use of nofollow also as a machine-readable disclosure for paid links, so that these links do not get credit in search engines results.^[14]

The growth of the link buying economy, where companies' entire business models are based on paid links that affect search engine rankings,^[15] caused the debate about the use of nofollow in combination with paid links to move into the center of attention of the search engines, who started to take active steps against link buyers and sellers.^[16] This triggered a very strong response by the web master community.^[17]

Nofollow to control internal PageRank flow

Search engine optimization professionals started using the nofollow attribute to control the flow of PageRank within a website. This is an entirely different use than it was intended originally. Nofollow was designed to control the flow of PageRank from one website to another. However, SEOs realized that a nofollow used for an internal link should just work as well as nofollow used for external links. It makes sense, for example, to use nofollow for internal links to pages that are not relevant for search engines and only for visitors who are already on the website. This includes pages, such as "About Us", "Contact Us", "Terms of Use" or "Privacy Policy". Matt Cutts, one of the initial designers of the nofollow attribute, encouraged this use of the attribute.^[18]

Criticism

Some weblog authors object to the use of rel="nofollow", arguing, for example,^[19]^[20] that

Link spammers will continue to spam everyone to reach the sites that do not use rel="nofollow"
Link spammers will continue to place links for clicking (by surfers), even if those links are ignored by search engines.
Google is advocating the use of rel="nofollow" in order to reduce the effect of heavy inter-blog linking on page ranking.
According to the statistics shown at the Akismet's antispam service [1], this tag has not stopped the comment spam and has not discouraged spammers.
Lack of accreditation may discourage informational comments by professionals wishing to establish their PageRank reputation.

References

^ ^a ^b rel="nofollow" Specification, Microformats.org, retrieved June 17, 2007
^ W3C Patent Policy 20040205,W3.ORG
^ W3C (December 24, 1999), HTML 4.01 Specification, W3C.org, retrieved May 29, 2007
^ Google (January 18, 2006), Preventing comment spam, Official Google Blog, retrieved on May 29, 2007
^ Loren Baker (April 29, 2007),How Google, Yahoo & Ask.com Treat the No Follow Link Attribute, Search Engine Journal, retrieved May 29, 2007
^ ^a ^b Michael Duz (December 2, 2006),rel=”nofollow” Google, Yahoo and MSN, SEO Blog, retrieved May 29, 2007
^ Rel Nofollow Test from August 2007
^ Codex Documentation, Nofollow, Wordpress.org Documentation, retrieved May 29, 2007
^ Wikipedia (May 29, 2006), Wikipedia Signpost/2006-05-29/Technology report, Wikipedia.org, retrieved May 29, 2007
^ Brion Vibber (January 20, 2007), Nofollow back on URL links on en.wikipedia.org articles for now, Wikimedia List WikiEN-l, retrieved May 29, 2007
^ Wikipedia (January 22, 2007), Wikipedia Signpost/2007-01-22/Nofollow, Wikipedia.org, retrieved May 29, 2007
^ Loren Baker (November 15, 2007), Social Bookmarking Sites Which Don’t Use NoFollow Bookmarks and Search Engines, Search Engine Journal, retrieved on December 16, 2007
^ Jeremy Zawodny (May 30, 2006), Nofollow No Good?,Jeremy Zawodny's Blog, retrieved June 17, 2007
^ Matt Cutts (September 1, 2005), Text links and PageRank, Matt Cutts Blog, retrieved June 17, 2007
^ Philipp Lenssen (April 19, 2007), The Paid Links Economy,Google Blogoscope, retrieved June 17, 2007
^ Matt Cutts (April 14, 2007 ), How to report paid links, Matt Cutts Blog, retrieved June 17, 2007
^ Carsten Cumbrowski (May 14th, 2007), Matt Cutts on Paid Links Discussion - Q&A, SearchEngineJournal.com, retrieved June 17, 2007
^ October 8, 2007, Eric Enge Interviews Google's Matt Cutts, Stone Temple Consulting, retrieved on January 20,2008
^ Michael Hampton (May 23, 2005), Nofollow revisited, HomelandStupidity.us, retrieved May 29, 2007
^ Loren Baker (February 14, 2007), 13 Reasons Why NoFollow Tags Suck, Search Engine Journal, retrieved May 29, 2007