Search Engine Watch Forums  - Search Engine Watch (#SEW)
Search Engine Watch
SEO News

 

  #1  
Old 03-09-2009
Webnauts Webnauts is offline
Organic SEO Expert
 
Join Date: Apr 2005
Location: Germany
Posts: 23
Webnauts is on a distinguished road
Arrow Nocrawl instead Nofollow Pros and Cons

I would to ask your opinion about a possible alternative to the "nofollow attribute, which I will call it here "bots=nocrawl".

I have for example a page linking to a page called example.html

The URL looks like this:
http:// www. whateveryouwantocallthat. com/example.html?bots=nocrawl

In the robots.txt I add this:

User-agent: Googlebot
Disallow: *bots=nocrawl
Noindex: *bots=nocrawl

In addition I add in the .htaccess file X-Robots directives to prevent the robots.txt of being indexed, followed,etc.:

<FilesMatch "\.(txt)$">
Header set X-Robots-Tag "noindex,nofollow,noarchive,nosnippet"
</FilesMatch>

What difference do you see between the use of the "nofollow" attribute and the "bots=nocrawl" as setup this way.

What are the possible pros and cons using "bots=nocrawl" instead of the "nofollow" attribute?

---
P.S. To go a step further, I was thinking what would be if using "bots=nocrawl" in destination URLs and adding on the targeted web pages the new "canonical element" (where applicable, i.e duplicated pages or with similar content).
Reply With Quote
  #2  
Old 03-09-2009
AussieWebmaster's Avatar
AussieWebmaster AussieWebmaster is offline
Forums Editor, SearchEngineWatch
 
Join Date: Jun 2004
Location: NYC
Posts: 8,146
AussieWebmaster has a brilliant futureAussieWebmaster has a brilliant futureAussieWebmaster has a brilliant futureAussieWebmaster has a brilliant futureAussieWebmaster has a brilliant futureAussieWebmaster has a brilliant futureAussieWebmaster has a brilliant futureAussieWebmaster has a brilliant futureAussieWebmaster has a brilliant futureAussieWebmaster has a brilliant futureAussieWebmaster has a brilliant future
Re: Nocrawl instead Nofollow Pros and Cons

You may want to look at this from Matt Cutts http://www.mattcutts.com/blog/googlebot-keep-out/

and his cavaet there:
Obscure note #1: using the ‘googlebot=nocrawl’ technique would not be the preferred method in my mind. Why? Because it might still show ‘googlebot=nocrawl’ urls as uncrawled urls. You might wonder why Google will sometimes return an uncrawled url reference, even if Googlebot was forbidden from crawling that url by a robots.txt file. There’s a pretty good reason for that: back when I started at Google in 2000, several useful websites (eBay, the New York Times, the California DMV) had robots.txt files that forbade any page fetches whatsoever. Now I ask you, what are we supposed to return as a search result when someone does the query [california dmv]? We’d look pretty sad if we didn’t return www.dmv.ca.gov as the first result. But remember: we weren’t allowed to fetch pages from www.dmv.ca.gov at that point.
Reply With Quote
  #3  
Old 03-12-2009
Webnauts Webnauts is offline
Organic SEO Expert
 
Join Date: Apr 2005
Location: Germany
Posts: 23
Webnauts is on a distinguished road
Re: Nocrawl instead Nofollow Pros and Cons

Thanks for the quick reply. I was just looking around and I read this:

Quote:
Cutts stated explicitly that Google does not crawl nofollow links in July 2006, in his Bot Obedience: Herding Googlebot post: "At a link level, you can add a nofollow tag on the granularity of individual links to prevent Googlebot from crawling individual links (you could also make the link redirect through a page that is forbidden by robots.txt).
But I do that already, but instead using the "nofollow" attribute I use the keep out Googlebot menthod. Where do you see a difference.

Quote:
Bear in mind that if other pages link to a url, Googlebot may find the url through those other paths."
Because I bear that in mind, that is why I implement the "noindex" so that will not happen.

Quote:
Lasnick stepped in again last night to further clarify the issue in another post, If rel="nofollow" is becoming the norm. He notes that "nofollow links aren't listed any differently than other links in our Webmaster Tools backlinks section," and said that nofollow links will show up in search resulsts using the "link:" operator.
Exactly! The "nofollow" links aren't listed any differently than other links in our Webmaster Tools. But with the "bots=nocrawl" pages will never show up, if the "bot=nocrawl" is setup before adding new pages. If the pages are already picked up from Googlebot before you have implemented that method, then you have to request a deletion in the Webmaster Tools, and when they are deleted, they will never show again.

Now taking this to another level. Because as I said, the targeted pages have the meta robots tag directives "noindex,nofollow,noarchive,nosnippet" or that is achieved with X-Robots directives, you might tell that they are dangling pages. But the same happens if you use the "nofollow" attribute.

So am I missing something again?

---
Quotes of Matt Cutts and Adam Lasnik are found here: http://blog.searchenginewatch.com/070215-123945
Reply With Quote
  #4  
Old 03-12-2009
rainborick rainborick is offline
Member
 
Join Date: Mar 2006
Posts: 60
rainborick is just really nicerainborick is just really nicerainborick is just really nicerainborick is just really nicerainborick is just really nice
Re: Nocrawl instead Nofollow Pros and Cons

It depends on what your goal is. If your goal is just to keep a document out of the search engines' index, then a robots <meta> tag set to noindex, a Disallow instruction in the robots.txt file, or an x-robots-tag directive in your .htaccess file is sufficient. The search engines will ignore those instructions only in extraordinary circumstances, as Aussie noted. Which one is better I think probably depends on whether or not you need the pages to be crawled, but not indexed. Beyond that, the distinctions seem pretty much without a difference. Its been such a long time since I've seen a URL-only entry appear in the search results, that I think it's pretty innocuous unless you don't even want the URL to appear in the results of the site: operator.

I wouldn't be concerned that the URL for these pages is in the link database which is only used for calculating PageRank etc., and for feeding the crawl queue. Once you've blocked them from being in the index, using appropriate <meta> tags on the blocked documents and/or including a nofollow on any links that point to them will allow you to control the flow of PageRank to and from these documents as needed.
Reply With Quote
  #5  
Old 12-16-2009
tpraja tpraja is offline
SEO-Specialist-PalanivelRaja
 
Join Date: Nov 2006
Location: chennai
Posts: 12
tpraja is on a distinguished road
Re: Nocrawl instead Nofollow Pros and Cons

According to me nofollow is better than nocrawl

Palanivel Raja

Last edited by JohnW : 12-16-2009 at 07:37 PM. Reason: removed sig link
Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Can we run Adsense for adult web sites? Please give all the pros and cons? Shiv Kumar Dogra Google AdSense 6 03-02-2007 05:07 PM
Pros Vs Cons of Google adsense transformers Google AdSense 2 08-14-2006 04:07 PM
Pros and Cons of Web Rings AussieWebmaster Link Building 0 06-22-2005 11:06 AM
Is buying text links valuable? irank Link Building 40 04-13-2005 11:53 AM
Pros & Cons for Going Multi-Lingual K.S. Katz Multilingual Search Markets & Non-US Engines 4 07-27-2004 11:12 AM


All times are GMT -4. The time now is 09:12 PM.