Search Engine Watch Forums  - Search Engine Watch (#SEW)
Search Engine Watch
SEO News

 

  #1  
Old 09-18-2008
panana panana is offline
Member
 
Join Date: Sep 2005
Posts: 8
panana is on a distinguished road
Google Bots Overloading Server?

Hello All -

I had a friend email me that he needed help. He got a message from his hosting company saying that they were on the verge of suspending his site because the Google bot(s)was hitting his site so much that he was bringing the shared server down to a snail's pace.

What I want to know is - Is it possible for Google bots to be accessing a site so much that they would be bringing down a server? Here's a quote from the message they sent him:

"We have found that your site is causing server wide load problems for the
shared web server it is using. Specifically, the site is being crawled
excessively by Googlebots.

In order to resolve this issue, we will need for you to restrict the
information on your site available to a Googlebot. You can do so with the
appropriate directives in a robots.txt file, or by adding the meta tag <meta
name="Googlebot" content="nofollow" /> to the webpage."

This just doesn't sound right. To be an ongoing problem, Google would have to be hitting the site hundreds of times each and every day. His site is not ranked very high, so I can't see Google coming by that often.

As a friend, I especially don't like them forcing him to use a NoFollow and bringing internal pages value down just to satisfy them.

Is this possibly an attack of some kind, using the Google bot identity as a front? I can't see this being a legitimate, constant problem originating from Google's cataloging.

Any help is appreciated. Hope everyone is having a great Fall so far -


Jon
Reply With Quote
  #2  
Old 09-18-2008
JohnW's Avatar
JohnW JohnW is offline
 
Join Date: Jun 2004
Location: Virginia Beach, VA.
Posts: 967
JohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud of
Re: Google Bots Overloading Server?

>You can do so with the
appropriate directives in a robots.txt file, or by adding the meta tag <meta
name="Googlebot" content="nofollow" /> to the webpage."

What a load of garbage. Anyhow, in Google webmaster tools there is an option where you can have some control of the rate of Gbot crawling. Also if you are worried, the log files may help you confirm if it's really Gbot.
Reply With Quote
  #3  
Old 09-18-2008
jimbeetle's Avatar
jimbeetle jimbeetle is offline
 
Join Date: Mar 2006
Location: New York City
Posts: 1,000
jimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud of
Re: Google Bots Overloading Server?

Yeah, garbage. I'd also have your friend either trace the e-mail headers or contact the host to make sure it actually came from them.

Wouldn't be the first time that a competitor tried to get someone to nuke themselves.
Reply With Quote
  #4  
Old 09-18-2008
freeflyer freeflyer is offline
Member
 
Join Date: Sep 2008
Posts: 19
freeflyer is on a distinguished road
Re: Google Bots Overloading Server?

its not garbage... a bot can create thousands of consecutive requests, particularly to the database. A poolrly executed ecommerce site (such as certain modded oscommerce sites) have particularly bad problems with bots slowing the server to a crawl, but the site have to be badly written. Admittedly the server has to be pants in the first place, but it can happen.
Reply With Quote
  #5  
Old 09-18-2008
mcanerin's Avatar
mcanerin mcanerin is offline
 
Join Date: Jun 2004
Location: Calgary, Alberta, Canada
Posts: 1,570
mcanerin has a reputation beyond reputemcanerin has a reputation beyond reputemcanerin has a reputation beyond reputemcanerin has a reputation beyond reputemcanerin has a reputation beyond reputemcanerin has a reputation beyond reputemcanerin has a reputation beyond reputemcanerin has a reputation beyond reputemcanerin has a reputation beyond reputemcanerin has a reputation beyond reputemcanerin has a reputation beyond repute
Re: Google Bots Overloading Server?

This is why most search engines support the crawl-delay command.

The syntax is:

Quote:
User-agent: *
Disallow:
Crawl-delay: 5
The number is seconds.

Having said this, I'm very suspicious of this email. Either your ISP is admitting to having hardware that is extremely underpowered (search engines don't even download images - it's not bandwidth, it's just hits) or this is a trick by someone to get you to commit suicide.

If it's actually from your ISP - switch ISP's or fix the underlying issue with your database or trace the IP's of the googlebot visists and verify that they ARE googlebots and not just a DOS attack using googlebot agent strings.

If it's actually from your competitor (or anyone other than your ISP) - keep doing what you are doing, since you are obviously scaring someone who is too lazy to actually try to compete with you instead, and therefore doesn't deserve to rank well anyway.

Above all, NEVER use a robots.txt disallow function for bandwidth control unless the robots are actually spidering areas you don't want them to spider.

Ian.
__________________
International SEO

Last edited by mcanerin : 09-18-2008 at 02:28 PM.
Reply With Quote
  #6  
Old 09-18-2008
JohnW's Avatar
JohnW JohnW is offline
 
Join Date: Jun 2004
Location: Virginia Beach, VA.
Posts: 967
JohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud of
Re: Google Bots Overloading Server?

>its not garbage

Sorry, but I think it is garbage for a hosting company to give this kind of stupid advice about nofollow (like this is supposed to stop crawling?)and encourage their clients to commit nofollow/robots.txt suicide with their Google rankings. If it's free hosting, then fine, let the client know that they have to start paying.
Reply With Quote
  #7  
Old 09-18-2008
JohnW's Avatar
JohnW JohnW is offline
 
Join Date: Jun 2004
Location: Virginia Beach, VA.
Posts: 967
JohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud of
Re: Google Bots Overloading Server?

Ian, good point about robots crawl delay. Still, if Gbot is in fact the problem isn't WMT a good place to deal with it?
Reply With Quote
  #8  
Old 09-18-2008
panana panana is offline
Member
 
Join Date: Sep 2005
Posts: 8
panana is on a distinguished road
Re: Google Bots Overloading Server?

Thanks everyone for the replies and all the help! My friend actually did call the host and talked to them so this is a real email from them(!).

Again, greatly appreciate the info and confirming that this just couldn't be happening. At least not the way the host is saying it is.
Reply With Quote
  #9  
Old 09-19-2008
jimbeetle's Avatar
jimbeetle jimbeetle is offline
 
Join Date: Mar 2006
Location: New York City
Posts: 1,000
jimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud of
Re: Google Bots Overloading Server?

Okay then, legitimate, but still not the best advice. Best bet now is too take the steps Ian outlined above:

-crawl delay
-check that requests actually are from googlebot
-check db and site structure
-consider host change if all above checks out
Reply With Quote
  #10  
Old 09-20-2008
freeflyer freeflyer is offline
Member
 
Join Date: Sep 2008
Posts: 19
freeflyer is on a distinguished road
Re: Google Bots Overloading Server?

Quote:
Originally Posted by JohnW View Post
Sorry, but I think it is garbage for a hosting company to give this kind of stupid advice about nofollow (like this is supposed to stop crawling?)and encourage their clients to commit nofollow/robots.txt suicide with their Google rankings. If it's free hosting, then fine, let the client know that they have to start paying.

john, admittedly the advice was garbage, but not the reasoning behind it (ie bots can slow servers), which is what it looked like was being said

Last edited by freeflyer : 09-20-2008 at 06:22 AM.
Reply With Quote
  #11  
Old 09-21-2008
Marcia's Avatar
Marcia Marcia is offline
 
Join Date: Jun 2004
Location: Los Angeles, CA
Posts: 5,479
Marcia has a reputation beyond reputeMarcia has a reputation beyond reputeMarcia has a reputation beyond reputeMarcia has a reputation beyond reputeMarcia has a reputation beyond reputeMarcia has a reputation beyond reputeMarcia has a reputation beyond reputeMarcia has a reputation beyond reputeMarcia has a reputation beyond reputeMarcia has a reputation beyond reputeMarcia has a reputation beyond repute
Re: Google Bots Overloading Server?

Quote:
(ie bots can slow servers)
It sound like those servers are lacking capability in the first place. Also, in cases I've heard of where a site was using too much in the way of server resources, the host asked them to upgrade to a dedicated server.

Their suggestions is a little too "SEO-savvy" for my liking, it sounds like an out of line suggestion.
Reply With Quote
  #12  
Old 09-22-2008
freeflyer freeflyer is offline
Member
 
Join Date: Sep 2008
Posts: 19
freeflyer is on a distinguished road
Re: Google Bots Overloading Server?

the bots only slow the server IF the site is written poorly, ie generating hundreds of unnecessary queries every time, when a handful is all thats needed.
Reply With Quote
  #13  
Old 09-22-2008
jimbeetle's Avatar
jimbeetle jimbeetle is offline
 
Join Date: Mar 2006
Location: New York City
Posts: 1,000
jimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud ofjimbeetle has much to be proud of
Re: Google Bots Overloading Server?

Quote:
the host asked them to upgrade to a dedicated server
The absence of the upsell in the e-mail from the host is what first made me think it might be a scam.
Reply With Quote
  #14  
Old 09-22-2008
ScottG ScottG is offline
Member
 
Join Date: Aug 2006
Location: Portland, Oregon
Posts: 16
ScottG is on a distinguished road
Re: Google Bots Overloading Server?

Quote:
Originally Posted by mcanerin View Post
(search engines don't even download images - it's not bandwidth, it's just hits)
Really? I thought that they just didn't come around as often as their normal bot might.

http://images.google.com/images?as_s...=Search+Images

And here is some "obama" pic (fake?):
http://blog.searchenginewatch.com/bl...%20picture.jpg
that is on SEW. Another site re-used it, and this was re-associated with SEW as well:
http://images.google.com/images?um=1...=Search+Images

And with engines like Cuil, lol, image bot = fail:
http://www.robleto.com/2008/07/28/cu...ociation-fail/
http://pixelbits.wordpress.com/2008/...l-got-glasses/

And I've seen some references in the past to: "Googlebot-Image/1.0"
Reply With Quote
  #15  
Old 09-26-2008
thedevnull thedevnull is offline
Member
 
Join Date: Aug 2008
Posts: 23
thedevnull is on a distinguished road
Re: Google Bots Overloading Server?

I think you can set the crawl rate in Google Webmaster Tools as well...
Reply With Quote
  #16  
Old 09-26-2008
JohnW's Avatar
JohnW JohnW is offline
 
Join Date: Jun 2004
Location: Virginia Beach, VA.
Posts: 967
JohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud ofJohnW has much to be proud of
Re: Google Bots Overloading Server?

Yeah, that was my take as well but WMT is not totally clear how it works. I think that in robots.txt you can only set the delay not the rate.
Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Google AdWords Advertisers must have Google Accounts to log in? andrewgoodman Google AdWords 21 01-22-2011 06:01 AM
Is the Google Adwords server making mistakes? tonerman Google AdWords 6 09-26-2008 12:16 PM
are pages at the root of the server more relevant to google? jpf566 Search Engine Optimization 3 06-14-2008 05:23 PM
The influence and domination of Google: Yahoo buys text links PixelStreamed Google Web Search 5 12-28-2006 04:41 AM
Inside The Searcher's Mind - Live from SES San Jose rustybrick SEM Related Organizations & Events 0 08-02-2004 04:37 PM


All times are GMT -4. The time now is 05:28 PM.