PDA

View Full Version : Google Pulling from Urchin?


PortofDreams
12-15-2005, 04:45 PM
I found something strange today when looking at my google sitemaps crawl stats.

http://www.celebritycouples.net/celeb/pittjolie 404 not found
http://www.celebritycouples.net/celeb/pittjolie/joliemovies 404 not found
http://www.celebritycouples.net/celeb/pittjolie/pittmovies 404 not found

These aren't actual pages on my site, rather they are what I told Urchin to record them as inside of the JavaScript tags (For readability when viewing reports). I am not putting the full URL in the script, only the relative path that I want to see in Urchin.

<script type="text/javascript">
urchinTracker('/Celeb/PittJolie');
</script>

I'm starting to think that the Google Spider is using not only Sitemaps, but also Urchin data to get URL's to crawl.

It is smart to me that Google does use Urchin data to find more pages to index if this is the case, but it shouldn't be using "Named Pages", only "Actual Pages". I was just curious if anyone else has had a similair experience like this?

Either way, it doesn't hurt anything I don't think.

jkemp
12-15-2005, 06:01 PM
More likely Googlebot is reading the url in the javascript and trying to follow the link. So it looks like Googlebot can follow links in javascript now? Just a guess.

And BTW, I'm from Knoxville too.

PortofDreams
12-15-2005, 06:33 PM
Part of me thinks that could be the case, but the other part thinks 'how could Google distinguish that this is a URL and not just some random string that contains a "/" '.

Nice to see a fellow Knoxvillian! Go Vols