mphung
01-05-2007, 02:50 PM
I've got a site with a lot of dynamic URLs that result in duplicate content being indexed. If I exclude these pages from spidering in my robots.txt file, but other people continue to link to the various versions, will they still be indexed, or will the spiders consult robots.txt first and know not to index them?
In other words, do I need to put a noindex,nofollow meta tag on each duplicate page in addition to the robots.txt rules, or will robots.txt alone cover both cases (both = excluding these URLs when they are found by crawling my site, and also excluding them when other sites link to them)?
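To make the setup concrete, here is a minimal sketch of the kind of robots.txt rule I mean, checked with Python's standard `urllib.robotparser`. The `/catalog.php` path, the `example.com` URLs, and the `is_crawl_allowed` helper are made up for illustration; they are not my real site. It only shows which URLs a robots.txt-respecting crawler would be allowed to fetch. It says nothing about whether a blocked URL can still end up indexed through links from other sites, which is what I'm asking.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block the dynamic script that produces the
# duplicate pages. The path is a placeholder, not taken from a real site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /catalog.php
"""


def is_crawl_allowed(url: str, user_agent: str = "*") -> bool:
    """Return True if the rules above permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # One dynamic (duplicate) URL and one canonical URL, both hypothetical.
    for url in (
        "http://example.com/catalog.php?item=5&sort=price",
        "http://example.com/catalog/item-5.html",
    ):
        print(url, "allowed" if is_crawl_allowed(url) else "blocked")
```

The alternative I'm weighing is a `<meta name="robots" content="noindex,nofollow">` tag in the head of each duplicate page.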
Thanks.