|
brendan. - 13 October 2008 05:09 PM Hi All,
ive just recently pushed a site live and have been checking out some the pages google has indexed to date.
This url : googleindex
Goes to a page that that doesnt have any records. For example the product doesnt have these attributes. Hence nothing is returned!
Im just curious why the bots are crawling my page like this
Also when i look at my Customers online i can see bots indexing pages by the product attributes. This seems weird as ive posted up a sitemaps with URL. Ive attached a screen capture of the page.
anyone suggest what im doing wrong here?
cheers
Google will crawl and index any pages that have links to it - regardless of the sitemap - we have extensively tested this.
To stop google crawling and indexing those pages you need to apply restrictions via nofollow on the links that are leading to those pages and/or apply restrictions in the robots.txt file
Google is going to those ‘attribute pages’ via links in the layered navigation, compare products, sort by, etc - you need to control what pages you want crawled and indexed and those you do not.
Some people MAY want these attribute pages indexed - that is up to them - we do not so we have taken steps to prevent it.
The robots.txt and nofollow changes we have made have been documented elsewhere in this SEO forum but if you need help let me know.
With all SEO you get debate about what you should and should not do and what google will and won’t do - we tend not to listen to MOST opinion we test and then act upon the results of our tests - we are comfortable with what we do and know 100% it makes a difference to our results. But everyone is different so you also need to be comfortable and understand the changes you are making.
Hope this helps.
|