Can I Disallow Faceted Nav URLs - Robots.txt

tylerfraser

I have been disallowing /*? So I know that works without affecting crawling. I am wondering if I can disallow the faceted nav urls.

So disallow: /category.html/? /category2.html/? /category3.html/*?

To prevent the price faceted url from being cached:

/category.html?price=1%2C1000
and
/category.html?price=1%2C1000&product_material=88

Thanks!

AlanMosley

If you can no-index , follow all but the default, then you will send link juice to the pages but it will return the link juice because it is follow, but they will not index because they are no-index.

If you use robots, then it can not read the page to follow the links.

Francisco_Meza

Hey Tyler! haven't seen you on SEOmoz in a while. Hope you are good!

Check to see if this would make sense for you. GWT > Site Configuration > URL Perameters. It says "Only use this feature if you feel confident about how parameters work for your site. Telling Googlebot to exclude URLs with certain parameters could result in large numbers of your pages disappearing from our index."

tylerfraser

If I can, then I disallow hundreds of pages that are duplicate content and should not be crawled.

If I don't then I send link juice to urls that I don't want seen.

This is a good answer though, thanks. Any other thoughts?

AlanMosley

You can, but then you have links passing link juice to non followed pages. it would be better if you used canonical. even better would be to add no-index, follow meta tag when non canonical page is displayed, but this requres some codeing.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Moz Q&A is closed.

Can I Disallow Faceted Nav URLs - Robots.txt

Browse Questions

Explore more categories

Related Questions

One robots.txt file for multiple sites?

Is sitemap required on my robots.txt?

Is there a limit to how many URLs you can put in a robots.txt file?

Should I block Map pages with robots.txt?

Oh no googlebot can not access my robots.txt file

Removing robots.txt on WordPress site problem

Trailing Slashes In Url use Canonical Url or 301 Redirect?

Robots.txt file getting a 500 error - is this a problem?

Products

Moz Solutions

Free SEO Tools

Resources

About Moz

Why Moz

Get Involved