Monday, September 14, 2009

Content for this URL is excluded by the server because a no-index attribute

In SharePoint/MOSS server search crawl log you may see "Content for this URL is excluded by the server because a no-index attribute" apear for pages that you would like to be indexed.

What to check in this case:
  • Rules
  • Robots.txt file
  • Page Source (robots meta)
  • Page library advanced settings
  • 'Site Settings' - 'Search Visibility' under 'Site Administration'
  • and last but not least: Make sure you can access the page with the user that you definied as robots/crawler user. See central administration:
    /ssp/admin/_layouts/contentaccessaccount.aspx
    (log on to your site with this user)

Hint fot German users: The error message in German is "Der Inhalt für diese URL wird wegen eines Nichtindexattributs vom Server ausgeschlossen".

1 comment:

Ari said...

It was something else for us. In our sharepoint farm we have a seperate server to do the indexing.

The problem was that this index server was missing some dlls, web parts, and javascripts that are used by the master pages and pages layouts, and which were installed only in the web frontend servers.