Main navigation

SiteSeeker - Knowledge base

Show items by:

Kunskapsdatabasen

Excluding information from crawling and indexing

SiteSeeker normally crawls every web page and document found on a website, starting with the home page of the website.

If there are links to objects on the site that should not be crawled, they can be excluded – we show you how.

Knowledge base / How-to guides • Updated: 30 August 2012
Troubleshooting - why are some pages or documents not being indexed?

If some pages or documents have not been indexed, use this troubleshooting guide to identify the problem.

Knowledge base / How-to guides • Updated: 4 September 2012
What is the difference between the various crawling modes?

When you manually start a crawl and indexing in SiteSeeker Admin you can choose between three different crawling modes, full, minimal or no crawl mode. Here follows a description in detail of the differences between the various modes so that you can pick the best mode for your environment.

Knowledge base / FAQ / Crawl and Indexing • Updated: 4 September 2012
Solutions for crawling JavaScript links

Links that depend on JavaScript are generally not crawled by SiteSeeker or global search engines. If the website uses JavaScript links exclusively, it may become invisible to the outside world.

Knowledge base / FAQ / Crawl and Indexing • Updated: 5 November 2012
How do I index images and documents in ImageVault using SiteSeeker?

It is possible to use SiteSeeker for searching among images and documents stored in ImageVault, an image and media management tool from Meridium, using the search integration in ImageVault. The integration, which is included in ImageVault, will also enable SiteSeeker to use image and document metadata stored in ImageVault, e.g. categories, descriptions and access control lists.

Knowledge base / FAQ / Crawl and Indexing • Updated: 22 February 2012
How does SiteSeeker index images?

SiteSeeker can in addition to HTML and other frequently occurring document types also index images.

Knowledge base / FAQ / Crawl and Indexing • Updated: 17 September 2012
How does SiteSeeker use sitemaps?

SiteSeeker supports sitemaps in familiar formats and can be located using "robots.txt", starting point configuration in SiteSeeker Admin or filename.

Knowledge base / FAQ / Crawl and Indexing • Updated: 20 December 2012
Indexing documents on another website

If you link to documents on another website, you can let SiteSeeker index those specifically, and disregard any other documents found on that website.

Knowledge base / FAQ / Crawl and Indexing • Updated: 4 September 2012