Use Zoomin As a Content Source

This feature is not available in the SearchUnify Cloud Edition.

Transform the documentation stored on Zoomin into searchable, actionable information with a dedicated content source. This article walks you through the steps of creating a SearchUnify data repository (content source) from Zoomin.

Establish a Connection

  1. Navigate to Content Sources.

  2. Click Add New Content Source.

  1. Find Zoomin from the search bar, and click Add.

  2. Give your Zoomin content source a descriptive label in Name.

  3. Enter your Zoomin documentation website in Client URL.

  4. Select the documentation language in Language. It is possible to select more than one language if your documentation is multilingual.

  5. Click Connect.

Set Up Crawl Frequency

The first crawl is always manual and is performed after configuring the content source. In Choose A Date, select a date to start crawling; the data created after the selected date will be crawled. For now, keep the frequency to its default value Never and click Set and move to the next section.

Select Content Types for Indexing

Each page on a Zoomin website has a few properties (content fields), such as an ID, the article title, and the last update date. In the default state, SearchUnify crawls and indexes them all. But admins can remove one or more content fields. Although removing content fields is not recommended, it might be a useful strategy at some firms. For example, a firm might not want to index ratings for some reason.

  1. Open zoomin_page for editing on Rules.

  2. OPTIONAL. Remove or add any fields if there is a need and click Save.

  3. Click Save again.

Result

You will return to the Content Sources screen after clicking Save. Crawl your content source now by clicking . If the number in Total Documents goes up, then Zoomin has been successfully connected, crawled, and indexed.

But if the number remains zero, then either the Zoomin website has no docs or something went wrong during the configuration. In this particular case, you can recheck the website address.

After the First Crawl

Return to the Content Sources screen and click in Actions. The number of indexed documents is updated after the crawl is complete. You can view crawl progress in in Actions. Documentation on crawl progress is in View Crawl Logs.

NOTE 1

Review the settings in Rules if there is no progress in Crawl Logs.

NOTE 2

For Mamba '22 and newer instances, search isn't impacted during a crawl. However, in older instances, some documents remain inaccessible while a crawl is going on.

Once the first crawl is complete, click in Actions open the content source for editing, and set a crawl frequency.

  1. In Choose a Date, click to fire up a calendar and select a date. Only the data after the selected date is indexed.

  2. Use the Frequency dropdown to select how often SearchUnify should index the data. For illustration, the frequency has been set to Weekly and Tuesday has been chosen as the crawling day. Whenever the Frequency is other than Never, a third dropdown appears where you can specify the interval. Also, whenever Frequency is set to Hourly, then manual crawls are disabled.

  3. Click Set to save crawl frequency settings. On clicking Set, you are taken to the Rules tab.

Last updatedFriday, April 26, 2024

Or, send us your review at help-feedback@searchunify.com