Indexing Data

Overview

To implement search on your website or app you must first create a collection and index your data. Sajari supports a wide variety of data types including webpages, documents (e.g. PDF, DOC, DOCX, etc) and JSON formatted data.

There are two primary ways of indexing your data:

  1. Crawler
  2. API

Crawler

The crawler is the easiest way to get started if you want to add search to a website.

Simply create a new website collection and enter your domain. The Sajari crawler will visit your webpages, index them, and store the records in your collection.

API

You can use the API directly to index almost any type of data, whether it's for an e-commerce store, a job website, or a mobile app.

However, before you can index your data, you will need to create a schema that describes your data. Use the getting started in 5 minutes guide to set up your collection with a schema. It will also set up an initial pipeline configuration for you and index the data you initially upload.