Introducing the Replay Connector

The Replay connector allows you to repush data from a given source. You can also specify a subset of the scan or several scans at once using a given time frame.

The Replay components (Replay Server and Replay Connector) are designed for two main purposes:

  • Capture a scan data flow from a source to be able to replay it without crawling the source once again.

    This is useful in test and development environment when you need to clear index and sources several times or when running unit tests on a consistent corpus.

  • Use the replay storage to persist documents that may need to be repushed inside an index and that can be deleted from the original source (for example, when dealing with historical data).