Using the Split XML Documents Method

You can enable the Split parameter to split XML documents into several chunks.

Each first-level child element of the XML document's root node is extracted as a chunk, and every chunk is then handled as a separate document.

For example, suppose you want to index all the contacts from a single contacts.xml file:

<contacts>
  <person name="John M.">
    <phone>2222 3333</phone>
    <folder>work</folder>
  </person>
  <person name="Mary K.">
    <phone>1000 1000</phone>
    <folder>work</folder>
  </person>
  <person name="Sally M.">
    <phone>4242 4242</phone>
    <folder>work</folder>
  </person>
  <!-- ... -->
</contacts>

Each <person> node represents a document and stands as the root node when configuring expressions. For example, to retrieve the content of the name attribute with an XPath expression, you can enter: /person/@name
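The splitting behavior described above can be illustrated outside the connector with a minimal Python sketch. It is not the connector's implementation: it uses only the standard xml.etree.ElementTree module, and the names CONTACTS_XML and split_first_level are hypothetical, introduced for illustration. Because ElementTree does not evaluate absolute XPath expressions such as /person/@name, the sketch reads the attribute from the chunk's root element directly, which yields the same value.

```python
import xml.etree.ElementTree as ET

# Sample input mirroring the contacts.xml example (hypothetical constant name).
CONTACTS_XML = """<contacts>
  <person name="John M.">
    <phone>2222 3333</phone>
    <folder>work</folder>
  </person>
  <person name="Mary K.">
    <phone>1000 1000</phone>
    <folder>work</folder>
  </person>
  <person name="Sally M.">
    <phone>4242 4242</phone>
    <folder>work</folder>
  </person>
</contacts>"""


def split_first_level(xml_text):
    """Return each first-level child of the root as a standalone XML string."""
    root = ET.fromstring(xml_text)
    # Iterating over an element yields its direct child elements only.
    return [ET.tostring(child, encoding="unicode").strip() for child in root]


chunks = split_first_level(CONTACTS_XML)

names = []
for chunk in chunks:
    person = ET.fromstring(chunk)     # the root of each chunk is <person>
    names.append(person.get("name"))  # the value /person/@name would select

print(len(chunks))
print(names)
```

Each string in chunks is a well-formed document whose root is <person>, which is why expressions are written relative to /person rather than /contacts/person.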

See Also
Property Descriptions
  1. In the global configuration pane, set Split to true.

    The connector splits the XML document into several chunks.

  2. Expand Root Paths, and click Add item to add a path.
  3. In Root path, enter the file or folder path to index.

    For example, /data/xml-sample-data/contacts/contacts.xml

  4. Configure the extraction parameters. See the details for the supported methods:
  5. Click Apply.