Overview

Data Pre-Processing

The SourceConnector is the central unit that takes care of extracting and pre-processing data ahead of embedding. Learn more about data pre-processing with Neum AI.

The SourceConnector takes three arguments:

  • 1 Data Connector
  • 1 Loader
  • 1 Chunker

Additionally, a custom_metadata property is allowed. The metadata provided in this field is added to every NeumDocument extracted by the SourceConnector.
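To make the composition concrete, here is a minimal, self-contained sketch of how one data connector, one loader, one chunker, and a custom_metadata mapping could fit together. It does not use the Neum AI library: Document, SketchSourceConnector, fixed_size_chunker, and the run method are hypothetical stand-ins invented for illustration. The rule that custom_metadata wins when a key collides is also an assumption.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List


@dataclass
class Document:
    """Hypothetical stand-in for a NeumDocument: text plus metadata."""
    content: str
    metadata: Dict[str, str] = field(default_factory=dict)


@dataclass
class SketchSourceConnector:
    """Illustrative only; not the Neum AI SourceConnector class."""
    data_connector: Callable[[], Iterable[str]]    # yields raw items
    loader: Callable[[str], Document]              # raw item -> document
    chunker: Callable[[Document], List[Document]]  # document -> chunks
    custom_metadata: Dict[str, str] = field(default_factory=dict)

    def run(self) -> List[Document]:
        chunks: List[Document] = []
        for raw in self.data_connector():
            for chunk in self.chunker(self.loader(raw)):
                # Assumption: custom metadata overrides colliding keys.
                chunk.metadata = {**chunk.metadata, **self.custom_metadata}
                chunks.append(chunk)
        return chunks


def fixed_size_chunker(size: int) -> Callable[[Document], List[Document]]:
    """Build a chunker that splits content into pieces of at most `size` characters."""
    def chunk(doc: Document) -> List[Document]:
        return [
            Document(doc.content[i:i + size], dict(doc.metadata))
            for i in range(0, len(doc.content), size)
        ]
    return chunk


connector = SketchSourceConnector(
    data_connector=lambda: ["alpha beta", "gamma"],
    loader=lambda raw: Document(raw, {"length": str(len(raw))}),
    chunker=fixed_size_chunker(5),
    custom_metadata={"team": "docs"},
)
documents = connector.run()
```

In this sketch every resulting chunk carries the "team" entry alongside whatever metadata the loader attached, which mirrors the behavior described for custom_metadata.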

Usage