Learn the high level architecture for Neum AI pipelines
Generators
have been used to enable parallelization of tasks. Meaning, you can spin up workers to handle compartmentalized tasks like processing documents, generating embeddings and ingesting information into storage. This ensures that datasets are processed quickly and create robustness of the system in case of failures.
Source Connector
Embed Connector
Sink Connector
Generators
to yield results. These results can be passed on to different workers to scale the processing of data. We recommend the usage of frameworks like Celery
. Using Celery
, we can set up mutliple different worker pools:
Source Conenctor
including:
Source Conenctor
including:
Embed Conenctor
and Sink Connector
including:
Generators
and yields
that allow you parallelize your workloads.