Learn how to self-host the nodes needed to run a RAG pipeline
POOL_ADDRESS
, POOL_PORT
with the correct values for your pool.POOL_SSL
to true
docker logs --follow retrieval
to see if it started correctly.
EMBEDDINGS_TRANSFORMERS_DEVICE
to the numeric id of a cuda device (eg. 0).
And by passing --gpus all
to the docker run command.
Ensure you have the correct drivers and the nvidia container runtime installed.POOL_ADDRESS
, POOL_PORT
with the correct values for your pool.POOL_SSL
to true
docker logs --follow embeddings
to see if it started correctly.
POOL_ADDRESS
, POOL_PORT
with the correct values for your pool.POOL_SSL
to true
docker logs --follow embeddings
to see if it started correctly.
POOL_ADDRESS
, POOL_PORT
with the correct values for your pool.POOL_SSL
to true
docker logs --follow search
to see if it started correctly.
POOL_ADDRESS
, POOL_PORT
with the correct values for your pool.POOL_SSL
to true
docker logs --follow extism-runtime
to see if it started correctly.
status=="success"
after which you will be able to get the job results by fetching
a Job result event of kind==6003
(5003+1000)
and e
tag equivalent to the Job Request event id.
p
tag of the Job Request.Additionally you can also encrypt the request for the same public key, using NIP-04 as explained here.