@labrenbe (Member) commented Jan 15, 2026

Adds a new demo that shows Retrieval-Augmented Generation (RAG) with OpenSearch.

Stack:

  • OpenSearch 3.1.0 with k-NN plugin
  • Ollama running two models: nomic-embed-text:v1.5 (embeddings) & llama3.1:8b (response generation)
  • JupyterLab with interactive RAG pipeline notebook
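
To illustrate how these three components could fit together, here is a minimal sketch of the request payloads a RAG notebook might build. It is not taken from the notebook in this PR. The vector field name `embedding`, the chunk field `text`, and the prompt wording are assumptions for illustration. The payload shapes follow Ollama's `/api/embeddings` and `/api/generate` endpoints and the OpenSearch `knn` query; no network calls are made here.

```python
EMBED_MODEL = "nomic-embed-text:v1.5"  # embedding model named in the stack list
CHAT_MODEL = "llama3.1:8b"             # generation model named in the stack list


def embedding_request(text: str) -> dict:
    """JSON payload for Ollama's POST /api/embeddings endpoint."""
    return {"model": EMBED_MODEL, "prompt": text}


def knn_query(vector: list[float], k: int = 3, field: str = "embedding") -> dict:
    """OpenSearch k-NN search body.

    `field` is a hypothetical name for the knn_vector field in the index.
    """
    return {"size": k, "query": {"knn": {field: {"vector": vector, "k": k}}}}


def build_prompt(question: str, hits: list[dict], text_field: str = "text") -> str:
    """Concatenate retrieved chunks into a context block ahead of the question.

    `hits` is expected to look like response["hits"]["hits"] from OpenSearch;
    `text_field` is an assumed name for the chunk text.
    """
    context = "\n\n".join(hit["_source"][text_field] for hit in hits)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


def generate_request(prompt: str) -> dict:
    """JSON payload for Ollama's POST /api/generate endpoint (non-streaming)."""
    return {"model": CHAT_MODEL, "prompt": prompt, "stream": False}
```

In this sketch the flow would be: embed the question, run `knn_query` against the index, pass the hits to `build_prompt`, then send `generate_request` to Ollama.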

Data

  • Documentation for the Stackable Data Platform crawled from docs.stackable.tech
  • Pre-generated JSON file containing ~4200 documentation chunks (93 MB, stored in git)
  • Loaded into OpenSearch with a Kubernetes Job
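
As a rough sketch of what the loading step might involve, the snippet below builds an index definition with a `knn_vector` field and an NDJSON body for the OpenSearch `_bulk` API. It is not the Job from this PR. The field names (`text`, `url`, `embedding`), the chunk layout, and the 768-dimension value are assumptions; the dimension in particular should be checked against the embedding model's actual output.

```python
import json

EMBEDDING_DIM = 768  # assumed vector size; verify against the embedding model


def index_body(dim: int = EMBEDDING_DIM) -> dict:
    """Index settings and mappings enabling k-NN search on an `embedding` field.

    All field names here are hypothetical.
    """
    return {
        "settings": {"index": {"knn": True}},
        "mappings": {
            "properties": {
                "text": {"type": "text"},
                "url": {"type": "keyword"},
                "embedding": {"type": "knn_vector", "dimension": dim},
            }
        },
    }


def bulk_payload(index: str, chunks: list[dict]) -> str:
    """Build the newline-delimited body expected by POST /_bulk.

    Each chunk becomes an action line followed by the document itself,
    and every line ends with a newline as the bulk API requires.
    """
    lines = []
    for chunk in chunks:
        lines.append(json.dumps({"index": {"_index": index}}))
        lines.append(json.dumps(chunk))
    return "".join(line + "\n" for line in lines)
```

With ~4200 chunks in a 93 MB file, a loader would likely send the documents in several smaller batches rather than one request.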

The data file should be regenerated after each Stackable release so that the demo does not use outdated documentation. Regeneration can take more than an hour, which is why the data is committed to git rather than generated during demo installation.

Note for reviewers: To deploy the demo, change the branch parameter to feat/opensearch-rag-demo in the stacks and demos files. Depending on your hardware, the demo should run on your local machine, but a kind cluster on Replicated with a single r1.xlarge node works best.

@labrenbe self-assigned this Jan 15, 2026
@labrenbe linked an issue Jan 15, 2026 that may be closed by this pull request
@labrenbe force-pushed the feat/opensearch-rag-demo branch from 6f40f70 to 8c1beee Jan 19, 2026 14:51
@labrenbe marked this pull request as ready for review Jan 19, 2026 15:07

Development

Successfully merging this pull request may close these issues.

Add OpenSearch AI demo

2 participants