Project: RAG as a Service Backend

Project Overview

This project is a Retrieval Augmented Generation (RAG) as a Service backend. It's built with Node.js, Express.js, and TypeScript. MongoDB (via Mongoose) is used for data persistence. The service is designed to manage collections, resources (documents), and chunks of information, facilitating RAG operations. It integrates with external services for tasks like data crawling, vector embeddings (Langchain), and potentially agent-based processing.

The core entities are:

Collection: A logical grouping of resources. Each collection can have its own settings, such as the encoder to use for vectorizing its resources, chunk size, and chunk overlap.
Resource: A document or piece of content within a collection. Resources are broken down into multiple chunks for processing.
Chunk: Smaller, digestible parts of a resource, used for vector storage and retrieval.

Building and Running

The project uses TypeScript and ts-node-dev for development.

Development

To run the project in development mode (with live reloading):

npm run dev

Building

To compile the TypeScript code to JavaScript:

npm run build

Starting (Production)

To start the compiled JavaScript application:

npm start

Testing

To run unit tests:

npm test

To run tests in watch mode:

npm run test:watch

RAG Synchronization Job

To run a specific RAG synchronization job:

npm run rag-sync

Development Conventions

Language: TypeScript
Framework: Express.js
ORM/ODM: Mongoose (for MongoDB)
Project Structure: Follows a typical Node.js/Express project structure with separate directories for config, consumer, error, job, middleware, models, route, service, type, and utility.
Authentication: API key based authentication is used for routes.
Environment Variables: Uses dotenv for managing environment variables.
- QDRANT_URL: The URL for the Qdrant service.
- QDRANT_API_KEY: The API key for authenticating with Qdrant.
- QDRANT_COLLECTION_NAME: The name of the collection to use in Qdrant.
Logging: Uses winston for logging.
Queueing: Uses amqplib for RabbitMQ integration.
HTTP Client: Uses axios.
Web Scraping: Uses puppeteer-extra and cheerio.
Vector Embeddings/LLM Integration: Uses langchain and openai libraries.

Further Actions

Implement additional features related to RAG (e.g., actual embedding generation, retrieval logic).
Add comprehensive tests for the newly created services and routes.

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
ecosystem.config.js		ecosystem.config.js
jest.config.ts		jest.config.ts
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project: RAG as a Service Backend

Project Overview

Building and Running

Development

Building

Starting (Production)

Testing

RAG Synchronization Job

Development Conventions

Further Actions

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

Walkover-Web-Solution/hippocampus

Folders and files

Latest commit

History

Repository files navigation

Project: RAG as a Service Backend

Project Overview

Building and Running

Development

Building

Starting (Production)

Testing

RAG Synchronization Job

Development Conventions

Further Actions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages