Chat Version Demo (multi-turn + retrieval for each step + query rewrite): Google Drive Link
A clone of perplexity.ai, an LLM and RAG-powered search engine using LangChain, Chroma Vector Store, and Cohere Text Generation and Embedding Models. Frontend designed using Gradio.
- The user's query is passed through the Google Search API to create a corpus of the 10 most recent and relevant webpages.
- The documents are split and stored into a Chroma vector store
- Using Cohere's text embedding model, relevant snippets are collected using embedding similarity
- The relevant snippets are compiled into context for the RAG application
- The prompt and context are fed into Cohere's Command-R Text Generation LLM, and the output and documents are presented to the user.

