# RAG and LLM Architecture

Welcome to the fascinating world of LLM Architecture and Retrieval-Augmented Generation, commonly known as RAG.&#x20;

<figure><img src="/files/CVU5yhlHZCXhpyLAvKrS" alt=""><figcaption></figcaption></figure>

In the current landscape, the value of Large Language Models (LLMs) in the progression of content understanding and generation is widely acknowledged. However, LLMs come with limitations such as the production of incorrect information, lack of data source verification, and dependence on outdated data. These shortcomings are particularly consequential for businesses that prioritize real-time, precise, and auditable data—commonly identified as key concerns.

<mark style="color:blue;">**Retrieval Augmented Generation (RAG)**</mark> offers a transformative solution to these issues. It elevates the capabilities of LLMs, making them relevant, reliable, and up-to-date.

In this module, we're laying the groundwork for an in-depth exploration of specialized techniques to improve pre-trained Large Language Models (LLMs) for particular use cases.&#x20;

Let's start by understanding

* What RAG is and
* Why it's a crucial component in the LLM ecosystem.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://kdag-iit-kharagpur.gitbook.io/realtime-llm/rag-and-llm-architecture.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.