Q: Context limit?

I tried many AI assistants and all ended up being a disappointment for the same reason: the limited context window of the LLM. I can add tens of thousands of pages of user content, but most of it will be ignored because it doesn't fit in the context window, and answers will only be based on a tiny fraction of selected content from the documents. So IMO today's LLMs, while a feasible way to get answers from a small number of user documents, are not the right tool to analyze very large collections of user data. Sadly, a simple full-text search (30+ year old technology) of the documents gets more complete results than AI assistants.
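To put the mismatch in numbers, here's a rough back-of-the-envelope sketch; the page count, tokens-per-page figure, and context size below are all illustrative assumptions, not measurements of any particular assistant:

```python
# Back-of-the-envelope: how much of a large corpus fits in one prompt.
pages = 10_000            # assumed size of the document collection
tokens_per_page = 500     # rough average for dense prose
context_window = 128_000  # a large context window by 2024 standards

corpus_tokens = pages * tokens_per_page    # 5,000,000 tokens
fraction = context_window / corpus_tokens  # ~0.026

print(f"Corpus: {corpus_tokens:,} tokens")
print(f"Fits in one prompt: {fraction:.1%}")  # roughly 2.6%
```

Even with a generous context window, well over 95% of the corpus can never be in front of the model at once.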

I am waiting for the AI assistant that somehow manages to overcome this limitation. Did you manage to solve this, or does this context limitation of LLMs affect IKI.AI the same way it affects other AI assistants?

Posted: Sep 17, 2024
Posted: Sep 18, 2024

Thanks - I have a basic understanding of RAG and that's also the context of my question.

The chunks sent to the LLM still have to fit into the context window: e.g. if retrieval returns 10,000 relevant chunks, you can't include all 10,000 in the prompt, only the top hits. So the response will be incomplete - based on those top hits only - and the answer depends on what the retriever chooses to send to the LLM.
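Here's a minimal sketch of that top-k bottleneck; the `embed` function is a toy bag-of-words stand-in for a real embedding model, and the chunk data is made up:

```python
from collections import Counter
import math

def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding'; real RAG uses learned vector embeddings."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve_top_k(query: str, chunks: list[str], k: int = 5) -> list[str]:
    """Rank every chunk, but keep only the k best; the rest are dropped."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]  # the LLM only ever sees these k chunks

# Even if 10,000 chunks match the query, the prompt is built from just the top k:
chunks = [f"chunk {i} mentions invoices" for i in range(10_000)]
prompt_context = retrieve_top_k("invoices", chunks, k=5)
print(len(prompt_context))  # 5 - the other 9,995 matches never reach the model
```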

Posted: Sep 18, 2024

You never serve that amount of context to an LLM; it just doesn't work well that way if you expect a fast and precise answer.
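In practice that means the prompt builder packs chunks until a token budget runs out and silently drops the rest. A minimal sketch, with the budget and the crude token estimate both assumed:

```python
def pack_context(chunks: list[str], budget_tokens: int = 8_000) -> list[str]:
    """Pack chunks into the prompt until the token budget is exhausted."""
    packed, used = [], 0
    for chunk in chunks:
        cost = len(chunk.split())  # crude estimate; real systems use a tokenizer
        if used + cost > budget_tokens:
            break  # everything after this point is silently dropped
        packed.append(chunk)
        used += cost
    return packed

print(len(pack_context([f"chunk {i} text" for i in range(10_000)])))  # ~2,666 of 10,000
```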

Posted: Sep 18, 2024

Yes, that's what I was afraid of... my use case is based on a TON of data, so until the technology gets there, sadly, a full-text search in a text file is an ancient but better option. AI will get there, but it's not yet the right solution for large collections of documents. Expectation management is important.
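For comparison, the "ancient" option is just an exhaustive scan; the file path here is hypothetical:

```python
def full_text_search(path: str, query: str) -> list[tuple[int, str]]:
    """Plain exhaustive scan: slow and dumb, but it returns EVERY match,
    not just the top-k a retriever happens to pick."""
    hits = []
    with open(path, encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            if query.lower() in line.lower():
                hits.append((lineno, line.rstrip()))
    return hits  # complete result set, no ranking, no truncation

# hits = full_text_search("notes.txt", "invoice")  # hypothetical file
```

The trade-off is completeness versus synthesis: the scan misses nothing but explains nothing, while the RAG pipeline explains well but only sees what survives the top-k cut.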