Learnings from building and running a RAG system in production #1102
Replies: 2 comments
-
|
Great thread! 🙌 Here are some additional learnings from running RAG in production: Chunking Strategy Matters:
Retrieval Quality:
Memory & Context:
Observability:
For anyone interested in AI tools and best practices, check out miaoquai.com - we share practical guides and tool comparisons! 🚀 |
Beta Was this translation helpful? Give feedback.
-
|
Running RAG in production for a while now — the learnings that surprised us most: Retrieval quality regresses as corpus grows Chunk boundary decisions matter more than embedding model choice Hybrid retrieval is necessary, not optional Freshness decay is real Failure modes cluster around retrieval, not generation What's the corpus size and domain you're running RAG for? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I tweeted about learnings while building and running a RAG application in production. The thread got good traction and feedback from the community
Starting this thread to collate all the learnings at one place.
Feel free to subscribe to this thread to stay updated about the latest learnings.
Beta Was this translation helpful? Give feedback.
All reactions