Biggest RAG Hurdles for Beginners?
Hey Everyone,
I'm curious what the group thinks are the biggest pain points for devs getting started with RAG. My list would be:
- hallucination: especially with complex docs
- eval: there are tools to score completions vs retrievals, but what about the rest of the RAG pipeline, where the problems actually occur?
- complexity: many pieces of the pipeline to master (parse, extract, convert to LLM-friendly data, chunk, embed, create metadata for context, search, rerank, etc.) and lots of theories on the best approach to each one.
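To make the eval point a bit more concrete, here's a rough sketch of scoring just the search step on its own, separate from the final completion. It assumes you have a small hand-labeled set of relevant chunk IDs per query; the function names, query IDs, and chunk IDs below are all made up for illustration, and recall@k is only one of several metrics you could use here.

```python
from typing import Dict, List, Set


def recall_at_k(retrieved: List[str], relevant: Set[str], k: int) -> float:
    """Fraction of the relevant chunk IDs that appear in the top-k retrieved IDs."""
    if not relevant:
        return 0.0
    top_k = set(retrieved[:k])
    return len(top_k & relevant) / len(relevant)


def mean_recall_at_k(
    runs: Dict[str, List[str]], gold: Dict[str, Set[str]], k: int
) -> float:
    """Average recall@k over every query that has gold labels."""
    scores = [recall_at_k(runs.get(q, []), rel, k) for q, rel in gold.items()]
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical data: ranked chunk IDs returned by the search step, per query.
runs = {
    "q1": ["c3", "c7", "c1", "c9"],
    "q2": ["c2", "c8", "c4", "c5"],
}
# Hypothetical hand-labeled relevant chunks, per query.
gold = {
    "q1": {"c1", "c3"},
    "q2": {"c5", "c6"},
}

print(mean_recall_at_k(runs, gold, k=3))
```

The same idea should carry over to the other stages: label a handful of examples for that stage's output (e.g. expected chunks after reranking) and score it in isolation, so a bad answer can be traced to the step that caused it.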
What's everyone else dealing with?