When embedding sites into chromadb, also push the following: - Original path - Relevant PDFs (after OCR, push pdf content as chunks)
When embedding sites into chromadb, also push the following: