Skip to content

document_loaders: confluence: Add factory func for attachment parser #18

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

faxe1008
Copy link

Adds a factory function that can be provided to the Conflunece Loader constructor. The factory function is called with the attachement information and should return a Blobparser that shall be used to extract the content of the document.

This is makes the behaviour of the loader more customizable, because the previous default of converting a PDF -> image and then using tesseract seems excessive.

@faxe1008 faxe1008 force-pushed the confluence_loader_flexible_att_parser branch from ce20bea to e9abe8a Compare April 30, 2025 09:40
Adds a factory function that can be provided to the Conflunece Loader
constructor. The factory function is called with the attachement
information and should return a Blobparser that shall be used to extract
the content of the document.
@faxe1008 faxe1008 force-pushed the confluence_loader_flexible_att_parser branch from e9abe8a to bff361c Compare April 30, 2025 09:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant