Add PDF fragment loader plugin to directory #954

agustif · 2025-04-25T18:59:49Z

Hi! After building this feature for my other arXiv plugin, I thought it could be useful for generic PDFs too, so I made this:

llm-plugin-pdf provides a -f pdf: loader that can load local or remote PDF files as fragments.

It's a little wrapper around PyMuPDF that tries to parse a PDF's text and images into Markdown, so the file's contents can be provided as a fragment.

This should use way fewer tokens than feeding a full PDF to a model directly. Most papers are actually built from source, so text extraction works well without relying on clunky OCR. (I explored using GROBID for other uses, but it requires a server, which made it a no-go for this. PyMuPDF worked well in my tests, and I was also able to parse the PDF's images into base64-encoded data, so everything is passed to the model as fragments, not only the text.)
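To make the text-plus-images idea concrete, here is a minimal sketch of the Markdown assembly step only. It is not the plugin's actual code: the `build_markdown` function, the `## Page N` headings, and the shape of the `pages` input are assumptions made for illustration, and the PyMuPDF extraction that would produce the per-page text and image bytes is left out.

```python
import base64


def build_markdown(pages):
    """Assemble Markdown from already-extracted page data.

    `pages` is a hypothetical structure: a list of
    (text, [(mime_type, image_bytes), ...]) tuples, one per page.
    """
    parts = []
    for number, (text, images) in enumerate(pages, start=1):
        # One heading per page, followed by that page's text.
        parts.append(f"## Page {number}\n\n{text.strip()}")
        for index, (mime, data) in enumerate(images, start=1):
            # Inline each image as a base64 data URI so it travels with the text.
            encoded = base64.b64encode(data).decode("ascii")
            parts.append(
                f"![page {number} image {index}](data:{mime};base64,{encoded})"
            )
    return "\n\n".join(parts)


# Stand-in data; a real loader would get this from the PDF parser.
sample_pages = [
    ("Abstract\nWe study fragments.", [("image/png", b"\x89PNG")]),
    ("Conclusion", []),
]
markdown = build_markdown(sample_pages)
```

Inlining images this way keeps everything in a single text fragment, at the cost of base64's roughly one-third size overhead per image.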

simonw · 2025-05-04T21:51:44Z

This plugin can now be upgraded to pass those images as attachments, not as base64 encoded strings:

Allow fragment loader plugins to return a mixture of fragments and attachments #972
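As a rough sketch of what that upgrade could look like, a loader might return the text as one fragment and each image as a separate attachment instead of inlining base64 strings. The `TextFragment` and `ImageAttachment` classes below are hypothetical stand-ins so the example runs on its own; a real plugin would use LLM's own fragment and attachment types, whose exact constructors should be checked against the LLM plugin documentation.

```python
from dataclasses import dataclass


@dataclass
class TextFragment:
    """Hypothetical stand-in for a text fragment."""
    content: str
    source: str


@dataclass
class ImageAttachment:
    """Hypothetical stand-in for a binary image attachment."""
    content: bytes
    mime_type: str


def load_pdf_parts(source, pages):
    """Return one text fragment followed by one attachment per image.

    `pages` uses an assumed shape: a list of
    (text, [(mime_type, image_bytes), ...]) tuples, one per page.
    """
    text = "\n\n".join(page_text.strip() for page_text, _ in pages)
    results = [TextFragment(content=text, source=source)]
    for _, images in pages:
        for mime, data in images:
            # Images stay as raw bytes rather than base64 text.
            results.append(ImageAttachment(content=data, mime_type=mime))
    return results


# Stand-in data; a real loader would extract this from the PDF.
parts = load_pdf_parts(
    "paper.pdf",
    [("Intro", [("image/png", b"\x89PNG")]), ("Results", [])],
)
```

Returning a mixed list like this lets the model receive the images in their native form while the text remains an ordinary fragment.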

agustif added 2 commits April 25, 2025 20:54

adds lm studio in plugin directory

33f99f2

adds lm llm-plugin-pdf in plugin directory

bf311af

agustif changed the title ~~Docs/add plugin pdf~~ Add PDF fragment loader plugin to directory Apr 25, 2025
