The PptxReader class in the LlamaIndex library currently does not support WMF/EMF image formats, which leads to the error you're encountering. The error occurs because Pillow (PIL), which is used for image processing, can identify WMF/EMF files but only has a loader for them on Windows.
To address this issue, you can consider the following workarounds:
Convert WMF/EMF Images: Before processing the PowerPoint file with PptxReader, convert WMF/EMF images to a supported format like JPEG or PNG. This can be done using external tools or libraries that support these formats.
Modify the Code: If you have control over the code, you could modify the PptxReader class to skip WMF/EMF images or handle them differently, such as by logging a warning instead of attempting to process them.
Use a Different Library: Consider using a library that supports WMF/EMF formats for image extraction and processing, and integrate it with your current setup.
These approaches can help you bypass the current limitation until native support for WMF/EMF is potentially added to the library or its dependencies.
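As a starting point for any of these workarounds, it can help to know whether a presentation contains WMF/EMF images at all before handing it to PptxReader. The following is a minimal sketch, not part of LlamaIndex: the helper name `find_metafile_images` is hypothetical, and it assumes the images are embedded in the usual `ppt/media/` folder of the .pptx archive (linked images would not be found).

```python
import zipfile
from pathlib import PurePosixPath

# Extensions of the Windows metafile formats discussed above.
METAFILE_EXTENSIONS = {".wmf", ".emf"}


def find_metafile_images(pptx_path):
    """Return archive paths of WMF/EMF media embedded in a .pptx file.

    Hypothetical helper: a .pptx file is a ZIP archive, and embedded
    images are assumed to live under the ppt/media/ folder.
    """
    with zipfile.ZipFile(pptx_path) as archive:
        return sorted(
            name
            for name in archive.namelist()
            if name.startswith("ppt/media/")
            and PurePosixPath(name).suffix.lower() in METAFILE_EXTENSIONS
        )
```

If the returned list is non-empty, the file could be converted or set aside before calling `reader.load_data(...)`; an empty list suggests the file does not contain embedded WMF/EMF media.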
Bug Description
PptxReader can't handle WMF/EMF images embedded in a PowerPoint file.
fix/workaround by #17819
Version
0.12.17
Steps to Reproduce
from llama_index.readers.file.slides.base import PptxReader
reader = PptxReader()
reader.load_data("power_point_with_wmf.pptx")
Relevant Logs/Tracebacks