This utility converts the text content of web pages to PDF using LaTeX. The text content is extracted with rdrview, a utility that uses a port of Firefox's reader view functionality.
You can find the documentation here. See also the handout of my talk at the TUG 2024 conference.
Supported input formats:

- standalone HTML documents, both local and online
- Epub files
- WARC - Web Archive files

Dependencies:

- TeX distribution
- Rdrview
- Curl
- ImageMagick -- for conversion of GIF and WebP images
- CairoSVG -- for conversion of SVG images
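
Before the first run, it can help to confirm that the external tools listed above are on your `PATH`. The following is a minimal sketch, not part of this utility: the executable names (`lualatex`, `xelatex`, `pdflatex`, `rdrview`, `curl`, `magick`, `convert`, `cairosvg`) are assumptions about a typical installation, and the function name `missing_dependencies` is hypothetical.

```python
import shutil

# Assumed executable names for each dependency; adjust to your installation.
# A dependency counts as present if any one of its candidates is found.
REQUIRED_TOOLS = {
    "TeX distribution": ["lualatex", "xelatex", "pdflatex"],
    "Rdrview": ["rdrview"],
    "Curl": ["curl"],
    "ImageMagick": ["magick", "convert"],
    "CairoSVG": ["cairosvg"],
}


def missing_dependencies(required=REQUIRED_TOOLS, which=shutil.which):
    """Return the names of dependencies with no candidate executable on PATH."""
    return [
        name
        for name, candidates in required.items()
        if not any(which(candidate) for candidate in candidates)
    ]


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All dependencies found.")
```

The `which` parameter exists only so the lookup can be replaced in tests. Finding an executable says nothing about its version or about which TeX packages are installed.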