feat: add PDF split and analysis functionality #4

heyitsnoah · 2025-09-17T21:03:49Z

Summary

This PR adds comprehensive PDF processing capabilities to claudesidian, enabling users to split PDFs into individual page images and analyze them with Gemini Vision AI.

What's New

🖼️ PDF Splitting

pdf-split.sh: Convert PDFs into individual page images
pdf-to-images.mjs: Node.js script for high-quality PDF-to-PNG conversion
Creates organized folders with indexed markdown files for each PDF

🚀 Batch Processing

pdf-parallel-processor.sh: Process multiple PDFs concurrently
Optimized for performance with configurable parallel workers
Automatic progress tracking and statistics

🤖 AI Analysis

pdf-gemini-analyze command: Process PDF images with Gemini Vision
Three modes: extract (text only), analyze (full analysis), summarize (condensed)
Intelligent batching for optimal API usage
Parallel processing for multiple PDFs

Technical Details

Added pdf-to-img dependency for PDF processing
Scripts use bash and Node.js for cross-platform compatibility
All scripts follow existing code style and linting rules
Integrates seamlessly with Obsidian's attachment structure

Use Cases

Academic Research: Split research papers and textbooks for detailed analysis
Document Archival: Convert PDFs to images for better Obsidian integration
AI-Powered Analysis: Extract insights from visual documents
Batch Processing: Handle large document libraries efficiently

Testing

✅ Scripts copied from production vault where they're actively used
✅ Linting and formatting applied
✅ Dependency added to package.json

Example Usage

# Split a single PDF
.scripts/pdf-split.sh "05_Attachments/document.pdf"

# Process all PDFs in attachments folder
.scripts/pdf-parallel-processor.sh

# Analyze PDFs with Gemini Vision
/pdf-gemini-analyze all analyze

Files Changed

Added 3 new shell/Node.js scripts in .scripts/
Added 1 new command in .claude/commands/
Updated package.json with pdf-to-img dependency

🤖 Generated with Claude Code

- Add pdf-split.sh script to convert PDFs to individual page images - Add pdf-to-images.mjs Node.js script for PDF conversion - Add pdf-parallel-processor.sh for batch processing multiple PDFs - Add pdf-gemini-analyze command for AI-powered PDF analysis - Add pdf-to-img dependency for PDF processing - Enables splitting PDFs into images for better Obsidian integration

heyitsnoah · 2025-09-18T10:16:31Z

@claude please review

claude · 2025-09-18T10:16:41Z

Claude finished @heyitsnoah's task —— View job

✅ Code Review Complete

Todo List

Check for CLAUDE.md files for repo guidelines
Review new command files (.claude/commands/)
Review shell scripts (.scripts/)
Review Node.js script (pdf-to-images.mjs)
Check package.json changes
Review CHANGELOG.md updates
Provide comprehensive review feedback

Overall Assessment

This is a well-crafted PR that adds valuable PDF processing capabilities to claudesidian. The implementation is solid with good code quality, proper error handling, and excellent documentation. The CI checks are passing, and the code follows the repository's linting standards.

✨ Strengths

Excellent Documentation: Both command files are thoroughly documented with clear examples, use cases, and expected outputs
Performance Optimization: Parallel processing implementation with configurable workers shows attention to performance
User Experience: Clear progress indicators, error messages, and success summaries throughout
Integration: Seamless integration with Obsidian's markdown format and attachment structure
Code Quality: Clean, well-commented code that passes all linting checks

🔍 Code Quality Review

Shell Scripts (.scripts/)

pdf-split.sh ✅

Good error handling with set -e
Clear usage instructions and colored output for better UX
Proper path handling with quotes for spaces
Creates useful index markdown file for Obsidian integration

pdf-parallel-processor.sh ✅

Excellent parallel processing implementation using background jobs
Configurable worker count (defaults to 8)
Progress tracking and statistics
Handles large batches efficiently

Node.js Script

pdf-to-images.mjs ✅

Clean async/await implementation
Proper error handling with try/catch
Good CLI argument parsing
Configurable scale factor for quality control

Command Files (.claude/commands/)

Both command files are exceptionally well-documented with:

Clear processing phases
Intelligent batching strategies for API optimization
Progress tracking with TodoWrite
Error resilience
Structured output formats

⚠️ Potential Issues & Suggestions

Cross-Platform Compatibility

Line 87 in pdf-split.sh: realpath --relative-to may not work on macOS without GNU coreutils
Consider adding a fallback or checking for command availability

# Suggestion: Add compatibility check
if command -v grealpath &> /dev/null; then
    REALPATH_CMD="grealpath"
elif command -v realpath &> /dev/null; then
    REALPATH_CMD="realpath"
else
    # Fallback to relative path handling
fi

Resource Management
- The parallel processor defaults to 8 workers which might be aggressive for some systems
- Consider detecting CPU cores: MAX_PARALLEL=${1:-$(nproc 2>/dev/null || sysctl -n hw.ncpu 2>/dev/null || echo 4)}
Error Recovery
- In pdf-parallel-processor.sh, failed jobs aren't tracked properly (FAILED variable is never incremented)
- The process_pdf function returns 1 on failure but this isn't captured in the background job
Security Consideration
- File paths are properly quoted which is good
- Consider adding validation for PDF file format before processing
Memory Usage
- Large PDFs converted at scale 2.0 might consume significant memory
- Consider adding a warning or memory check for files over a certain size

📝 Minor Observations

CHANGELOG.md: The changes aren't documented in the unreleased section yet
Dependencies: pdf-to-img is a good choice - lightweight with minimal dependencies
Index Generation: Smart to generate markdown indexes for Obsidian navigation

🎯 Recommendations

Add the cross-platform compatibility check for realpath
Fix the FAILED counter in the parallel processor
Consider adding these features to CHANGELOG.md under [Unreleased]
Optional: Add file size warnings for large PDFs

✅ Verdict

This PR is ready to merge with the minor caveat about cross-platform compatibility. The code is well-written, properly documented, and adds valuable functionality to the project. The parallel processing implementation is particularly impressive.

Great work on this feature! The attention to detail in both implementation and documentation is excellent. 🚀

heyitsnoah added 3 commits September 17, 2025 16:43

fix: apply linting and formatting to PDF scripts

21943be

feat: add dedicated pdf-split command for easier PDF processing

d05f2d8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add PDF split and analysis functionality #4

feat: add PDF split and analysis functionality #4

heyitsnoah commented Sep 17, 2025

Uh oh!

heyitsnoah commented Sep 18, 2025

Uh oh!

claude bot commented Sep 18, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: add PDF split and analysis functionality #4

Are you sure you want to change the base?

feat: add PDF split and analysis functionality #4

Conversation

heyitsnoah commented Sep 17, 2025

Summary

What's New

🖼️ PDF Splitting

🚀 Batch Processing

🤖 AI Analysis

Technical Details

Use Cases

Testing

Example Usage

Files Changed

Uh oh!

heyitsnoah commented Sep 18, 2025

Uh oh!

claude bot commented Sep 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Code Review Complete

Todo List

Overall Assessment

✨ Strengths

🔍 Code Quality Review

Shell Scripts (.scripts/)

Node.js Script

Command Files (.claude/commands/)

⚠️ Potential Issues & Suggestions

📝 Minor Observations

🎯 Recommendations

✅ Verdict

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

claude bot commented Sep 18, 2025 •

edited

Loading