Skip to content

Add GoogleKCParser implementation with tests and CLI script #352

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 9 commits into
base: master
Choose a base branch
from

Conversation

dkallipo
Copy link

@dkallipo dkallipo commented Aug 8, 2025

This PR adds the complete implementation for the GoogleKCParser, including:

  • lib/google_kc_parser.rb: Main parser logic.
  • bin/run_parser.rb: CLI script to run the parser on a given HTML file.
  • test/test_parser.rb: Automated tests using Test::Unit with multiple scenarios.
  • Sample HTML input files and matching expected JSON output files.

Usage

Run parser from CLI:
ruby bin/run_parser.rb <html_file>

Run tests:
ruby test/test_parser.rb

Note: In expected JSON files, the order of results matters.

Tested on Ruby 2.6.5, should work on any Ruby 2.5+ environment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant