Skip to content

thatch/regexlint

Folders and files

NameName
Last commit message
Last commit date

Latest commit

6f38504 · Aug 4, 2022
Sep 16, 2020
Jun 30, 2022
Jun 30, 2022
Sep 16, 2020
Jun 10, 2014
Apr 14, 2014
Sep 16, 2020
Sep 13, 2020
Sep 6, 2020
Jun 30, 2022
Sep 16, 2020
Sep 6, 2020

Repository files navigation

Regexlint

Regexlint is intended to be run as a linter against an importable Pygments lexer, and flag problems that will prevent the lexer from performing well (since we don't have branch coverage for lexers :)

As a simple example, alternations are first match, so it will flag this as never matching "elseif":

(else|elseif)

It also understands some Pygments internals, for example bygroups(...)` needs to have the same number of args as the regex has capture groups. Too many will result in duplicate text; too few will result in missing text. There should also not be any gaps between the capture groups, so this example flags two problems:

(r'(foo)\s+(bar)', bygroups(Blah)),

Usage

make demo
    or
regexlint pygments.lexers.web:HtmlLexer
    or
python3 regexlint/cmdline.py pygments.lexers.web

Todo

  • Figure out which phase should remove unnecessary backslashes
  • Write the alternation expander, so that ([ax]|a[bc]) fails the alternation order checks
  • Make more general than just for Pygments

License

This project is licensed under the Apache Public License, see COPYING