Skip to content

Spanish-language AI pattern detection rules #92

@adelaidasofia

Description

@adelaidasofia

Context

I've been using humanizer heavily on bilingual (English/Spanish) content and built a Spanish-language rule library in my fork (adelaidasofia/humanizer) that I'd love to contribute upstream.

What it does

A full set of Spanish-specific AI writing pattern detections, parallel to the existing English rules:

  • Inflated connectors: "en este sentido", "cabe destacar", "es importante mencionar"
  • Formulaic openers: "En el mundo actual", "En la era de"
  • Promotional inflation: "innovador", "disruptivo", "revolucionario" used as filler
  • Spanish copula avoidance: "se erige como", "se posiciona como" (the Spanish equivalent of English AI's "serves as" / "stands as" pattern)
  • Spanglish/code-switching preservation: rules are aware of bilingual writing and don't flag natural code-switching as AI artifacts

The rule set is organized into the same four-tier structure (dead giveaways > reliable tells > moderate signals > weak signals) so it integrates cleanly with the existing priority system.

Why this matters

Spanish is the second most-spoken native language globally. Anyone using humanizer on Spanish content currently gets zero coverage. The rules are language-specific but the architecture (tiered pattern matching, ROI ordering) is the same.

The implementation also includes a fallback prompting approach for other Romance languages (Portuguese, French) that could serve as a template for future language additions.

Happy to send a PR with the full rule set.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions