Skip to content

Musawer1214/pashto-language-resources

license apache-2.0
language
ps
en
tags
pashto
pukhto
pushto
asr
tts
nlp
machine-translation
language-resources
low-resource-languages
speech-recognition

🌍 Pashto Language Resources Hub (Pukhto/Pashto)

Open-source Pashto language technology hub for datasets, models, benchmarks, ASR, TTS, NLP, and machine translation.

This project helps contributors find, verify, and improve Pashto AI resources in one place.

👋 New Here? Start in 2 Minutes

  1. Search technical resources: Pashto Resource Search
  2. Search papers/documentation: Pashto Papers Search
  3. Read beginner docs: docs/README.md

🔗 Quick Links

🎯 Popular Pages

🗂️ Repository Map (Simple)

  • resources/ verified external resources (dataset/model/benchmark/tool/paper/project/code)
  • data/ normalization seeds and dataset workflows
  • asr/ speech recognition notes and baselines
  • tts/ text-to-speech notes and baselines
  • benchmarks/ result schemas and evaluation templates
  • docs/ documentation, SEO, release, and operations guides

🔄 How Updates Work

Automatic (GitHub Actions)

  • Daily workflow (.github/workflows/resource_sync.yml) discovers candidates.
  • Valid non-duplicate entries are promoted into resources/catalog/resources.json.
  • Search data and README views are regenerated.

Manual (Maintainers/Contributors)

  • Run scripts locally to discover, validate, and regenerate outputs.
python -m venv .venv
. .venv/bin/activate
pip install -e ".[dev]"
python scripts/validate_resource_catalog.py
python scripts/generate_resource_views.py
python scripts/validate_repo_contracts.py --require-jsonschema
python scripts/audit_resource_pipeline.py
python scripts/check_links.py
python -m pytest -q

Local Setup

Use one supported local setup path from repo root:

python -m venv .venv
. .venv/bin/activate
pip install --upgrade pip
pip install -e ".[dev]"

Windows PowerShell:

python -m venv .venv
.venv\Scripts\Activate.ps1
python -m pip install --upgrade pip
python -m pip install -e ".[dev]"

🚀 Contributing

📈 SEO and Discoverability

🧾 Releases

Packages

 
 
 

Contributors

Languages