adrianwedd · adrianwedd · Jun 19, 2025
@@ -15,11 +15,14 @@
   ```
 
 ## Setup
+- Create a `.env` file exporting any required tokens such as
+  `GITHUB_TOKEN_REPO_STATS` and `API_KEY`.
 - Source `scripts/setup-env.sh` to validate your Python version, install system
   packages, and configure a virtual environment:
   ```bash
   source scripts/setup-env.sh
   ```
+  The script reads `.env` if present and warns when variables are missing.
 
 ## PR Guidelines
 - Separate large formatting-only commits from functional changes.
@@ -84,3 +87,6 @@ labels: [auto, codex]
 Run `python scripts/codex_task_runner.py --file codex_tasks.md` to validate and
 process tasks. The runner checks for duplicate IDs, missing fields, and invalid
 values before creating issues or printing summaries.
+Use `scripts/validate_tasks.py` to check `tasks.yml` against
+`schemas/task.schema.json` and `scripts/rank_tasks.py` to display tasks ordered
+by priority.
@@ -24,10 +24,9 @@ source scripts/setup-env.sh
 ```
 
 The script verifies Python **3.11** or newer, installs required system packages,
-creates a virtual environment with the pinned dependencies from
-`requirements.lock`, exports `PYTHONPATH`, and installs pre-commit hooks. It also
-loads any variables in a `.env` file or prompts for missing values such as API
-keys.
+creates a virtual environment with the dependencies from `requirements.txt`,
+exports `PYTHONPATH`, and installs pre-commit hooks. If a `.env` file exists it
+is sourced; otherwise missing token variables are only reported.
 
 ## Troubleshooting FAQ
 

@@ -0,0 +1,5 @@
+# End-to-End Smoke Test
+
+Run `scripts/e2e_test.sh` to execute a miniature pipeline using fixture data.
+The script performs enrichment, ranking, and README injection, writing results to
+a temporary directory. Inspect the printed path to review generated artifacts.
@@ -0,0 +1,11 @@
+# FunkyAF Demo
+
+Run `python scripts/funky_demo.py` for an interactive tour of the repository.
+
+Highlights:
+
+1. Progress bars visualize formatting checks and test execution.
+2. Docstrings from key pipeline functions scroll by to explain what each step does.
+3. Fixture validation runs with clear success/failure output.
+4. A miniature pipeline processes fixture repos and renders a metrics table showing repository count and average star count.
+5. Finally, the script invokes `scripts/e2e_test.sh` for a quick smoke test and prints the location of generated artifacts.
@@ -0,0 +1,17 @@
+# End-to-End Pipeline Validation for 0.1.1
+
+This note records an attempt to execute the full refresh pipeline using `scripts/refresh_category.py`.
+
+```bash
+python scripts/refresh_category.py Experimental --output temp_data
+```
+
+The command failed because the environment could not reach `api.github.com`:
+
+```
+Request error: Cannot connect to host api.github.com:443 ssl:default [Network is unreachable]; retrying in 1.0 seconds
+```
+
+This confirms that the pipeline requires network access to fetch repository data. Without internet access the refresh step cannot proceed.
+
+All other tests pass locally using fixture data.
@@ -3,7 +3,7 @@
 This epic captures the remaining work needed to stabilize the Agentic Index pipeline and tooling for the upcoming 0.1.1 release.
 
 ## 1. Finalize Pipeline Automation
-- **End‑to‑End Smoke Test** – Provide a single `scripts/e2e_test.sh` that chains scraping, enrichment, ranking and README injection using fixture data. The script should fail on any step error.
+- **End‑to‑End Smoke Test** – `scripts/e2e_test.sh` chains scraping, enrichment, ranking and README injection using fixture data. The script should fail on any step error.
 - **CI Integration** – Add a GitHub Actions job that runs the smoke test on pull requests.
 - **Rollback Steps** – Document how to revert pipeline state if a step corrupts the dataset.
 
@@ -21,3 +21,10 @@ This epic captures the remaining work needed to stabilize the Agentic Index pipe
 - The smoke test passes locally and in CI.
 - API endpoints reject malformed input with clear error messages.
 - Release notes fully describe new features and fixes.
+
+## 4. Additional Hardening
+- **Non-Interactive Setup** – Allow `scripts/setup-env.sh` to read tokens from a `.env` file so CI can run without prompts.
+- **Pipeline Rollback Guide** – Provide instructions for reverting data if a refresh introduces bad results.
+- **Fixture-Based E2E Test** – Use small fixture data in `scripts/e2e_test.sh` so the pipeline can be validated quickly.
+- **Validation Log** – See `../e2e_pipeline_validation.md` for the latest end-to-end refresh attempt and results.
+- **FunkyAF Demo** – A colorful script (`scripts/funky_demo.py`) walks developers through formatting, tests, fixture validation and a mini pipeline with rich progress bars and metrics tables.
@@ -16,7 +16,9 @@ This step translates observations and identified issues into a concrete, ranked
 
 3.  **Define Tasks**: For each selected task, clearly define its scope and objective. Each task should be "atomic" – small and focused enough to be completed within a reasonable timeframe by one person or a pair.
 
-4.  **Prioritize Tasks**: Assign a priority to each task (e.g., 1-5, with 1 being the highest). This helps determine the order of execution.
+4.  **Prioritize Tasks**: Assign a priority to each task (e.g., 1-5, with 1 being
+    the highest). This helps determine the order of execution. Run
+    `scripts/rank_tasks.py` to display tasks sorted by priority.
 
 5.  **Update `tasks.yml`**: Add the selected tasks to the `tasks.yml` file. Ensure each task entry includes the following fields, consistent with the existing structure:
     *   `id`: A unique integer for the task. Increment from the highest existing ID.

@@ -40,6 +40,7 @@ Accurate and up-to-date documentation is crucial for project maintainability and
           status: done # Updated status
         ```
     *   **Add Newly Discovered Follow-up Tasks**: If any new issues or necessary follow-up work were identified during the cycle (e.g., during "Validate" or "Execute"), add them to `tasks.yml` with an appropriate description, component, priority, and `todo` status. These will be considered in the next cycle's "Decide" step.
+    *   Run `scripts/validate_tasks.py tasks.yml` to verify the file matches the task schema.
 
 ## Output
 

@@ -0,0 +1,35 @@
+# Release 0.1.1 Code Review
+
+This report captures a manual inspection of the repository and a run of the automated tests in preparation for the 0.1.1 release.
+
+## Repository Overview
+- Repository provides CLI commands under `agentic_index_cli` and a nightly refresh workflow via `.github/workflows/update.yml`.
+- Pipeline scripts such as `scripts/trigger_refresh.sh` orchestrate scraping, enrichment, ranking, and README injection.
+
+## Testing Results
+- Formatting checks were executed:
+  - `black --check .` reported no changes.
+  - `isort --check-only .` reported no issues.
+- The full pytest suite succeeded after installing requirements. The tail of the log shows:
+
+```
+    json_file.write_text(json.dumps([r.dict() for r in req.repos]))
+
+-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
+243 passed, 15 skipped, 2 warnings in 23.88s
+```
+
+## Pipeline Validation
+- Reviewed `update.yml` workflow which installs dependencies, runs the CLI pipeline, and opens a refresh PR if data changes.
+- Examined `trigger_refresh.sh` helper and category refresh script for local runs.
+- Attempted to run `scripts/setup-env.sh` but interactive prompts for `GITHUB_TOKEN_REPO_STATS` and `API_KEY` prevented automated execution.
+
+## Recommendations
+- Provide a non-interactive mode for `setup-env.sh` or document `.env` usage to avoid prompts during CI.
+- Implement `scripts/e2e_test.sh` that chains scraping, enrichment, ranking, and README injection using fixture data.
+- Add a GitHub Actions job to run this smoke test on pull requests.
+- Document rollback steps if the pipeline corrupts data.
+- Ship a colorful `funky_demo.py` that guides users through formatting checks, tests, fixture validation and a mini pipeline run with rich progress indicators.
+
+## Conclusion
+The codebase is generally healthy and the test suite passes. Addressing the recommendations above will finalize the 0.1.1 release.
@@ -6,7 +6,7 @@ PyYAML
 pytest-socket
 responses
 rich
-typer[all]
+typer
 click
 jsonschema>=3.2
 pydantic>=2
@@ -17,7 +17,6 @@ pydeps
 sphinx
 sphinx_rtd_theme
 uvicorn
-httpx
 pytest-env
 aiohttp>=3.9.0
 structlog

@@ -0,0 +1,18 @@
+{
+  "$schema": "http://json-schema.org/draft-07/schema#",
+  "title": "Task",
+  "type": "object",
+  "required": ["id", "description", "component", "dependencies", "priority", "status"],
+  "properties": {
+    "id": {"oneOf": [{"type": "string"}, {"type": "integer"}]},
+    "description": {"type": "string"},
+    "component": {"type": "string"},
+    "dependencies": {
+      "type": "array",
+      "items": {"oneOf": [{"type": "string"}, {"type": "integer"}]}
+    },
+    "priority": {"type": "integer", "minimum": 1},
+    "status": {"type": "string", "enum": ["todo", "in-progress", "done"]}
+  },
+  "additionalProperties": false
+}
@@ -0,0 +1,30 @@
+#!/usr/bin/env bash
+set -euo pipefail
+
+# Simple end-to-end smoke test using fixture data
+ROOT="$(git rev-parse --show-toplevel)"
+cd "$ROOT"
+
+ARTIFACTS=$(mktemp -d e2e_demo_XXXX)
+DATA_DIR="$ARTIFACTS/data"
+mkdir -p "$DATA_DIR/by_category"
+
+# Seed with fixture repos
+cp tests/fixtures/data/repos.json "$DATA_DIR/repos.json"
+printf '[]' > "$DATA_DIR/last_snapshot.json"
+touch "$DATA_DIR/top100.md"
+printf '{}' > "$DATA_DIR/by_category/index.json"
+
+export PYTHONPATH="$ROOT"
+
+python -m agentic_index_cli.enricher "$DATA_DIR/repos.json"
+python -m agentic_index_cli.internal.rank_main "$DATA_DIR/repos.json"
+python -m agentic_index_cli.internal.inject_readme \
+  --force --top-n 5 --limit 5 \
+  --repos "$DATA_DIR/repos.json" \
+  --data "$DATA_DIR/top100.md" \
+  --snapshot "$DATA_DIR/last_snapshot.json" \
+  --index "$DATA_DIR/by_category/index.json" \
+  --readme "$ARTIFACTS/README.md"
+
+echo "E2E test complete. Artifacts in $ARTIFACTS"
@@ -0,0 +1,137 @@
+#!/usr/bin/env python3
+"""FunkyAF demonstration showcasing tests and pipeline.
+
+This script provides an over-the-top walkthrough of the repository's
+tooling.  It runs formatting checks, executes the test suite, validates
+fixtures, and spins up a miniature pipeline using fixture data.  Along
+the way it displays docstrings, progress bars, and a small metrics table
+rendered with :mod:`rich` for a deluxe terminal experience.
+"""
+
+from __future__ import annotations
+
+import json
+import os
+import shutil
+import subprocess
+import sys
+import tempfile
+from pathlib import Path
+
+from rich.console import Console
+from rich.markdown import Markdown
+from rich.panel import Panel
+from rich.progress import Progress
+from rich.table import Table
+
+console = Console()
+
+
+def run(cmd: list[str]) -> None:
+    """Execute *cmd* and stream output via :class:`rich.console.Console`."""
+
+    console.rule(f"[bold cyan]$ {' '.join(cmd)}")
+    proc = subprocess.run(cmd, capture_output=True, text=True)
+    if proc.stdout:
+        console.print(proc.stdout)
+    if proc.stderr:
+        console.print(proc.stderr, style="red")
+    if proc.returncode != 0:
+        console.print(f"command failed with {proc.returncode}", style="red")
+        sys.exit(proc.returncode)
+
+
+def show_doc(obj: object) -> None:
+    """Render a function's docstring in a pretty panel."""
+
+    doc = getattr(obj, "__doc__", "(no docstring)") or "(no docstring)"
+    console.print(Panel(Markdown(doc), title=f"{obj.__module__}.{obj.__name__}"))
+
+
+def main() -> None:
+    root = Path(__file__).resolve().parents[1]
+    os.chdir(root)
+
+    console.rule("[bold magenta]🎉 Welcome to the Agentic Index FunkyAF Demo!")
+    console.print(f"Working directory: {root}\n")
+
+    with Progress() as progress:
+        fmt_task = progress.add_task("Formatting", total=2)
+        progress.update(fmt_task, description="black")
+        run(["black", "--check", "."])
+        progress.advance(fmt_task)
+        progress.update(fmt_task, description="isort")
+        run(["isort", "--check-only", "."])
+        progress.advance(fmt_task)
+
+    with Progress() as progress:
+        test_task = progress.add_task("pytest", total=1)
+        run([sys.executable, "-m", "pytest", "-q"])
+        progress.advance(test_task)
+
+    with Progress() as progress:
+        val_task = progress.add_task("validate fixtures", total=2)
+        run([sys.executable, "scripts/validate_fixtures.py"])
+        progress.advance(val_task)
+        run([sys.executable, "scripts/validate_top100.py"])
+        progress.advance(val_task)
+
+    console.rule("🚀 Mini pipeline demo")
+    demo_dir = Path(tempfile.mkdtemp(prefix="demo_artifacts_"))
+    data_dir = demo_dir / "data"
+    data_dir.mkdir()
+    fixture = root / "tests" / "fixtures" / "data" / "repos.json"
+    shutil.copy(fixture, data_dir / "repos.json")
+    (data_dir / "last_snapshot.json").write_text("[]")
+    (data_dir / "top100.md").write_text("")
+    (data_dir / "by_category").mkdir()
+    (data_dir / "by_category" / "index.json").write_text("{}")
+
+    import agentic_index_cli.enricher as enricher
+    import agentic_index_cli.internal.inject_readme as inj
+    import agentic_index_cli.internal.rank_main as rank
+
+    show_doc(enricher.enrich)
+    show_doc(rank.main)
+    show_doc(inj.main)
+
+    inj.REPOS_PATH = data_dir / "repos.json"
+    inj.DATA_PATH = data_dir / "top100.md"
+    inj.SNAPSHOT = data_dir / "last_snapshot.json"
+    inj.BY_CAT_INDEX = data_dir / "by_category" / "index.json"
+    inj.README_PATH = demo_dir / "README.md"
+    inj.readme_utils.README_PATH = inj.README_PATH
+    inj.ROOT = demo_dir
+
+    with Progress() as progress:
+        e_task = progress.add_task("enrich", total=1)
+        enricher.enrich(inj.REPOS_PATH)
+        progress.advance(e_task)
+
+        r_task = progress.add_task("rank", total=1)
+        rank.main(str(inj.REPOS_PATH))
+        progress.advance(r_task)
+
+        i_task = progress.add_task("inject", total=1)
+        inj.main(force=True, top_n=5, limit=5)
+        progress.advance(i_task)
+
+    console.rule("README preview")
+    console.print(inj.README_PATH.read_text())
+
+    data = json.loads((data_dir / "repos.json").read_text())
+    table = Table(title="Fixture Metrics")
+    table.add_column("Repos", justify="right")
+    table.add_column("Average Stars", justify="right")
+    count = len(data["repos"])
+    avg_stars = sum(r["stargazers_count"] for r in data["repos"]) / count
+    table.add_row(str(count), f"{avg_stars:,.1f}")
+    console.print(table)
+    console.print(f"Demo artifacts stored in [bold]{demo_dir}[/]")
+
+    console.rule("smoke test")
+    run(["bash", "scripts/e2e_test.sh"])
+
+
+if __name__ == "__main__":
+    main()
@@ -0,0 +1,30 @@
+#!/usr/bin/env python3
+"""Print tasks sorted by priority."""
+
+from __future__ import annotations
+
+import sys
+from pathlib import Path
+
+sys.path.append(str(Path(__file__).resolve().parent))
+from validate_tasks import validate_tasks
+
+
+def rank_tasks(path: str) -> list[dict]:
+    tasks = validate_tasks(path)
+    return sorted(tasks, key=lambda t: (t.get("priority", 0), str(t.get("id"))))
+
+
+def main(argv: list[str] | None = None) -> int:
+    argv = argv or sys.argv[1:]
+    path = argv[0] if argv else "tasks.yml"
+    for task in rank_tasks(path):
+        pid = task.get("priority")
+        tid = task.get("id")
+        desc = task.get("description", "")
+        print(f"{pid:>2} {tid}: {desc}")
+    return 0
+
+
+if __name__ == "__main__":
+    raise SystemExit(main())