Skip to content

Search URLs should match upstream site shapes #14

Description

@hqhq1025

Problem

Several WebHarbor mirror sites expose generic local /search?q=... URLs from their own search UI even when the upstream sites use more specific URL shapes.

Examples:

  • Amazon upstream uses /s?k=<query>.
  • Booking upstream uses /searchresults.html?ss=<query>.
  • Google Maps upstream uses /maps/search/<query>.
  • ESPN upstream uses /search/_/q/<query>.
  • Apple upstream uses /search/<query>.

The current generic URLs still work for local benchmark tasks, but they reduce realism when agents inspect, copy, or navigate search URLs.

Expected

The site UI should emit realistic upstream-shaped search URLs as the canonical path, while old /search?q=... routes remain available as backward-compatible aliases for existing tasks and trajectories.

Proposed Fix

  • Add canonical search routes for sites with known upstream URL forms.
  • Update site-owned search forms to emit canonical URL shapes.
  • Preserve existing /search?q=... compatibility routes.
  • Add documentation and a regression check for canonical vs legacy search URL policy.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions