Skip to content

Plots/stats/analyses for scaling dynamics #33

@athewsey

Description

@athewsey

AFAIK the plots and stats in LLMeter today all treat each Run as a bucket of IID data points to calculate statistics over - ignoring trends/dynamics during the run.

We're interested for LLMeter to provide more tools for analyzing auto-scaling responses over time to step increases in demand. For example:

  1. Were there significant trends/shifts in TTFT or TTLT/TPOT or error rate over time during the Run, or was it homogeneous?
  2. If so, how long did latency/error rate take to "stabilize", and could we characterize stats for that steady-state excluding the scale-up period?

Open to suggestions on good/practical ways to approach this, but it seems like a gap in the current tooling.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions