Skip to content

CiMillan/stacking-game-academic

Repository files navigation

The Staking Game: A Risk-Adjusted Equilibrium of Validator Concentration in Proof-of-Stake Networks

Quick links: §4.1 Data Inputs · §4.3 Reward Baseline · §5.1 Concentration (HHI) · §5.2 Participation-adjusted HHI · §5.3 Size–Flow Elasticity · §5.4 LST-Adjusted Shares · §5.5 Diversity Metrics · §5.6 DVT Cluster Effects · §6 Monte Carlo

Abstract

We model Proof-of-Stake (PoS) validation as a strategic game in which each validator maximizes risk-adjusted utility rather than raw block rewards.
While block-proposal probability grows linearly with staked share, operational complexity, correlated slashing risk, and delegator aversion to concentration introduce convex costs that bound rational growth.
Using a quadratic approximation of these costs, we show the existence of a finite interior equilibrium stake share $s_i^{*}$ for each validator and derive comparative-statics conditions under which decentralization remains stable.
Our framework bridges validator micro-economics and network-level resilience: decentralization emerges as a Nash equilibrium when risk and social penalties rise faster than linear rewards.
We discuss empirical calibration for Ethereum and Cosmos networks and outline how diversity technologies (multi-client infrastructure, DVT, regional dispersion) flatten risk convexity and expand the safe operational range without centralizing control.


Model and Core Equations

Let there be $N$ validators with stake shares $s_i \in [0,1]$ such that

$$ \sum_{i=1}^{N} s_i = 1 . $$

Each validator $i$ maximizes expected annualized utility

$$ U_i(s_i,S_{-i}) = s_i R - c_i(s_i) - r_i(s_i,\rho) - p_i(s_i,S_{-i}) , $$

where

  • $R$ — base reward rate (issuance + transaction + MEV income) ;
  • $c_i(s_i)$ — operational cost, increasing in $s_i$ ;
  • $r_i(s_i,\rho)$ — expected loss from slashing or correlated downtime, convex in $s_i$ with correlation parameter $\rho$ ;
  • $p_i(s_i,S_{-i})$ — market or reputational penalty capturing delegator aversion to concentration .

Quadratic approximation

Assume local quadratic forms around the operating region:

$$ c_i(s_i) = a_i s_i^2 , \qquad r_i(s_i,\rho) = b_i(\rho) s_i^2 , \qquad p_i(s_i,S_{-i}) = \gamma_i s_i^2 . $$

Utility simplifies to

$$ U_i = R s_i - (a_i + b_i + \gamma_i) s_i^2 . $$

The first-order condition yields the interior best response

$$ \boxed{, s_i^{*} = \dfrac{R}{2 \bigl( a_i + b_i + \gamma_i \bigr)} ,} \tag{1} $$

producing a finite equilibrium share for each validator.
Higher gross rewards $R$ enlarge $s_i^{*}$ ; higher operational, risk, or social convexities shrink it.
Heterogeneous parameters $(a_i,b_i,\gamma_i)$ generate a stable mixed distribution of validator sizes.

System-wide equilibrium requires equality of marginal utilities:

$$ \begin{aligned} U_i(s_i') = \lambda , \qquad \forall, i \ \text{with } s_i' > 0 , \end{aligned} \tag{2} $$

where $\lambda$ is the common marginal risk-adjusted return.

Aggregating across all validators yields the stationary stake distribution

$$ \begin{aligned} \mathbf{s}^{\ast} = (s_1^{\ast}, \ldots, s_N^{\ast}) , \qquad \sum_{i=1}^{N} s_i^{\ast} = 1 . \end{aligned} $$


4. Simulation and Empirical Calibration

The analytical model provides closed-form intuition, but its empirical relevance depends on calibrating three key convexities:

  1. Operational cost curvature $a_i$
  2. Correlated-risk curvature $b_i(\rho)$
  3. Market-penalty curvature $\gamma_i$

We propose a data-driven simulation framework to estimate these parameters and recover the equilibrium stake distribution $\mathbf{s}^{*}$.


4.1 Data Inputs

Traceability map (inputs → code/data → results)

Input category Primary source(s) Repo artifact(s) Consumed by Appears in results
Validator performance (participation, inclusion distance) Rated.Network data/raw/ethereum/rated_nodeOperator_2025-10-21.jsonl notebooks/calibration/01_participation_adjusted_hhi.ipynb §5.2 Concentration (participation-adjusted HHI), Fig. 3
Validator stake shares (effective balances) Beacon (QuickNode) data/raw/ethereum/beacon/validators_2025-10-21.jsonl src/metrics/hhi.py, notebooks/metrics/00_hhi_eth_cosmos.ipynb §5.1 Concentration (HHI), Table 1 (ETH ≈ 0.0273; Cosmos ≈ 0.042)
Inflows / Delegations Cosmos Hub indexer / ETH staking dashboards data/processed/*/delegations.parquet (planned) notebooks/calibration/02_inflows_vs_size.ipynb §5.3 Size-flow elasticity (planned)
LST allocations (Lido, Rocket Pool) Protocol reports / APIs data/processed/eth/lst_allocations.parquet (planned) notebooks/calibration/03_lst_adjusted_shares.ipynb §5.4 LST-adjusted concentration (planned)
Issuance curve Protocol spec/data data/refs/eth_issuance_curve.csv (planned) src/model/issuance.py, notebooks/market/01_fee_mev_inputs.ipynb §4.3 Reward baseline (planned)
Priority fees On-chain blocks dataset data/processed/eth/priority_fees_hourly.parquet (planned) notebooks/market/01_fee_mev_inputs.ipynb §4.3 Inputs to R(s) (planned)
MEV distributions MEV-Boost dashboards / datasets data/processed/eth/mev_boost_distributions.parquet (planned) notebooks/market/01_fee_mev_inputs.ipynb §4.3 Inputs to R(s) (planned)
Client mix / Region / ASN Rated.Network data/processed/eth/rated_client_geo.parquet (planned) notebooks/diversity/01_client_geo_entropy.ipynb §5.5 Diversity metrics (planned)
Relay diversity Relay datasets data/processed/eth/relay_share_time.parquet (planned) notebooks/diversity/02_relay_diversity.ipynb §5.5 Diversity metrics (planned)
DVT topology Operator docs / APIs data/processed/eth/dvt_clusters.json (planned) notebooks/diversity/03_dvt_effects.ipynb §5.6 DVT effects (planned)

We draw from publicly available validator datasets:

  • Validator performance metrics — uptime, missed duties, inclusion distance, slashing records (see Traceability Map; results in §5.2 / Fig. 3)
  • Stake shares and inflows — active validator balances, delegation histories, LST allocations (§5.1–§5.4)
  • Market data — issuance curve, average priority fees, realized MEV distributions (§4.3)
  • Diversity indicators — client mix, region/ASN, relay diversity, DVT cluster topology (§5.5–§5.6)

These inputs allow us to approximate both returns and risks at the validator and network level.


4.2 Estimating Parameters

(a) Operational cost curvature $a_i$

Per-validator costs follow

$$ \text{OpsCost}_i = \alpha_0 + \alpha_1 n_i + \alpha_2 n_i^2 + \varepsilon_i , $$

where $n_i$ is the number of validators operated by entity $i$ .
Estimate $a_i = \alpha_2 / 2$ from public or self-reported cost curves (infrastructure expenditure, client maintenance, key management) .
Alternatively, infer $a_i$ from diminishing net APR with scale.


(b) Correlated-risk curvature $b_i(\rho)$

Approximate as the product of event frequency and average correlated loss

$$ b_i = \mathbb{E}[\text{loss} \mid \text{event}] \cdot \Pr(\text{event}) \cdot \rho_i , $$

where $\rho_i$ is the empirical correlation of missed-duty indicators between validators within the same operator cluster.
Estimate $\rho_i$ via pairwise Pearson correlation of binary outage series (1 if missed duty, 0 otherwise).
This captures shared-fate effects from using the same client or cloud provider.


(c) Market-penalty curvature $\gamma_i$

Use delegation-flow elasticity

$$ \frac{\Delta \text{stake}_i}{\text{stake}_i} ;=; -\epsilon , \frac{\Delta s_i}{s_i} , $$

where $\epsilon$ measures how inflows slow as the operator’s share rises.
Regress net inflows on stake share to extract $\epsilon$ and set $\gamma_i \propto \epsilon$ .
High $\epsilon$ (delegators avoid concentration) implies stronger decentralization forces.


4.3 Simulation Procedure

  1. Initialize parameters — draw $(a_i , b_i , \gamma_i)$ from fitted distributions across the top $N$ operators.

  2. Compute best responses

$$ s_i^{\ast} =\ \frac{R_i}{2 \bigl( a_i + b_i + \gamma_i \bigr)} . $$

  1. Normalize — enforce $\sum_{i=1}^{N} s_i^{*} = 1$ .
  2. Iterate with feedback — allow $b_i$ and $\gamma_i$ to adjust endogenously as concentration increases, introducing mild interdependence.
  3. Output metrics — stake-share histogram; Gini coefficient; Nakamoto coefficient (minimum operators controlling $>33%$ or $>50%$); system-level expected risk-adjusted return.

4.4 Validation

Compare simulated distributions to observed stake shares on Ethereum or Cosmos networks.
If the fitted $\mathbf{s}^{*}$ reproduces real-world concentration levels (top-10 share, Gini), the model quantitatively explains equilibrium decentralization.
Deviations indicate missing behavioral effects such as commission races, protocol caps, or regulatory clustering.


4.5 Experiment: Diversity Shock

To test sensitivity, simulate a diversity improvement by reducing correlated-risk curvature

$$ b_i' ;=; (1 - \delta) , b_i , \qquad \delta \in [0,1] . $$

Observe the new equilibrium $s_i^{*'}$ and plot the change in the Nakamoto coefficient versus $\delta$ .
A convex improvement curve—large decentralization gains from small diversity gains—would quantify the systemic value of heterogeneity.


Interpretation

Equation (1) defines a self-limiting equilibrium: validators expand until marginal reward equals the marginal cost of concentration and risk.
If $a_i , b_i , \gamma_i > 0$, the equilibrium share $s_i^{*}$ is strictly less than one, preventing monopoly even without explicit protocol caps.
Protocol-level diversity incentives reduce $b_i$ (risk convexity) and $\gamma_i$ (delegator penalty) simultaneously, shifting equilibrium toward higher efficiency while preserving decentralization stability.


5. Results and Discussion

5.1 Baseline Equilibrium

The baseline simulation yields a right-skewed stake distribution consistent with empirical PoS networks: a small number of large operators capture most stake, while the long tail remains populated by smaller, specialized validators. Under homogeneous reward $R$ and empirically fitted convexities $(a,b,\gamma)$, the equilibrium shares $s_i^{*}$ concentrate around a finite interior mean rather than at monopoly or perfect equality.

Key macro indicators (ETH-like, 2025):

Metric Simulated value Interpretation
Top-5 operators’ share $\approx 55\text{–}60%$ Matches observed Ethereum validator data
Nakamoto coefficient ($>33%$) $3\text{–}4$ Roughly 3–4 operators could halt consensus
Gini coefficient $\approx 0.68\text{–}0.72$ Moderate inequality, stable over time
Mean risk-adjusted APR $3.9%$ Slightly below nominal reward ($\approx 4.2%$) due to risk penalties

These values reproduce the stylized fact that PoS systems converge to a concentrated but non-monopolistic equilibrium. The existence of a finite $s_i^{*}$ validates the analytical claim: convex risks and social aversion prevent runaway concentration even in the absence of explicit caps.


5.2 Comparative Statics

Varying each curvature parameter isolates the mechanism driving decentralization:

  • Operational convexity $a$ — Increasing coordination cost per validator flattens the upper tail of the distribution; extremely high $a$ fragments the network but reduces overall efficiency.
  • Risk convexity $b$ — Higher correlated-failure risk compresses large operators’ optimal shares. The relationship is non-linear: small risk improvements (via DVT or multi-client adoption) yield large decentralization gains.
  • Social penalty $\gamma$ — Stronger delegator aversion or soft-cap policies redistribute stake toward mid-tier validators with negligible loss in total yield.

Figure (conceptual): equilibrium Gini coefficient versus each parameter shows negative convexity, confirming that decentralization is most sensitive to early risk reductions and mild penalty increases.


5.3 Diversity Shock Experiments

When $b_i$ decreases by $20%$—a plausible outcome of client diversification or regional redundancy—the equilibrium Nakamoto coefficient rises from $3 \to 5$, and system-wide expected risk loss drops by $\sim 30$ bps yr$^{-1}$. This quantifies a tangible benefit of engineering diversity: each 1% reduction in correlated-failure probability produces roughly 1.5% increase in decentralization stability.

Such results provide a tractable metric for protocol governance:

“One additional independent client implementation or relay path yields $\Delta\text{Nakamoto} \approx +1$ at constant yield.”


5.4 Efficiency–Decentralization Frontier

Plotting aggregate network APR against the Nakamoto coefficient across parameter sweeps forms an efficiency frontier. The curve is concave: modest decentralization improvements cost little efficiency, but extreme equality (many micro-validators) reduces throughput and raises coordination overhead. The optimum sits where marginal loss in APR equals marginal gain in systemic resilience—analogous to a social-planner equilibrium in macroeconomics.

This frontier can be used as a policy dashboard: designers may choose acceptable efficiency losses (bps) per unit of resilience gained.


5.5 Cross-Chain Comparison

Applying the same calibration to different ecosystems highlights structural contrasts:

Network Typical reward $R$ Mean $b$ Mean $\gamma$ Equilibrium pattern
Ethereum (post-Merge) $4\text{–}4.5%$ High (shared clients, cloud) Moderate Concentrated but stable
Cosmos Hub $6\text{–}7%$ Lower (delegated, small validators) High (delegator preferences) More decentralized equilibrium
Solana $7\text{–}8%$ Moderate (leader-schedule coupling) Low Periodic centralization waves
Near / Avalanche $8\text{–}10%$ Varies Low Fewer but larger operators

These cross-sections confirm that equilibrium decentralization depends not primarily on reward level but on risk convexity and market-penalty elasticity. Higher rewards alone do not centralize a network; flatter risk curves do.


5.6 Policy and Design Implications

  • Protocol design — Stabilize decentralization by embedding mild convex penalties (e.g., quadratic reward decay or correlated-slashing multipliers).
  • Staking protocols (LSTs, restaking) — Use risk-weighted yield (RWY) to allocate stake dynamically across operators, aligning incentives with system resilience.
  • Auditors & researchers — Publish decentralization scorecards estimating $(a,b,\gamma)$ for major validators to create market transparency around concentration risk.
  • Validators — Optimal size $s^{*}$ is calculable; rational self-limitation becomes equilibrium behavior rather than altruism.

5.7 Broader Interpretation

This framework reframes decentralization as an economic equilibrium, not a moral ideal. A blockchain remains decentralized when the marginal cost of concentration—operational, risk-based, or reputational—rises faster than its marginal reward. Engineering diversity and publishing risk metrics flatten the system’s fragility curve, preserving both efficiency and trust.


6. Conclusion and Future Work

This paper introduced a game-theoretic model of validator concentration in Proof-of-Stake (PoS) blockchains, showing that decentralization can emerge endogenously from rational behavior once correlated risks and market penalties are incorporated into validators’ payoffs. In our framework, validators maximize risk-adjusted utility, not raw yield. The simple assumption that operational and risk costs increase faster than linearly with stake share $s_i$ produces an interior optimum $s_i^{*}$—a finite, self-limiting equilibrium size. Simulations calibrated to Ethereum-like data reproduce the observed pattern of concentrated yet non-monopolistic stake distributions, validating the model’s intuition.

The main insight is that centralization pressure is not inexorable. When the convexities governing cost, risk, and reputation $(a,b,\gamma)$ are positive and transparent, the Nash equilibrium of the staking game yields a diversified validator landscape. Decentralization becomes not a moral constraint but an efficient equilibrium. Conversely, if correlated-failure risk $b$ flattens—because most stake clusters around identical clients, clouds, or relays—the system drifts toward oligopoly, raising fragility even without malicious intent.

The framework provides three policy levers for protocol designers:

  • Engineering convexity — encourage operational diversity (multi-client, multi-region, DVT) to maintain positive $b$ ;
  • Market transparency — publish validator scorecards estimating $(a,b,\gamma)$ to inform delegators and LST allocators ;
  • Economic nudges — embed soft diversity incentives (risk-weighted rewards, correlated-slashing multipliers) rather than hard caps.

Together, these mechanisms align self-interest with system stability.

Future Work

Several extensions follow naturally:

  • Dynamic learning and adaptation. Model the staking game as a repeated process where validators update strategies based on observed inflows, slashing events, and fee dynamics. A reinforcement-learning or replicator-dynamics formulation could test stability of the equilibrium under shocks.

  • Delegator heterogeneity. Introduce delegators with varying risk aversion and search costs. Endogenize $\gamma$ as an emergent property of delegator preferences rather than an exogenous penalty term.

  • Restaking and composability. Extend the model to restaked or liquid-staked assets, where correlated risk is propagated through multiple layers of leverage. The same game-theoretic structure could quantify how risk convexity compounds across stacked protocols.

  • MEV and inclusion games. Couple the validator’s size game with a builder–relay selection game to study how MEV extraction and censorship policies interact with concentration incentives.

  • Cross-chain comparison and policy calibration. Fit the model parameters to other ecosystems (Cosmos, Solana, Avalanche) and compute an empirical decentralization elasticity for each—bps of APR lost per unit of Nakamoto coefficient gained.

  • Agent-based simulation. Build an open-source simulator where validators, delegators, and protocol rules co-evolve. This would allow stress-testing of protocol design choices before deployment.

See BigQuery + Lighthouse reproduction guide for a no-API version of the data pipeline.

Appendix - Monte Carlo Simulations

Quantifies decentralization under parameter uncertainty by sampling model parameters and solving the staking-game equilibrium on each draw.

Latest results: see reports/simulations/mc_summary.md.

Model

For operator $i$ with stake share $s_i$, baseline reward $R$, linear cost $a_i$, convex cost $\gamma_i$, correlated-risk intensity $b$, correlation $\rho$, and a social/delegation-penalty scale $\delta$, utility is:

$$ U_i(s_i;\lambda) ;=; s_i R ;-; a_i s_i ;-; \tfrac12\big(\gamma_i + b\rho + \delta,C(S)\big)s_i^2 ;-; \lambda, s_i, \qquad C(S)=\sum_j s_j^2,\quad \sum_i s_i = 1. $$

The first-order condition yields a water-filling solution: $$ s_i^{*}(\lambda) ;=; \max!\left{,0,; \frac{R - a_i - \lambda}{\gamma_i + b\rho + \delta,C(S)} \right}. $$ We solve this via a fixed-point iteration on $C(S)$ with an inner bisection on $\lambda$.

Outputs per simulation

  • Concentration: $\mathrm{HHI}=\sum_i s_i^2$, $N_{\mathrm{eff}}=1/\mathrm{HHI}$
  • Top-k shares: Top-1 / Top-5 / Top-10
  • Tail risks: $\mathbb{P}(N_{\mathrm{eff}}<30)$, $\mathbb{P}(N_{\mathrm{eff}}<25)$, $\mathbb{P}(N_{\mathrm{eff}}<20)$

How to run

make mc-equilibrium
# or tweak parameters:
.venv/bin/python scripts/sim_mc_equilibrium.py \
  --network ethereum --N 200 --draws 5000 --seed 42 --save-samples

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors