From: Jérôme Benoit <jerome.benoit@piment-noir.org>
Date: Fri, 14 Nov 2025 16:46:56 +0000 (+0100)
Subject: docs(reforcexy): refine README
X-Git-Url: https://git.piment-noir.org/?a=commitdiff_plain;h=f2d20320ecef33ac08bb5e001eedb71ab58ffaba;p=freqai-strategies.git

docs(reforcexy): refine README

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
---

diff --git a/ReforceXY/reward_space_analysis/README.md b/ReforceXY/reward_space_analysis/README.md
index 3ad0397..901514f 100644
--- a/ReforceXY/reward_space_analysis/README.md
+++ b/ReforceXY/reward_space_analysis/README.md
@@ -433,9 +433,9 @@ done
 
 Combine with other overrides cautiously; use distinct `out_dir` per configuration.
 
-### PBRS Rationale
+### PBRS Configuration
 
-Canonical mode seeks near zero-sum shaping (Î¦ terminal â 0) ensuring invariance: reward differences reflect environment performance, not potential leakage. Non-canonical modes or additives (entry/exit) trade strict invariance for potential extra signal shaping. Progressive release & spike cancel adjust temporal release of Î¦. Choose canonical for theory alignment; use non-canonical or additives only when empirical gain outweighs invariance guarantees. Symbol Î¦ denotes potential. See invariance condition and drift correction mechanics under PBRS section.
+Canonical mode enforces zero-sum shaping (Î¦ terminal â 0) for theoretical invariance. Non-canonical modes or additives modify this behavior. Choose canonical for standard PBRS compliance; use non-canonical when specific shaping behavior is required.
 
 ### Real Data Comparison