Piment Noir Git Repositories - freqai-strategies.git/commit
chore(reforcexy): add dynamic potential based rewards (#6)
author Jérôme Benoit <jerome.benoit@piment-noir.org>
Mon, 13 Oct 2025 21:39:10 +0000 (23:39 +0200)
committer GitHub <noreply@github.com>
Mon, 13 Oct 2025 21:39:10 +0000 (23:39 +0200)
commit 3f01976437bfda6a4cda2367cd0983ef060e0bdf
tree edb830b7b864e7ab13d49882d1683add90e4a80b
parent 223d33a5eab4bed40529f47cbae7d45115312a92

* chore(reforcexy): add dynamic potential based rewards

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* refactor(reforcexy): align tunables namespace

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* refactor(reforcexy): factor out pnl_target validation

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* refactor(reforcexy): cleanup PBRS integration

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* refactor(reforcexy): factor out default idle duration multiplier

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* chore: cleanup variable initialization

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* refactor(reforcexy): factor out PBRS next state definition

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* chore: revert copilot incorrect suggestions

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* refactor(reforcexy): PBRS integration code cleanups

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* refactor(reforcexy): handle overflow

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* refactor(reforcexy): avoid code duplication

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* refactor(reforcexy): add PBRS instrumentation

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* refactor(reforcexy): factor out PBRS signal computation

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* refactor(reforcexy): add check to PBRS logic

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* chore: sync PBRS in RSA

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* refactor(reforcexy): cleanup PBRS integration in RSA

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* Apply suggestion from @Copilot

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* docs: add PBRS

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
* fix(reforcexy): ensure model gamma is transmitted to env

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
---------

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
ReforceXY/reward_space_analysis/README.md
ReforceXY/reward_space_analysis/reward_space_analysis.py
ReforceXY/reward_space_analysis/test_reward_space_analysis.py
ReforceXY/user_data/freqaimodels/ReforceXY.py
quickadapter/user_data/strategies/Utils.py