[RLlib] - Enable multi-learner setup for hybrid stack BC #46436

simonsays1980 · 2024-07-04T14:56:48Z

Why are these changes needed?

This PR does fix a bug in hybrid stack BC which was hindering multi-learner setups for offline RL.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

…i-learner setups in BC impossible on the hybrid stack. Signed-off-by: simonsays1980 <[email protected]>

sven1977

LGTM! Thanks for this fix @simonsays1980 .

… and added logic in BC to add the used module ID if existent. Signed-off-by: simonsays1980 <[email protected]>

sven1977 · 2024-07-08T12:44:20Z

rllib/algorithms/bc/bc.py

+            train_results = self.learner_group.update_from_batch(
+                batch=train_batch.as_multi_agent(
+                    module_id=list(self.config.policies)[0]
+                    if self.config.policies


self.config.policies should always be available and you should always be able to iterate through it. So I don't think, we need this if-check here.

sven1977 · 2024-07-08T12:44:29Z

rllib/examples/offline_rl/pretrain_bc_single_agent_evaluate_as_multi_agent.py

@@ -147,6 +147,7 @@
        TRAINING_ITERATION: args.stop_iters,
    }

+    args.local_mode = True


Good catch!

sven1977 · 2024-07-08T12:44:42Z

rllib/policy/sample_batch.py

@@ -907,14 +908,20 @@ def get(self, key, default=None):
            return default

    @PublicAPI
-    def as_multi_agent(self) -> "MultiAgentBatch":
-        """Returns the respective MultiAgentBatch using DEFAULT_POLICY_ID.
+    def as_multi_agent(self, module_id: Optional[ModuleID] = None) -> "MultiAgentBatch":


…able and removed local mode from example which is a relict from debugging. Signed-off-by: simonsays1980 <[email protected]>

Added conversion to multi-agent batch which was missing and made mult…

5754c18

…i-learner setups in BC impossible on the hybrid stack. Signed-off-by: simonsays1980 <[email protected]>

simonsays1980 self-assigned this Jul 4, 2024

simonsays1980 added rllib RLlib related issues rllib-offline-rl Offline RL problems rllib-oldstack-cleanup Issues related to cleaning up classes, utilities on the old API stack labels Jul 4, 2024

sven1977 approved these changes Jul 4, 2024

View reviewed changes

sven1977 marked this pull request as ready for review July 4, 2024 15:43

sven1977 requested a review from ArturNiederfahrenhorst as a code owner July 4, 2024 15:43

sven1977 enabled auto-merge (squash) July 4, 2024 15:43

github-actions bot added the go add ONLY when ready to merge, run all tests label Jul 4, 2024

simonsays1980 added 2 commits July 8, 2024 13:45

Merge branch 'master' into enable-multi-learner-for-bc-hybrid-stack

008b9be

Added an optional 'ModuleID' to the 'as_multi_agent' batch conversion…

734682b

… and added logic in BC to add the used module ID if existent. Signed-off-by: simonsays1980 <[email protected]>

github-actions bot disabled auto-merge July 8, 2024 12:06

sven1977 reviewed Jul 8, 2024

View reviewed changes

Removed if-check as 'AlgorithmConfig.policies' should always be avail…

332078b

…able and removed local mode from example which is a relict from debugging. Signed-off-by: simonsays1980 <[email protected]>

sven1977 merged commit 14a0be7 into ray-project:master Jul 9, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] - Enable multi-learner setup for hybrid stack BC #46436

[RLlib] - Enable multi-learner setup for hybrid stack BC #46436

simonsays1980 commented Jul 4, 2024 •

edited

Loading

sven1977 left a comment

sven1977 Jul 8, 2024

sven1977 Jul 8, 2024

simonsays1980 Jul 8, 2024

sven1977 Jul 8, 2024

[RLlib] - Enable multi-learner setup for hybrid stack BC #46436

[RLlib] - Enable multi-learner setup for hybrid stack BC #46436

Conversation

simonsays1980 commented Jul 4, 2024 • edited Loading

Why are these changes needed?

Related issue number

Checks

sven1977 left a comment

Choose a reason for hiding this comment

sven1977 Jul 8, 2024

Choose a reason for hiding this comment

sven1977 Jul 8, 2024

Choose a reason for hiding this comment

simonsays1980 Jul 8, 2024

Choose a reason for hiding this comment

sven1977 Jul 8, 2024

Choose a reason for hiding this comment

simonsays1980 commented Jul 4, 2024 •

edited

Loading