Parallel RL by QueensGambit · Pull Request #225 · QueensGambit/CrazyAra

QueensGambit · 2026-01-09T13:21:32Z

This PR allows generating multiple games on a single GPU.
You need to set the variable Number_Parallel_Games (default: 8) to configure how many games you want to run in parallel. No change of Batch_Size is necessary, as it automatically increases the usual Batch_Size by times Number_Parallel_Games, i.e., times 8.
The individual parallel games are exported into separate files and later merged into a single file via Python.

The rlSettings->numberChunks is divided by searchSettings->numberParallelGames for each TrainDataExporter object. On the GPU, it uses a queue and a future.get() setup to handle the requests.

Generating one RL package on an Nvidia A100 takes 3 hours and 15 minutes by default. When using a parallelization of 8, it takes 1 hour and 15 minutes, resulting in an approximate threefold speed up.

Note, the parallel RL loop is not compatible with the Mixture of Experts (MoE) setup.
It also includes a small version of the A0 resnet.

Add option "Number Parallel Games" Add ->get_local_batch_size() Make NeuralNetAPIUser a member

Rename id to agentID

Make some variables to *

parallel_rl

+ make condition variable as MCTSAgent member

with searchSettings->batchSize

…arallel_rl

Use get_default_model()

Remove mxnet code

QueensGambit added 30 commits November 22, 2025 13:36

Start with parallel RL

a1e8ba0

Add option "Number Parallel Games" Add ->get_local_batch_size() Make NeuralNetAPIUser a member

Separate the neural user from SearchThread

d33e361

Create run_inference() wrapper

53ef3d9

Rename id to agentID

Add selfplayFileMutex

440d72e

Make some variables to *

Fix some compilation problems

8b9da6a

Fix remaining compile errors

ead1cec

Add run_selfplay_thread and gameThreads

f52db48

Merge branch 'parallel_rl' of github.com:QueensGambit/CrazyAra into

089cc73

parallel_rl

Put MCTSAgent and RawNetAgent into vectors

6155580

Fix compile errors

623786c

Add mutex, condition variable and batch counter

a29ab1d

Use batch variables

4852316

Make use of NeuralNetAPIUser object for SearchThread

93213b8

Fix compile problems

318ee3f

Add helper method handle_fwd_pass()

1c0f08d

Fill information for batchCounter and batchMutex

0701cb7

Use shared_ptr for batchCounter and batchMutex

2072932

Add numberOfGames / NUMBER_OF_PARALLEL_GAMES

3b90112

Add agentID to name identifier

eceab21

Use unique_ptr instead of raw objects

7189a1a

Add lock for initialisation + update offsets

85c6685

Initialize timeManager

45503ca

Add debug message

6647985

Try recursive_mutex

95da4ba

Fix compile problem recursive_mutex

b4f8cd5

Change to recursive_mutex

5267112

Revert recursive mutex

37f8b7d

+ make condition variable as MCTSAgent member

Move cout statement

a0c999c

Avoid iteration

6d2f1b1

Use CXX 20 and barriers

836c4ff

QueensGambit added 29 commits December 10, 2025 17:02

Update maxBatchSize for worker

3558aa2

Use fwd_pass_queue()

5814148

Use correct clipping

ae22508

Simplify expression for handle_fwd_pass()

31f3453

Give each MCTSAgent its own nnUser

48e2173

Maybe fix inference bug

f8ed215

Remove unused variables

01bed69

Fix game sample export

abc79c8

Integrate risev3-large into train loop

5b20661

Update safeguard condition

3f02e58

Update rl_config.py

442342b

Update MCTSAgent::set_root_node_predictions()

1e87662

Add get_alpha_zero_model_small(), resnet-small

a0dfacd

Make RL loop fully parallel

a0aba56

Fix compile errors

0991ecf

Compressing individual files

07627c3

Adjust export for num_parallel_games > 1

4c389ee

Replace get_local_batch_size() with get_main_batch_size()

c841471

Change local batch size in UCI-options

64f04db

Update default value for MaxInitPly

3fea06e

Replace searchSettings->get_local_batch_size()

d566a03

with searchSettings->batchSize

Replace searchSettings->get_local_batch_size()

675a588

with searchSettings->batchSize

Merge branch 'parallel_rl' of github.com:QueensGambit/CrazyAra into p…

e833bc6

…arallel_rl

readd increment of gameIdx

accc3ef

Update go command for selfplay

24fd90e

Update SelfPlay::max_samples_per_iteration()

56c0138

Change one logging.info call to logging.debug()

31b2e7c

Update generate_random_nn.py

2df5b92

Use get_default_model()

Update generate_random_nn.py

295f066

Remove mxnet code

QueensGambit merged commit bb3b5b6 into master Jan 12, 2026
0 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel RL#225

Parallel RL#225
QueensGambit merged 72 commits into
masterfrom
parallel_rl

QueensGambit commented Jan 9, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

QueensGambit commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

QueensGambit commented Jan 9, 2026 •

edited

Loading