Multi-tier checkpointing + orbax replicator #1332
Open
ehorning wants to merge 29 commits into apple:main from ehorning:orbax-mtc-testing
6b87f81
orbax
f61131f
mount ckpt volume and create replicator file
f6cacc3
renaming
264aea6
move stuff
f62c377
use orbax replicator
79db845
remove cleanup
93438f2
wait and rename
e5252e1
jax init + moving stuff
0cdc871
replicator restore tweaks
fba65dd
update restore objects + mesh shape
2ae0d0f
rip out unnecessary process management logic from oecp
a189306
unnecessary jax init
6e3fa28
comment out non-tensor stuff
b1199ee
cleanup
c28f970
cleanup
4d3f30d
more cleanup
19151d9
remove special non-tensor checkpointing
2541476
remove unnecessary changes
ceeef4d
more cleanup
12c1e46
raise errors
485e184
cleanup
c7662db
conditional volume mount
dc60555
more cleanup
161b179
Merge branch 'main' into orbax-mtc-testing
ehorning
21f4453
logging
c249122
Merge branch 'main' into orbax-mtc-testing
ehorning
69b12c4
orbax-cp version + dp var
9882bd4
orbax group install
7dba635
Merge branch 'main' into orbax-mtc-testing
ehorning