Skip to content

Move to sharded data loading#52

Open
PatrickRMiles wants to merge 3 commits intoLBANN:mainfrom
PatrickRMiles:miles30/sharded_dataloading
Open

Move to sharded data loading#52
PatrickRMiles wants to merge 3 commits intoLBANN:mainfrom
PatrickRMiles:miles30/sharded_dataloading

Conversation

@PatrickRMiles
Copy link
Copy Markdown
Collaborator

This PR makes changes to the data loading so that individual GPUs only load their own shards of data, rather than loading full samples and then sharding. This should help reduce memory requirements at higher problem scales.

@PatrickRMiles PatrickRMiles changed the title Draft: Move to sharded data loading Move to sharded data loading Apr 16, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Load samples in sharded manner, rather than loading full samples then sharding

1 participant