
Add fp16, multi-GPU training script (toy dataset) #123

Merged
SaulLu merged 7 commits into master from JC/joint_training_fp16 on Jan 21, 2022
Conversation

@changjonathanc
Collaborator

  • Changed an argument in load_dataset so that it can read private datasets.

  • Added gradient_step logging, so we can see time vs. gradient step on wandb.

  • Added 2 sub-experiments:

      • fp16
      • fp16, 2 GPU
          • note: I only have 1 GPU, so I couldn't test this one, but I included a local_test.sh that can be used to test it.

  • The toy dataset used is now private. To run the experiments, you'd need to configure your Hugging Face access token with huggingface-cli login.
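The gradient_step logging mentioned above boils down to counting completed optimizer steps (not micro-batches) and logging that counter alongside each metric. A minimal sketch of the idea, with wandb.log stubbed out as a list append and a hypothetical accumulation count (neither is taken verbatim from the script):

```python
# Sketch of gradient-step logging, assuming gradient accumulation.
# `log_metrics` stands in for wandb.log; names here are illustrative.
logs = []

def log_metrics(metrics):
    logs.append(metrics)  # stand-in for wandb.log(metrics)

gradient_accumulation_steps = 4
gradient_step = 0

for batch_idx in range(8):  # toy loop over 8 micro-batches
    # ... forward/backward on a micro-batch would happen here ...
    if (batch_idx + 1) % gradient_accumulation_steps == 0:
        gradient_step += 1  # one optimizer step completed
        log_metrics({"gradient_step": gradient_step})
```

With 8 micro-batches and accumulation 4, this logs two gradient steps, which is what lets wandb plot wall-clock time against gradient step rather than micro-batch index.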

@changjonathanc changjonathanc requested a review from SaulLu January 14, 2022 14:14
import functools
import logging

from datasets import load_dataset

logger = logging.getLogger(__name__)

# Pass the auth token on every call so private datasets can be loaded.
load_dataset = functools.partial(load_dataset, use_auth_token=True)
Collaborator

This is something we don't necessarily need in our workflow, I think, as we work on a copy of the dataset already cloned locally. That said, I understand that you may need it for other tests on your side. ☺️
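The functools.partial wrapper in the diff makes every later load_dataset call pass use_auth_token=True without touching the call sites. A minimal self-contained sketch of the same pattern, using a stand-in function instead of the real datasets.load_dataset (the dataset name is hypothetical):

```python
import functools

# Stand-in for datasets.load_dataset, just to illustrate the pattern.
def load_dataset(path, use_auth_token=False):
    return {"path": path, "authenticated": use_auth_token}

# Same trick as in the diff: bake the auth flag into all later calls.
load_dataset = functools.partial(load_dataset, use_auth_token=True)

# Call sites stay unchanged but now authenticate implicitly.
ds = load_dataset("my-org/private-toy-dataset")
```

The advantage is that scripts sharing this module don't each need to remember the flag; the trade-off, as noted in the review, is that it forces token-based auth even when the dataset is already available locally.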

@SaulLu SaulLu merged commit 6552342 into master Jan 21, 2022
tianjianjiang added a commit to tianjianjiang/bigscience-metadata that referenced this pull request Jan 21, 2022
* master: (141 commits)
  build: bump nltk to 3.6.7 for security and performance (bigscience-workshop#130)
  build: bump nltk to 3.6.7 for security and performance (#5)
  Add fp16, multi-GPU training script (toy dataset) (bigscience-workshop#123)
  create dataset with html, timestamp, url, datasource, generation length and website description metadata and titles, footers and headers from HTML (bigscience-workshop#119)
  remove `#SBATCH --gres=gpu:0 ` from `03_create_dataset.slurm` (bigscience-workshop#121)
  Add joint training slurm script (bigscience-workshop#111)
  Add features types for the metadata to extract and test multiprocessing (bigscience-workshop#118)
  feat: add a feature to choose where to extract metadata (bigscience-workshop#116)
  Use dateutil to parse date (bigscience-workshop#117)
  feat: change how the entity extraction process use ids (bigscience-workshop#115)
  add `path_or_url_flair_ner_model` in order to execute the entity extraction on a partition without internet (bigscience-workshop#106)
  delete old submodule
  delete ds_store
  style check
  style & quality
  imports
  handle IndexError for `wikipedia_desc_utils` (bigscience-workshop#102)
  handle the comment specific type not recognized by pyarrow (bigscience-workshop#83)
  quality check
  Change torch version + make it optional (bigscience-workshop#82)
  ...

# Conflicts:
#	bsmetadata/metadata_utils.py