
Conversation

ravi-mosaicml
Contributor

Implemented issue #135.
Also renamed `total_batch_size` to `train_batch_size`. Updated hparams.

@ravi-mosaicml
Contributor Author

@anisehsani -- it would be awesome if each evaluator could support subset num batches

```python
# but the getter will always return a Precision enum
precision: Union[str, types.Precision]  # type: ignore
_precision: types.Precision = field(init=False)  # but store an enum internally
steps_per_epoch: int = -1  # type: ignore
```
Contributor

If `steps_per_epoch` is a property, then it shouldn't need to be defined here too, right?

Contributor Author
Yeah... it's nonstandard.

Here, I defined it as a field so you can set `steps_per_epoch` as part of `__init__`, but it's optional since it has a default value.

`__init__` implicitly calls the property setter, which then sets the private variable `_steps_per_epoch`. The public getter `steps_per_epoch` always returns an int -- the actual length of the dataloader (if `_steps_per_epoch` is None), or the artificially reduced length (if `_steps_per_epoch` is not None).
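
For illustration, here is a minimal, self-contained sketch of the field-plus-property pattern being described. It is not the actual Composer `State` code: `dataloader_len` is a stand-in for the real dataloader, and the defaults are made up for the example.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class State:
    dataloader_len: int = 100  # stand-in for len(self.train_dataloader)
    steps_per_epoch: Optional[int] = None  # type: ignore (shadowed by the property below)
    _steps_per_epoch: Optional[int] = field(init=False, repr=False, default=None)

    @property  # type: ignore
    def steps_per_epoch(self) -> int:
        # the getter always returns an int: the real dataloader length,
        # or the artificially reduced length if one was set
        if self._steps_per_epoch is None:
            return self.dataloader_len
        return self._steps_per_epoch

    @steps_per_epoch.setter
    def steps_per_epoch(self, value: Optional[int]) -> None:
        # the generated __init__ assigns to steps_per_epoch, which routes through
        # this setter; when no value is passed, the dataclass default is the
        # class-level property object itself, so treat that as "not set"
        if isinstance(value, property):
            value = None
        self._steps_per_epoch = value


state = State()                    # falls back to the dataloader length
assert state.steps_per_epoch == 100
state = State(steps_per_epoch=10)  # or override it via __init__
assert state.steps_per_epoch == 10
```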

Contributor

It seems like there isn't a reason so far to pass `steps_per_epoch` to `__init__` when constructing `State`, so what about removing it for now and adding it back if a need arises? Just because it's confusing as is.

Contributor

@ajaysaini725 left a comment

LGTM 👍


@ravi-mosaicml merged commit 6d8498c into mosaicml:dev on Dec 8, 2021
@ravi-mosaicml deleted the ravi/i135 branch on December 8, 2021 00:13
coryMosaicML pushed a commit to coryMosaicML/composer that referenced this pull request Feb 23, 2022
…ize` (mosaicml#137)

1. Remove `subset_num_batches` from the dataset hparams. Synthetic datasets should instead use the length of the real dataset as their size, or have a configurable size.
2. Add `train_subset_num_batches` and `eval_subset_num_batches` to the trainer hparams.
3. Add a check in the trainer that, if this field is set, ensures `DatasetHparams.shuffle` is `False`, or otherwise emits a warning that every epoch may use a different subset of samples (see the sketch below).
4. Rename `total_batch_size` to `train_batch_size` and update the hparams.
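
As an illustration of item 3, here is a hypothetical sketch of such a check. The function and argument names are made up for this example and are not Composer's actual trainer code.

```python
import warnings
from typing import Optional


def warn_if_shuffled_subset(subset_num_batches: Optional[int], shuffle: bool) -> None:
    # hypothetical helper: training on only a subset of batches per epoch while the
    # dataset is shuffled means each epoch may draw a different subset of samples
    if subset_num_batches is not None and shuffle:
        warnings.warn("subset_num_batches is set and the dataset is shuffled; "
                      "every epoch may use a different subset of samples.")
```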