Trainer

Topic	Replies	Views	Activity
Issue during test stage when load_from_checkpoint	5	2442	September 27, 2023
How to keep the LR fixed for the first N epochs, and then use CosineAnnealingLR for the rest of training	0	204	September 25, 2023
LR Finder MNIST	2	684	September 18, 2023
Reloading model with trainer.fit(ckpt_path) and overrides callback	0	285	August 14, 2023
Method `on_train_batch_end` of `LightningModule` happens after callbacks `on_train_batch_end` - is this configurable?	0	244	August 9, 2023
ModelCheckpoint and EarlyStopping don't seem to work?	0	296	August 6, 2023
'tuple' object has no attribute 'trainer'	2	558	August 2, 2023
How to resume training	9	38496	July 31, 2023
RuntimeError: Early stopping conditioned on metric `val_loss` which is not available	1	374	July 24, 2023
How do I convert different LightningModules?	3	235	July 18, 2023
Is it possible to use a single Trainer to train multiple versions of the same model in parallel?	0	190	July 17, 2023
Clarification on log_every_n_steps with accumulate_grad_batches	1	314	July 16, 2023
How do I continue training the model?	2	525	July 6, 2023
KeyError: 'No action for destination key "trainer.devices" to set its default.'	1	1112	July 4, 2023
Limit steps per epoch	10	1957	July 4, 2023
How to suppress trainer from printing directly to console?	1	444	June 6, 2023
Training stuck on resume	1	840	May 31, 2023
Confusing # of optimizer steps when using gradient accumulation with DeepSpeed	0	643	May 25, 2023
Training when data is stored in batches	2	236	May 21, 2023
Trainer prints every step in validation	2	1579	May 17, 2023
Weird result in convolutional network	2	470	May 14, 2023
Retraining a model with new data	1	309	May 9, 2023
How to use SWA with a cyclic scheduler	0	395	May 7, 2023
Resume training / load module from DeepSpeed checkpoint	14	3439	May 6, 2023
Resuming training gives different model result / weights	0	726	May 4, 2023
Wonder if _update_learning_rates is properly implemented	0	149	April 19, 2023
Why is the Trainer instance saved inside the DataModule during checkpoint save?	2	318	April 11, 2023
Trainer.validate/test with ckpt_path does not resume global_step	3	208	April 7, 2023
Is gradient clipping done before or after gradients accumulation?	2	694	April 5, 2023
Multiple dataloaders and epoch calculation	0	166	April 1, 2023