Topic | Replies | Views | Activity
Number of steps drifts for `val_check_interval` when gradient accumulation turned on | 0 | 284 | March 26, 2023
Global_step increased at new epoch regardless of gradient accumulation | 2 | 719 | March 26, 2023
Incorrect batch size being inferred using trainer.fit(), correct batch size in dataloader? What could be going wrong? [PyLightning] | 1 | 503 | March 26, 2023
Model Works on CPU but Errors out while running on GPU | 1 | 819 | March 25, 2023
How to continue training for more epochs? | 1 | 1156 | March 25, 2023
Changing batch size during training | 3 | 1992 | March 20, 2023
Modifying the Trainer when calling Trainer.fit() multiple times | 2 | 1351 | February 18, 2023
Error while training SimCLR model | 0 | 203 | February 12, 2023
Question about auto_lr_find() | 1 | 2217 | January 31, 2023
How do I prevent initial validation run in Trainer 1.9.0? | 1 | 328 | January 24, 2023
Save_last and monitor in ModelCheckpoint | 0 | 157 | January 23, 2023
Why `precision=16` for me is almost useless for speeding up? | 1 | 1028 | January 16, 2023
Resume_from_checkpoint does not work | 4 | 5487 | December 7, 2022
Dealing with large dataset | 1 | 3382 | December 3, 2022
Auto_lr_find dependence on initial learning rate | 1 | 480 | November 22, 2022
Gradient Accumulation with Dual (optimizer, scheduler) Training | 0 | 426 | November 10, 2022
Filename for last checkpoint | 1 | 644 | November 7, 2022
How to get the checkpoint path? | 11 | 17553 | November 2, 2022
Why in progress bar there is no train_acc display? | 0 | 692 | July 8, 2022
Issue Regarding DETR on custom data | 0 | 310 | June 7, 2022
Target size that is different to the input size | 10 | 12065 | May 19, 2022
Precision doesn't work | 0 | 677 | April 14, 2022
How to use `LightningCLI` to start training from a checkpoint at epoch 0? | 0 | 869 | February 19, 2022
How to customize trainer in order to restrict parameter range during training? | 2 | 652 | January 30, 2022
Modules that have backward hooks assigned cannot be compiled | 1 | 720 | January 29, 2022
ModelCheckpoint docs for every_n_epochs==None | 1 | 696 | January 29, 2022
How to deal with lr_find_temp_model_**.ckpt | 2 | 615 | January 29, 2022
Does PL validate and train at the same time? | 1 | 2107 | January 29, 2022
Where is accelerator_connector? | 1 | 1227 | January 29, 2022
No `training_step()` method defined | 10 | 7996 | January 9, 2022