Saving and loading optimizer state

thevishnupradeep · February 16, 2022, 4:47am

I am trying to load and fine tune Google’s ViT model using hugging face. I’m trying to use torch.optim.AdamW for optimisation.

How do I load and save state_dict of the optimizer that was defined in configure_optimizers callback?

Thank you.

aniketmaurya · March 31, 2022, 12:49pm

Hi @thevishnupradeep, model state_dict is available with lit_model.state_dict() method. You can save it using torch.save(...) and load it back with torch.load(...)

Here is an example -

class BoringModel(pl.LightningModule):
    
    def __init__(self):
        super().__init__()
        self.lr = 1e-3
        self.model = NeuralNet()
        
    def forward(self, x):
        return self.model(x)
    
    def configure_optimizers(self):
        return torch.optim.AdamW(self.parameters())

model = BoringModel()

# state available
model.state_dict()

# save the state
torch.save(model.state_dict(), "state.pkl")

# load the state
torch.load("state.pkl")

Also, we have deprecated this forum so I would request you to use our Github Discussions for quicker response.

passion_E14 · January 12, 2023, 7:09am

This is not what the question is asking. I think the question is asking how we can load an optimizer’s state dict, or how to restore an optimizer.