How to use Adaptive Gradient Clipping in PyTorch Lightning?

Hello, forum people!

How do I inject Adaptive Gradient Clipping (AGC, from [2102.06171] High-Performance Large-Scale Image Recognition Without Normalization) into the PyTorch Lightning trainer? Should it go somewhere between loss.backward and optimizer.step?

Hello, my apologies for the late reply. We are gradually deprecating this forum in favor of the built-in GitHub version… Could we kindly ask you to recreate your question there: Lightning Discussions
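
For anyone landing on this thread in the meantime, here is a minimal sketch of AGC in plain PyTorch, written from the description in the paper rather than taken from Lightning itself. The names `unitwise_norm` and `adaptive_clip_grad_` and the default values for `clipping` and `eps` are my own assumptions for illustration, not an official API.

```python
import torch


def unitwise_norm(x: torch.Tensor) -> torch.Tensor:
    """L2 norm per output unit (first dimension) of a tensor."""
    if x.ndim <= 1:
        # Biases and scalars: use a single norm over the whole tensor.
        return x.norm(p=2)
    # Weights: reduce over every dim except the first, keep dims for broadcasting.
    dims = tuple(range(1, x.ndim))
    return x.norm(p=2, dim=dims, keepdim=True)


@torch.no_grad()
def adaptive_clip_grad_(parameters, clipping: float = 0.01, eps: float = 1e-3):
    """Rescale gradients in place so that, per unit, ||G|| <= clipping * max(||W||, eps)."""
    for p in parameters:
        if p.grad is None:
            continue
        param_norm = unitwise_norm(p).clamp_(min=eps)
        grad_norm = unitwise_norm(p.grad)
        max_norm = param_norm * clipping
        # Rescaled gradient; the small clamp only guards against division by zero.
        clipped = p.grad * (max_norm / grad_norm.clamp(min=1e-6))
        # Only units whose gradient norm exceeds the threshold are replaced.
        p.grad.copy_(torch.where(grad_norm > max_norm, clipped, p.grad))
```

As for where to call it: it has to run after the backward pass and before the optimizer update, as the question suggests. In Lightning, one candidate is the `on_before_optimizer_step` hook of the LightningModule, calling `adaptive_clip_grad_(self.parameters())` there and leaving the Trainer's own `gradient_clip_val` unset so the gradients are not clipped twice. I have not checked this against every Lightning version, so treat it as a starting point. Note also that the paper skips AGC for the final linear layer, which this sketch does not do unless you filter the parameters yourself.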