bugfix: Update modeling_t5.T5Stack.forward() for Gradient Checkpointing
#2
by Panda-vid - opened
Update checkpoint() call such that parameters for the layer_module object are passed correctly.
plenz changed pull request status to closed
The feature only works with older transformer versions