I posted it here too: https://livebook.manning.com/forum?product=chollet3&comment=585044 the torch train_step implementations need to self.zero_grad(), or else the results are terrible
I posted it here too: https://livebook.manning.com/forum?product=chollet3&comment=585044
the torch train_step implementations need to self.zero_grad(), or else the results are terrible