Fit More and Train Faster With ZeRO via DeepSpeed and FairScale - Tech Sentiments