Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel - Tech Sentiments