May 28, 2023 Paper Summary #9 - Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training