A lot of Dean's initial work was getting models effectively training on that kind of hardware using distributed SGD.