Deep neural networks have recently achieved great success across many domains and attracted growing attention from researchers worldwide. The heavy demand for training jobs challenges the development of both software and hardware. Distributed training is a common approach to accelerating these jobs. In this paper, we address one of the problems in scaling up the training environment, and we also explain the model and the underlying tools.
Deep Neural Networks (DNNs) have achieved great success and have drawn increasing attention from researchers all over the world. The huge demand for training jobs challenges the development of both software tools and hardware systems. Distributed training is a common approach to speeding up these jobs. In this paper, we propose a new method to address one of the problems in scaling up a training environment, and we also explain the model and the tools behind it.