摘要详情

ID / 提交时间

20 / 2018-02-13 14:32:33

标题

A METHOD FOR STOCHASTIC OPTIMIZATION

关键字

Convergence rate,DC bias,Neural network

主题及专题

全体主题

状态

终稿

作者

banu prasad / BE

摘要

We introduce Adam, an algorithm for first-order gradient-based optimization of
stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient,
has little memory requirements, is invariant to diagonal rescaling of the gradients,
and is well suited for problems that are large in terms of data and/or parameters.
The method is also appropriate for non-stationary objectives and problems with
very noisy and/or sparse gradients. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms,
on which Adam was inspired, are discussed. We also analyze the theoretical convergence properties of the algorithm and provide a regret bound on the convergence rate that is comparable to the best known results under the online convex
optimization framework. Empirical results demonstrate that Adam works well in
practice and compares favorably to other stochastic optimization methods. Finally,
we discuss AdaMax, a variant of Adam based on the infinity norm.

重要日期

会议日期

02月26日

2018

至

02月28日

2018
02月28日 2018

注册截止日期
01月22日 2019

初稿截稿日期

承办单位

aconf

联系方式

banuprasad
ba******@yahoo.com
988********
812*********

登录查看完整联系方式

移动端

在手机上打开

小程序

打开微信小程序

客服

扫码或点此咨询

International conference on techmobility (ICT)

摘要详情

重要日期

会议日期

承办单位

联系方式