template<typename Architecture_t, typename Layer_t = VGeneralLayer<Architecture_t>, typename DeepNet_t = TDeepNet<Architecture_t, Layer_t>>
class TMVA::DNN::TAdam< Architecture_t, Layer_t, DeepNet_t >
Adam Optimizer class.
This class represents the Adam Optimizer.
Definition at line 45 of file Adam.h.
|
Scalar_t | fBeta1 |
| The Beta1 constant used by the optimizer.
|
|
Scalar_t | fBeta2 |
| The Beta2 constant used by the optimizer.
|
|
Scalar_t | fEpsilon |
| The Smoothing term used to avoid division by zero.
|
|
std::vector< std::vector< Matrix_t > > | fFirstMomentBiases |
| The decaying average of the first moment of the past bias gradients associated with the deep net.
|
|
std::vector< std::vector< Matrix_t > > | fFirstMomentWeights |
| The decaying average of the first moment of the past weight gradients associated with the deep net.
|
|
std::vector< std::vector< Matrix_t > > | fSecondMomentBiases |
| The decaying average of the second moment of the past bias gradients associated with the deep net.
|
|
std::vector< std::vector< Matrix_t > > | fSecondMomentWeights |
| The decaying average of the second moment of the past weight gradients associated with the deep net.
|
|
DeepNet_t & | fDeepNet |
| The reference to the deep net.
|
|
size_t | fGlobalStep |
| The current global step count during training.
|
|
Scalar_t | fLearningRate |
| The learning rate used for training.
|
|
template<typename Architecture_t , typename Layer_t = VGeneralLayer<Architecture_t>, typename DeepNet_t = TDeepNet<Architecture_t, Layer_t>>
The decaying average of the first moment of the past weight gradients associated with the deep net.
Definition at line 55 of file Adam.h.
template<typename Architecture_t , typename Layer_t = VGeneralLayer<Architecture_t>, typename DeepNet_t = TDeepNet<Architecture_t, Layer_t>>
The decaying average of the second moment of the past bias gradients associated with the deep net.
Definition at line 62 of file Adam.h.
template<typename Architecture_t , typename Layer_t = VGeneralLayer<Architecture_t>, typename DeepNet_t = TDeepNet<Architecture_t, Layer_t>>
The decaying average of the second moment of the past weight gradients associated with the deep net.
Definition at line 60 of file Adam.h.