From: Effective hyperparameter optimization using Nelder-Mead method in deep learning
Name
Description
Iteration
20,000
Batch size
100
Learning rate decay policy
Multistep (gamma=0.1, step value={15,000,18,000}) [29]