Specifies which axis of the input shape is the channel axis
Do not use the cuDNN operator, even if it is available
Epsilon added to the variance to prevent division by zero. When using cuDNN, it must be no less than CUDNN_BN_MIN_EPSILON, defined in cudnn.h (usually 1e-5)
Keep gamma fixed during training
Momentum for the moving average of the batch statistics
Also output the mean and the inverse standard deviation
Whether to use the global moving statistics instead of the local batch statistics. This forces batch-norm to behave as a scale-and-shift operator.
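
The following NumPy sketch illustrates how the options described above could interact in a single forward pass. It is not the operator's actual implementation. The function name and all argument names (axis, eps, momentum, fix_gamma, use_global_stats, output_mean_var) are hypothetical labels chosen for this example. Two behaviors are assumptions: fixing gamma is modelled as using a scale of 1, and the moving average is updated as momentum * old + (1 - momentum) * new. The cuDNN switch is not modelled, since it only selects a backend.

```python
import numpy as np


def batch_norm_sketch(x, gamma, beta, moving_mean, moving_var,
                      axis=1, eps=1e-5, momentum=0.9, fix_gamma=False,
                      use_global_stats=False, output_mean_var=False):
    """Illustrative batch-norm forward pass (hypothetical names throughout)."""
    axis = axis % x.ndim  # allow negative axis values
    # Reduce over every axis except the channel axis.
    reduce_axes = tuple(i for i in range(x.ndim) if i != axis)
    # Shape used to broadcast per-channel vectors against x.
    bshape = [1] * x.ndim
    bshape[axis] = x.shape[axis]

    if use_global_stats:
        # Stored statistics are used as-is, so the operation reduces to a
        # fixed per-channel scale and shift.
        mean, var = moving_mean, moving_var
    else:
        mean = x.mean(axis=reduce_axes)
        var = x.var(axis=reduce_axes)
        # Assumed update convention; moving_* are modified in place.
        moving_mean[...] = momentum * moving_mean + (1 - momentum) * mean
        moving_var[...] = momentum * moving_var + (1 - momentum) * var

    # eps keeps the denominator away from zero.
    inv_std = 1.0 / np.sqrt(var + eps)
    # Assumption: a fixed gamma behaves like a scale of 1.
    scale = np.ones_like(gamma) if fix_gamma else gamma

    out = ((x - mean.reshape(bshape)) * (scale * inv_std).reshape(bshape)
           + beta.reshape(bshape))
    if output_mean_var:
        return out, mean, inv_std
    return out
```

With use_global_stats enabled, the batch itself no longer influences the normalization, which is why the description above calls the result a scale-and-shift operator.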