The axis to perform layer normalization. Usually, this should be be axis of the channel dimension. Negative values means indexing from right to left.
An epsilon
parameter to prevent division by 0.
Output the mean and std calculated along the given axis.
This Param Object is specifically used for LayerNorm