MishBackward#
MishBackward operation computes gradient for Mish.
\[\dst & = \diffdst * \frac{e^{\src} * \omega}{\delta^{2}}\]
where
\[\begin{split}\omega & = e^{3\src} + 4 * e^{2\src} + e^{\src} * (4 * \src + 6) + 4 * (\src + 1) \\
\delta & = e^{2\src} + 2 * e^{\src} + 2\end{split}\]
Operation Attributes#
MishBackward operation does not support any attribute.
Execution Arguments#
The inputs and outputs must be provided according to the below index order when constructing an operation.
Inputs#
Index  | 
Argument Name  | 
Required or Optional  | 
|---|---|---|
0  | 
  | 
Required  | 
1  | 
  | 
Required  | 
Outputs#
Index  | 
Argument Name  | 
Required or Optional  | 
|---|---|---|
0  | 
  | 
Required  | 
Supported Data Types#
MishBackward operation supports the following data type combinations.
Src  | 
Diff_dst  | 
Diff_src  | 
|---|---|---|
f32  | 
f32  | 
f32  | 
f16  | 
f16  | 
f16  | 
bf16  | 
bf16  | 
bf16  |