The Softmax output function transforms a previous layer’s output into a vector of probabilities. It is commonly used for multi-class classification. Given an input vector and a weighting vector we have:

References
The Softmax output function transforms a previous layer’s output into a vector of probabilities. It is commonly used for multi-class classification. Given an input vector and a weighting vector we have:

References