Introduction
Core Concept
After computing the scores, we transform them into probabilities to build better intuition about the classifier's confidence. Next, we define a loss on these probabilities.
Explanation
Let $s = f(x_i; W)$ be the output of the score function, i.e., $s_j$ is the score assigned to class $j$ for image $x_i$.

We convert the scores into probabilities using the softmax function. The conditional probability

$$P(y_i \mid x_i) = \frac{e^{s_{y_i}}}{\sum_j e^{s_j}}$$

represents the normalized score image $x_i$ gets on label $y_i$.

Then, the loss of image $x_i$ is defined as

$$L_i = -\log P(y_i \mid x_i)$$

Putting it together, we get the equation below:

$$L_i = -\log\left(\frac{e^{s_{y_i}}}{\sum_j e^{s_j}}\right)$$
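The equation above can be sketched directly in NumPy. This is a minimal illustration, not part of the original notes; the function name and the max-shift trick (a standard way to avoid overflow in the exponentials) are my additions:

```python
import numpy as np

def cross_entropy_loss(scores, y):
    """Cross-entropy loss for one example.

    scores: 1-D array of raw class scores s_j
    y: index of the correct label y_i
    """
    # Shift by the max score for numerical stability;
    # softmax is invariant to adding a constant to all scores.
    shifted = scores - np.max(scores)
    probs = np.exp(shifted) / np.sum(np.exp(shifted))
    # L_i = -log P(y_i | x_i)
    return -np.log(probs[y])
```

For example, with equal scores over 3 classes the loss is $-\log(1/3) = \log 3$, regardless of which label is correct.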
Problems & Solutions for Cross-Entropy Loss
Q1: What are the min/max possible losses? A1: min: $0$ (when the correct class gets probability $1$), max: $+\infty$ (as the correct class probability approaches $0$)
Q2: If all scores are small random values, what is the loss for any input? A2: $\log C$, where $C$ is the number of labels, since every class then gets probability roughly $1/C$
At the start of training, the scores will be small random values, so we can compare the initial loss with $\log C$ to check whether there is a bug in our code.
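This sanity check can be sketched as follows. The class count and score scale here are assumed values for illustration (e.g. a 10-class problem), not from the notes:

```python
import numpy as np

rng = np.random.default_rng(0)
num_classes = 10  # assumed: a 10-class problem

# Small random scores, as at the start of training.
scores = 0.001 * rng.standard_normal(num_classes)

# Softmax probabilities are all approximately 1/C.
probs = np.exp(scores) / np.sum(np.exp(scores))

# Loss for an arbitrary correct label; should be close to log(C).
loss = -np.log(probs[0])
print(loss, np.log(num_classes))
```

If the printed loss is far from $\log C \approx 2.30$, the loss implementation likely has a bug.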