Chilfox

❯

❯

❯

D DL4CV Lec21b Value_Function

D-DL4CV-Lec21b-Value_Function

Type

digest

Aliases

Value Function

Source Link

D-DL4CV-Lec21-Reinforcement_Learning

Value Function

The value function at state $s$ computes the expected reward by following policy $π$

V^{π} (s) = E [t \geq 0 \sum γ^{t} r_{t} ∣ s_{0} = s, π]

關係圖譜

反向連結

D-DL4CV-Lec21-Reinforcement_Learning

Created with Quartz v4.5.1 © 2026