Value Function The value function at state s computes the expected reward by following policy π Vπ(s)=E[t≥0∑γtrt∣s0=s,π]