Ex1
1. Rewrite the Problem
- Vocabulary: [‘i’, ‘o’, ‘c’, ‘a’]
- Embeddings: [-2, 5, 1, 9] (so ‘i’ → -2, ‘o’ → 5, ‘c’ → 1, ‘a’ → 9)
- Scalar weights $W_h$, $W_x$, $W_y$ (the original values were lost here; the summary table below is consistent with $W_h = W_x = 1$)
- Elman RNN, activation $g(z) = z$ (just the identity), no bias
Task:
- Compute the forward pass through a simple 2-timestep RNN for the input [‘o’, ‘c’] (i.e. embed ‘o’, then ‘c’).
- At each step, apply the Elman RNN hidden-state update.
- Compute the loss as $L = \sum_t L_t$ (the per-step loss $L_t$ is not specified; just follow the architecture for now).
2. Step-by-Step Solution:
Step 1: Initialization
- $h_0 = 0$ (initial hidden state)
Step 2: First input (‘o’)
- Embed ‘o’: $x_1 = 5$
- Elman RNN update: $h_1 = g(W_h h_0 + W_x x_1) = W_h h_0 + W_x x_1$ (identity activation, no bias)
- Plug in values (with $W_h = W_x = 1$): $h_1 = 1 \cdot 0 + 1 \cdot 5 = 5$
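A quick check of this step in code. The scalar weights $W_h = W_x = 1$ are an assumption (their values did not survive in the problem statement, but they match the summary table below):

```python
# Step 2 as code: embed 'o' and apply one Elman update.
emb = {'i': -2, 'o': 5, 'c': 1, 'a': 9}  # embeddings from the setup
W_h, W_x = 1, 1   # assumed weight values (not given in the original)
h0 = 0            # initial hidden state
x1 = emb['o']     # embed 'o' -> 5
h1 = W_h * h0 + W_x * x1  # identity activation, no bias
print(h1)  # 5
```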
Step 3: Second input (‘c’)
- Embed ‘c’: $x_2 = 1$
- Elman RNN update: $h_2 = W_h h_1 + W_x x_2$
- Plug in values: $h_2 = 1 \cdot 5 + 1 \cdot 1 = 6$
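Both timesteps can be run as one loop. Again, the scalar weights $W_h = W_x = 1$ are assumed, not given:

```python
# Full 2-step forward pass over the input ['o', 'c'].
emb = {'i': -2, 'o': 5, 'c': 1, 'a': 9}
W_h, W_x = 1, 1   # assumed weight values
h = 0             # h_0
for ch in ['o', 'c']:
    h = W_h * h + W_x * emb[ch]  # Elman update, identity activation, no bias
    print(ch, h)                 # 'o' -> 5, then 'c' -> 6
```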
(Optional) Step 4: Compute Output and Loss
- Usually, there’s one last linear layer for outputs: $\hat{y}_t = W_y h_t$
- For each time step: $\hat{y}_1 = W_y h_1 = W_y \cdot 5$ and $\hat{y}_2 = W_y h_2 = W_y \cdot 6$
- The loss would be calculated here if target values were given. For now, just sum the losses at each step, $L = \sum_t L_t$, as in the diagram.
3. Summary Table
| Step | Input | Embedding $x_t$ | $h_{t-1}$ | $h_t$ | Output $\hat{y}_t$ |
|---|---|---|---|---|---|
| 1 | ‘o’ | 5 | 0 | 5 | $W_y \cdot 5$ |
| 2 | ‘c’ | 1 | 5 | 6 | $W_y \cdot 6$ |
General formula:
At each step $t$: $h_t = g(W_h h_{t-1} + W_x x_t) = W_h h_{t-1} + W_x x_t$, and $\hat{y}_t = W_y h_t$.
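The general formula can be wrapped in a small reusable function. The default weights of 1 are an assumption carried over from above:

```python
def elman_forward(tokens, emb, W_h=1.0, W_x=1.0, W_y=1.0, h0=0.0):
    """Scalar Elman RNN forward pass: identity activation, no bias.
    Default weight values of 1 are assumed, not from the exercise."""
    h = h0
    hs, ys = [], []
    for t in tokens:
        h = W_h * h + W_x * emb[t]  # h_t = g(W_h h_{t-1} + W_x x_t), g = identity
        hs.append(h)
        ys.append(W_y * h)          # y_hat_t = W_y * h_t
    return hs, ys

hs, ys = elman_forward(['o', 'c'], {'i': -2, 'o': 5, 'c': 1, 'a': 9})
print(hs, ys)  # [5.0, 6.0] [5.0, 6.0]
```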
That’s it!
- Go forward step-by-step with the given embeddings and weights.
- Use the RNN update each time.
- Apply the output weight.
- If you had targets (labels), you could then compute the per-step losses $L_t$ and the total $L = \sum_t L_t$ with a loss function.