-
Notifications
You must be signed in to change notification settings - Fork 15
Open
Description
Hi,
In prepare_fastspeech.ipynb file,
about
F = torch.mean(torch.max(alignments, dim=-1)[0], dim=-1)
r, c = torch.argmax(F).item()//4, torch.argmax(F).item()%4
location = torch.max(alignments[r,c], dim=1)[1]My understanding is: In the first line, the tensor shape changed from (layer_num, target_length, source_length) to (layer_num, target_length), and to (layer_num).
But I don't understand what's the mean of "4", and why use the layer num to calculate the location?
If there is a problem with my understanding, thanks for pointing out.
Metadata
Metadata
Assignees
Labels
No labels