Transformer - Attention Is All You Need

June 12, 2017

Demystifying the Transformer architecture, explaining the Encoder, Decoder, and Attention mechanisms block by block with PyTorch implementation