Back
A deep dive into the FeedForward network and how RMSNorm, RoPE, Attention, and FeedForward assemble into a complete Transformer Block.
llm
transformer
minimind
feedforward
swiglu
architecture