Joye Personal Blog

Blog Notes Projects Links About Contact

Back

Blog

Page 2 - Showing 5 of 13 posts

2025年12月18日

Understanding Attention: From Q, K, V to Multi-Head

A deep dive into Attention, the Transformer's core engine: grasp Q, K, V via a database-query analogy, master Multi-Head, and clear up Softmax vs RMSNorm.

13 min read
- llm
- transformer
- minimind
- attention
- multi-head
2025年12月17日

RoPE: From Permutation Invariance to Multi-Frequency

A deep dive into RoPE (Rotary Position Embedding), the standard position encoding for modern LLMs: the math, the engineering, and floating-point precision.

12 min read
- llm
- transformer
- minimind
- rope
- position encoding
2025年12月16日

Why Transformers Need Normalization: Gradients to RMSNorm

A deep dive into why deep neural networks need normalization, and how RMSNorm became standard in modern LLMs

10 min read
2025年10月23日

Frontend Intern Interviews at Chinese Startups: A Prep Guide

A systematic rundown of the technical topics, application data, and a complete prep checklist for frontend internship interviews at smaller Chinese companies.

8 min read
- internship
- frontend
- interview
- react
2025年7月1日

My First-Ever Pull Request

A sophomore reflects on landing his first successful open-source pull request

6 min read
- open source
- frontend