2026
-
Sutton RL Day 3: Dynamic Programming
TLDR: Dynamic programming turns known MDP dynamics into iterative policy evaluation and improvement through Bellman updates.
-
Sutton RL Day 1: The RL Problem and MDP Basics
TLDR: RL is interaction for long-term reward: policy chooses actions, reward gives feedback, value estimates future return, and Bellman equations connect the pieces.
-
The Selfish Gene, Chapter 10: You Scratch My Back, I'll Ride on Yours
TLDR: Group living and apparent altruism can often be read through self-protection, kinship, exploitation, reciprocal exchange, and the costs of being alone.
-
The Selfish Gene, Chapter 9: Battle of the Sexes
TLDR: Sexual conflict begins with unequal parental investment, pushing males and females toward different strategies around mating, care, loyalty, and display.
-
Crafting Interpreters, Chapter 2 Notes: A Map of the Territory
TLDR: This note maps the interpreter pipeline from source text through tokens, parsing, semantic analysis, code generation, and runtime choices.
-
Crafting Interpreters, Chapter 3 Notes: The Lox Language
TLDR: Lox is the small language that carries the book: expressive enough for classes, closures, and control flow, but compact enough to implement twice.
-
Crafting Interpreters, Chapter 4 Notes: Scanning
TLDR: Scanning is the first hard boundary in an interpreter: raw characters become tokens, and the rest of the language pipeline finally has structure to work with.
-
The Selfish Gene, Chapter 7: Family Planning
TLDR: Reproduction is a trade-off between more offspring and better-supported offspring, so restraint can be self-interested rather than species-minded.