0

A Three-Phase Factual Recall Circuit in Gemma-2B and Gemma-12B-IT

https://towardsdatascience.com/a-three-phase-factual-recall-circuit-in-gemma-2b-and-gemma-12b-it/(towardsdatascience.com)
A mechanistic interpretability study uses activation patching to localize factual recall circuits within the Gemma model family. The research identifies a consistent three-phase circuit for how facts are processed: storage in the early-to-mid layers, routing via distributed attention heads, and readout in the final layers. This pattern was found to replicate at scale, appearing in both Gemma-2B and Gemma-12B-IT, with the residual stream being the dominant causal component for storing facts. The study also notes that tokenizer differences can create challenges for cross-model comparisons.
0 pointsby will222 hours ago

Comments (0)

No comments yet. Be the first to comment!

Want to join the discussion?