32 commits
1e0b33d
updating loading in santa coder demo to use transformer bridge
degenfabian Aug 19, 2025
9ea175f
add santa coder demo to CI
degenfabian Aug 19, 2025
1275468
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Aug 20, 2025
8174969
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Aug 22, 2025
39a7344
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Aug 26, 2025
b86a5d9
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Sep 4, 2025
b1967a5
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Sep 5, 2025
70215a7
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Sep 6, 2025
1d2c01f
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Sep 7, 2025
2435749
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Sep 10, 2025
4ce65dd
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Sep 10, 2025
fff177f
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Sep 12, 2025
d9e91df
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Sep 12, 2025
217f18a
Merge remote-tracking branch 'origin/dev-3.x' into santa_coder_demo_t…
bryce13950 Sep 12, 2025
15075d0
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 10, 2025
6ae11dc
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 13, 2025
18e9054
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 14, 2025
6a1b35b
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 14, 2025
ce6cda5
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 15, 2025
f8427a3
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 15, 2025
75cd717
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 15, 2025
09ea95c
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 16, 2025
fcf7fbf
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 16, 2025
dc5bd66
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 16, 2025
0493da0
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 16, 2025
603cc33
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 16, 2025
6be81c0
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 17, 2025
ec1d020
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Oct 23, 2025
00b5c9a
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Nov 12, 2025
0ec5a13
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Nov 12, 2025
f5ffca7
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Nov 12, 2025
387e8e5
Merge remote-tracking branch 'origin/dev-3.x-folding' into santa_code…
bryce13950 Nov 20, 2025
1 change: 1 addition & 0 deletions .github/workflows/checks.yml
@@ -240,6 +240,7 @@ jobs:
  # - "No_Position_Experiment"
  - "Othello_GPT"
  - "Patchscopes_Generation_Demo"
+ - "Santa_Coder"
  # - "T5"
  steps:
  - uses: actions/checkout@v3
13 changes: 5 additions & 8 deletions demos/Santa_Coder.ipynb
@@ -54,7 +54,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 2,
+ "execution_count": null,
  "id": "da9f5a40",
  "metadata": {},
  "outputs": [
@@ -97,11 +97,7 @@
  "\n",
  "import transformer_lens\n",
  "import transformer_lens.utils as utils\n",
- "from transformer_lens.hook_points import (\n",
- " HookedRootModule,\n",
- " HookPoint,\n",
- ") # Hooking utilities\n",
- "from transformer_lens import HookedTransformer, HookedTransformerConfig, FactoredMatrix, ActivationCache\n",
+ "from transformer_lens.model_bridge import TransformerBridge\n",
  "\n",
  "torch.set_grad_enabled(False)\n",
  "\n",
@@ -132,7 +128,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 4,
+ "execution_count": null,
  "id": "1f7ac1e1",
  "metadata": {},
  "outputs": [
@@ -154,7 +150,8 @@
  "source": [
  "# Disable folding norms and folding norms and biases so that intermediate value\n",
  "# in between transformer blocks can be compared\n",
- "bloom = HookedTransformer.from_pretrained(\"bloom-560m\",fold_ln=False, fold_value_biases=False, center_writing_weights=False)"
+ "bloom = TransformerBridge.boot_transformers(\"bloom-560m\",fold_ln=False, fold_value_biases=False, center_writing_weights=False)\n",
+ "bloom.enable_compatibility_mode()"
  ]
  },
  {