Commit 8b52c217 authored by ywolter

add thoughts

parent b64d6f5f
+6 −1
@@ -10,5 +10,10 @@
    * 
* pretrain models from scratch
    * encoder (bert), encoder-decoder(t5 or bart), decoder (gpt2)

## 2026-03-09
* How can the models be adapted so that Q=K?
    * Inheritance? (e.g. a BertEqual model that inherits from BERT and changes only Q=K)
* Pipeline for models with Q=K and models with Q!=K
* 
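
The inheritance idea from the notes above could be sketched as follows. This is a toy, standard-library-only illustration and not the actual BERT code: `ToySelfAttention`, `ToyEqualSelfAttention` and `build_attention` are hypothetical names, and the single-head layer only stands in for a real attention module. The subclass changes nothing except tying the key projection to the query projection, and a small factory covers both the Q=K and the Q!=K variant.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def random_matrix(rows, cols, rng):
    """Build a rows x cols matrix with entries drawn uniformly from [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


class ToySelfAttention:
    """Hypothetical single-head self-attention with separate Q and K (Q != K)."""

    def __init__(self, dim, seed=0):
        rng = random.Random(seed)
        self.dim = dim
        self.w_query = random_matrix(dim, dim, rng)
        self.w_key = random_matrix(dim, dim, rng)
        self.w_value = random_matrix(dim, dim, rng)

    def scores(self, x):
        """Return the unnormalised attention scores Q K^T / sqrt(dim)."""
        q = matmul(x, self.w_query)
        k = matmul(x, self.w_key)
        k_t = [list(col) for col in zip(*k)]
        scale = math.sqrt(self.dim)
        return [[s / scale for s in row] for row in matmul(q, k_t)]

    def forward(self, x):
        """Apply softmax attention weights to the value projection."""
        weights = [softmax(row) for row in self.scores(x)]
        return matmul(weights, matmul(x, self.w_value))


class ToyEqualSelfAttention(ToySelfAttention):
    """Same layer, but the key projection is tied to the query projection (Q = K)."""

    def __init__(self, dim, seed=0):
        # The parent still draws a key matrix, so the value matrix matches
        # the Q != K model for the same seed; the key is then replaced.
        super().__init__(dim, seed)
        self.w_key = self.w_query  # share one matrix instead of copying it


def build_attention(dim, tie_qk, seed=0):
    """Hypothetical pipeline entry point covering both variants."""
    cls = ToyEqualSelfAttention if tie_qk else ToySelfAttention
    return cls(dim, seed)
```

With `tie_qk=True` the score matrix becomes Q Qᵀ / sqrt(dim) and is therefore symmetric, which gives a cheap sanity check when comparing both variants in one pipeline.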
+1 −0
@@ -5,6 +5,7 @@ description = "Organisierte Recherche zu Auswirkungen naiven Gleichsetzens von S
readme = "README.md"
requires-python = ">=3.12"
dependencies = [
    "transformers>=5.3.0",
    "typer>=0.24.1",
]

+436 −1

File changed.

Preview size limit exceeded, changes collapsed.