Masterarbeit current status
Motivational accountability!
Status
# Theory
Intro: [## ]
Grammar: [################## ]
LM theory: [## ]
Task desc.: [####### ]
Literature: [##### ]
# Code
LMentry: [############ ]
UA-CBT: [#### ]
Eval code: [ ]
Run exps: [ ]
Plan
- Finish UA-CBT to a reasonable extent
- Dig deep into formats / eval harnesses / benchmarks code,
- write the relevant theory as I go
- find cool UA LMs to use for my tests
- Finish basic code for evaluation and experiments to have it ready
- Finish the existing tasks, LMentry and UA-CBT as the key ones
- (they alone would be enough honestly)
- Run experiments, and hopefully write the paper based on what I have!
- Finish the Pravda dataset eval task code
- Solve for real the pandoc issues etc. and have code for camera-ready citations, glosses, etc.
- Write the additional tasks if I have any time left at this point
- Run all experiments
Stack
231002-2325 Masterarbeit toread stack:
-
<_(@taskCBT) “The goldilocks principle: Reading children’s books with explicit memory representations” (2015) / Felix Hill, Antoine Bordes, Sumit Chopra, Jason Weston: z / https://arxiv.org/abs/1511.02301 / 10.48550/ARXIV.1511.02301 _> ↩︎
-
<_(@Guo2023) “Evaluating Large Language Models: A Comprehensive Survey” (2023) / Zishan Guo, Renren Jin, Chuang Liu, Yufei Huang, Dan Shi, Supryadi, Linhao Yu, Yan Liu, Jiaxuan Li, Bojian Xiong, Deyi Xiong: z / http://arxiv.org/abs/2310.19736 / _> ↩︎
Nel mezzo del deserto posso dire tutto quello che voglio.