04 Dec 2022

Sparse language models are a thing

The GPT3¹ paper mentioned that it’s 10x bigger than any previous non-sparse LM.

So - sparse LMs () are LMs with A LOT of params where only a subset is used for each incoming example.²

Nel mezzo del deserto posso dire tutto quello che voglio.

serhii.net