Miscellaneous notes – Augustas Macijauskas

deep learning

LLMs

tokenization

There seems to be a lot of overlap between the tokenizers of Llama 3 and GPT-4. How similar are they?
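
One rough way to put a number on "how similar" is to treat each tokenizer's vocabulary as a set of token byte strings and measure the set overlap. The sketch below is only an illustration: the helper name `vocab_overlap` and the toy vocabularies are made up for this example, and loading the real Llama 3 and GPT-4 vocabularies is left out.

```python
def vocab_overlap(vocab_a, vocab_b):
    """Compare two vocabularies given as iterables of token byte strings."""
    a, b = set(vocab_a), set(vocab_b)
    shared = a & b
    union = a | b
    return {
        "shared": len(shared),                                   # tokens present in both
        "jaccard": len(shared) / len(union) if union else 0.0,   # |A ∩ B| / |A ∪ B|
        "frac_of_a": len(shared) / len(a) if a else 0.0,         # share of A also in B
        "frac_of_b": len(shared) / len(b) if b else 0.0,         # share of B also in A
    }


# Toy vocabularies (hypothetical, not the real token sets).
toy_a = {b"the", b" the", b"ing", b" token"}
toy_b = {b"the", b" the", b"ing", b" llama", b"izer"}

stats = vocab_overlap(toy_a, toy_b)
print(stats)
```

Comparing raw bytes rather than decoded strings avoids spurious mismatches when two libraries serialise the same token differently. Vocabulary overlap alone says nothing about merge order, so two tokenizers with a high score here can still split the same text differently.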

scientific computing

numerical methods

Large language models (predictably) learn to represent the semantic meaning of sentences.

deep learning

LLMs

visualisation

Large language models (predictably) learn to represent the semantic meaning of sentences.