How LLMs Work, Explained Without Math - miguelgrinberg.com
LLaMA 7B GPU Memory Requirement - Transformers - Hugging Face Forums
[https://discuss.huggingface.co/t/llama-7b-gpu-memory-requirement/34323/6]
With bitsandbytes optimizers (such as 8-bit AdamW), you would need 2 bytes per parameter, or 14 GB of GPU memory.
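The arithmetic behind that figure can be sketched as follows. This is a back-of-the-envelope estimate, assuming LLaMA 7B has roughly 7 billion parameters and that 8-bit AdamW stores two one-byte state values (first and second moment) per parameter:

```python
# Rough optimizer-state memory estimate for LLaMA 7B with 8-bit AdamW.
# Assumption: two 1-byte moment values per parameter (1 + 1 = 2 bytes).
params = 7e9          # ~7 billion parameters
bytes_per_param = 2   # 8-bit first moment + 8-bit second moment
total_gb = params * bytes_per_param / 1e9
print(f"{total_gb:.0f} GB")  # prints "14 GB"
```

Note this covers only the optimizer state; the model weights and gradients require additional memory on top of it.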