7 steps to master large language models (LLMs) | Data Science Dojo
LLM for a new language : MachineLearning
[https://www.reddit.com/r/MachineLearning/comments/12xu5ls/p_llm_for_a_new_language/] - - public:mzimmerm
High-level discussion of how to train a model.
LLaMA 7B GPU Memory Requirement - Transformers - Hugging Face Forums
[https://discuss.huggingface.co/t/llama-7b-gpu-memory-requirement/34323/6] - - public:mzimmerm
With the bitsandbytes optimizers (like 8-bit AdamW), the optimizer state needs 2 bytes per parameter, or 14 GB of GPU memory for the 7B model.
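
A back-of-the-envelope check of that figure, as a minimal Python sketch; the fp16-weights and standard-AdamW lines are my own assumptions for comparison, not from the forum thread:

# Rough GPU memory arithmetic for a 7B-parameter model.
PARAMS = 7e9  # LLaMA 7B

# fp16 weights: 2 bytes per parameter.
weights_gb = PARAMS * 2 / 1e9           # ~14 GB

# Standard AdamW keeps two fp32 moments per parameter (4 + 4 bytes).
adamw_fp32_gb = PARAMS * (4 + 4) / 1e9  # ~56 GB of optimizer state

# bitsandbytes 8-bit AdamW quantizes both moments to 1 byte each,
# which is the "2 bytes per parameter, or 14 GB" quoted above.
adamw_8bit_gb = PARAMS * (1 + 1) / 1e9  # ~14 GB of optimizer state

print(f"weights {weights_gb:.0f} GB, "
      f"fp32 AdamW {adamw_fp32_gb:.0f} GB, "
      f"8-bit AdamW {adamw_8bit_gb:.0f} GB")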
Replit — How to train your own Large Language Models
[https://blog.replit.com/llm-training] - - public:mzimmerm
High-level only; talks about training a model for a language.
How to train a new language model from scratch using Transformers and Tokenizers
[https://huggingface.co/blog/how-to-train] - - public:mzimmerm
Describes how to train a new language model from scratch (on Esperanto).
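
A condensed sketch of that post's recipe (train a byte-level BPE tokenizer on the raw corpus, then a small RoBERTa-style model from scratch with the masked-LM objective); the corpus file name, output directory, and hyperparameters are placeholders, not values from the post:

import os
from tokenizers import ByteLevelBPETokenizer
from transformers import (RobertaConfig, RobertaForMaskedLM,
                          RobertaTokenizerFast,
                          DataCollatorForLanguageModeling,
                          Trainer, TrainingArguments)
from datasets import load_dataset

CORPUS = "oscar.eo.txt"   # placeholder: one raw-text file per line
OUT = "EsperBERTo"
os.makedirs(OUT, exist_ok=True)

# 1. Train a byte-level BPE tokenizer on the raw corpus.
bpe = ByteLevelBPETokenizer()
bpe.train(files=[CORPUS], vocab_size=52_000, min_frequency=2,
          special_tokens=["<s>", "<pad>", "</s>", "<unk>", "<mask>"])
bpe.save_model(OUT)  # writes vocab.json and merges.txt

# 2. Build a small RoBERTa with randomly initialized weights.
config = RobertaConfig(vocab_size=52_000, max_position_embeddings=514,
                       num_hidden_layers=6, num_attention_heads=12,
                       type_vocab_size=1)
tokenizer = RobertaTokenizerFast.from_pretrained(OUT, model_max_length=512)
model = RobertaForMaskedLM(config=config)

# 3. Tokenize the corpus and train with 15% token masking.
dataset = load_dataset("text", data_files=CORPUS)["train"]
dataset = dataset.map(
    lambda ex: tokenizer(ex["text"], truncation=True, max_length=512),
    batched=True)
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer,
                                           mlm=True, mlm_probability=0.15)
trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir=OUT, num_train_epochs=1,
                           per_device_train_batch_size=16),
    data_collator=collator,
    train_dataset=dataset)
trainer.train()
trainer.save_model(OUT)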