Search

Results

Run an LLM Locally with LM Studio - KDnuggets

[https://www.kdnuggets.com/run-an-llm-locally-with-lm-studio] - 2024-03-12 02:29:38 - public:mzimmerm

ai, doc, llm, lmstudio - 4 | id:1489896 -

Document about LM Studio

Optimum

[https://huggingface.co/docs/optimum/index] - 2024-03-11 19:44:39 - public:mzimmerm

ai, doc, huggingface, llm, model, optimum, repo, small, transformer - 9 | id:1489894 -

Optimum is an extension of Transformers that provides a set of performance optimization tools to train and run models on targeted hardware with maximum efficiency. It is also the repository of small, mini, tiny models.

BERT Transformers – How Do They Work? | Exxact Blog

[https://www.exxactcorp.com/blog/Deep-Learning/how-do-bert-transformers-work] - 2024-03-11 04:39:00 - public:mzimmerm

ai, bert, doc, good, llm, parameter, progress, todo, transformer - 9 | id:1489882 -

Excellent document about BERT transformers / models and their parameters: - L = number of layers (Transformer blocks). - H = hidden size = the dimensionality of the vector that represents each token in the sentence. - A = number of self-attention heads. - Total parameters.
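How L, H and A relate to the total parameter count can be sketched with a little arithmetic. This is a rough sketch, not taken from the linked article: it assumes the standard BERT encoder layout, a WordPiece vocabulary of 30522 tokens, 512 positions, 2 segment types and a feed-forward size of 4*H. The function name `bert_param_count` is hypothetical.

```python
def bert_param_count(L, H, A, vocab_size=30522, max_positions=512, type_vocab=2):
    """Estimate the parameter count of a BERT encoder (assumed standard layout)."""
    if H % A != 0:
        raise ValueError("hidden size H must be divisible by the number of heads A")

    ffn = 4 * H          # assumed feed-forward (intermediate) size
    layer_norm = 2 * H   # scale and bias

    # Token, position and segment embeddings, followed by one LayerNorm.
    embeddings = (vocab_size + max_positions + type_vocab) * H + layer_norm

    # Query, key, value and output projections (weights + biases), then LayerNorm.
    # The heads only split H into A slices, so A does not change the count.
    attention = 4 * (H * H + H) + layer_norm

    # Two dense layers (H -> 4H -> H) with biases, then LayerNorm.
    feed_forward = (H * ffn + ffn) + (ffn * H + H) + layer_norm

    # Pooler: one dense layer applied to the first token's vector.
    pooler = H * H + H

    return embeddings + L * (attention + feed_forward) + pooler


if __name__ == "__main__":
    print(bert_param_count(L=12, H=768, A=12))    # BERT-base sized: 109482240
    print(bert_param_count(L=24, H=1024, A=16))   # BERT-large sized: 335141888
```

Under these assumptions the count grows linearly with L and roughly quadratically with H, while A only changes how the hidden vector is divided among the heads.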

Generative pre-trained transformer - Wikipedia

[https://en.wikipedia.org/wiki/Generative_pre-trained_transformer] - 2024-03-11 04:14:03 - public:mzimmerm

ai, doc, llm, todo - 4 | id:1489879 -

Most cost effective GPU for local LLMs? : LocalLLaMA

[https://www.reddit.com/r/LocalLLaMA/comments/12vxxze/most_cost_effective_gpu_for_local_llms/] - 2024-03-05 00:49:23 - public:mzimmerm

ai, doc, llm, model, optimize, perform - 6 | id:1489804 -

GGML quantized models. They would let you leverage CPU and system RAM, instead of having to rely on a GPU’s VRAM. This could save you a fortune, especially if you go for some used AMD Epyc platforms. This could be more viable for the larger models, especially the 30B/65B parameter models, which would still press or exceed the VRAM on the P40.
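The VRAM pressure mentioned in this note can be approximated with back-of-the-envelope arithmetic. The sketch below is an assumption-laden estimate, not something from the Reddit thread: it counts weights only (no KV cache or activations), uses nominal bits per weight (real GGML formats add a small per-block overhead), uses decimal gigabytes, and takes 24 GB as the P40's VRAM. The names `weight_memory_gb` and `P40_VRAM_GB` are hypothetical.

```python
P40_VRAM_GB = 24  # assumed VRAM of a Tesla P40, in decimal gigabytes


def weight_memory_gb(params_billion, bits_per_weight):
    """Approximate memory needed for the model weights alone, in GB (1e9 bytes)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


if __name__ == "__main__":
    for params in (30, 65):
        for bits in (16, 8, 4):
            gb = weight_memory_gb(params, bits)
            verdict = "fits in" if gb <= P40_VRAM_GB else "exceeds"
            print(f"{params}B at {bits}-bit: ~{gb:.1f} GB, {verdict} {P40_VRAM_GB} GB VRAM")
```

By this estimate a 30B model at 4 bits needs about 15 GB for weights, while a 65B model at 4 bits needs about 32.5 GB, which is why running from system RAM on the CPU becomes attractive for the largest models.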

Optimizing LLMs for Speed and Memory

[https://huggingface.co/docs/transformers/v4.35.2/en/llm_tutorial_optimization] - 2024-03-05 00:46:21 - public:mzimmerm

ai, doc, huggingface, llm, model, optimize, perform - 7 | id:1489803 -

7 Steps to Mastering Large Language Models (LLMs) - KDnuggets

[https://www.kdnuggets.com/7-steps-to-mastering-large-language-models-llms] - 2024-03-04 19:35:57 - public:mzimmerm

ai, doc, highlevel, llm, train - 5 | id:1489799 -

A Step-by-Step Guide to Training Your Own Large Language Models (LLMs). | by Sanjay Singh | GoPenAI

[https://blog.gopenai.com/a-step-by-step-guide-to-training-your-own-llm-2d81ff810695] - 2024-03-04 19:34:25 - public:mzimmerm

ai, doc, highlevel, llm, train - 5 | id:1489798 -

7 steps to master large language models (LLMs) | Data Science Dojo

[https://datasciencedojo.com/blog/master-large-language-models/#] - 2024-03-04 19:25:57 - public:mzimmerm

ai, doc, highlevel, llm, model, train - 6 | id:1489796 -

Up to date List of LLM Models

[https://docs.google.com/spreadsheets/d/1kT4or6b0Fedd-W_jMwYpb63e1ZR3aePczz3zlbJW-Y4/edit#gid=741531996] - 2024-03-04 19:13:58 - public:mzimmerm

ai, doc, list, llm, model - 5 | id:1489793 -

Large Language Models for Domain-Specific Language Generation: How to Train Your Dragon | by Andreas Mülder | Medium

[https://medium.com/@andreasmuelder/large-language-models-for-domain-specific-language-generation-how-to-train-your-dragon-0b5360e8ed76] - 2024-03-04 09:45:59 - public:mzimmerm

ai, article, code, doc, generate, llm, train - 7 | id:1489780 -

Training a model like Llama with 2.7 billion parameters outperformed a larger model like Vicuna with 13 billion parameters. Especially when considering resource consumption, this might be a good alternative to using a 7B foundation model instead of a full-blown ChatGPT. The best price-to-performance base model for our use case turned out to be Mistral 7B. The model is compact enough to fit into an affordable GPU with 24GB VRAM and outperforms the other models with 7B parameters.

Replit — How to train your own Large Language Models

[https://blog.replit.com/llm-training] - 2024-03-02 10:18:28 - public:mzimmerm

ai, doc, language, llm, model, train - 6 | id:1489728 -

High-level only; talks about training a model for a language.

How to train a new language model from scratch using Transformers and Tokenizers

[https://huggingface.co/blog/how-to-train] - 2024-03-02 09:48:13 - public:mzimmerm
