Run an LLM Locally with LM Studio - KDnuggets
Document about LM Studio
Optimum
Optimum is an extension of Transformers that provides performance-optimization tools for training and running models on targeted hardware with maximum efficiency. It is also a good place to find small, mini, and tiny model variants.
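A minimal sketch of what using Optimum looks like, assuming the ONNX Runtime backend and a public sentiment-classification checkpoint (the model id and task are illustrative, not from the note above):

```python
# Sketch: export a Transformers checkpoint to ONNX Runtime via Optimum.
from optimum.onnxruntime import ORTModelForSequenceClassification
from transformers import AutoTokenizer, pipeline

model_id = "distilbert-base-uncased-finetuned-sst-2-english"
# export=True converts the PyTorch weights to ONNX on the fly
model = ORTModelForSequenceClassification.from_pretrained(model_id, export=True)
tokenizer = AutoTokenizer.from_pretrained(model_id)

clf = pipeline("text-classification", model=model, tokenizer=tokenizer)
print(clf("Optimum makes ONNX export easy"))
```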
BERT Transformers – How Do They Work? | Exxact Blog
Excellent document about BERT transformers / models and their parameters: L = number of layers, H = size of the hidden layer (the dimensionality of the vector representing each token in the sentence), A = number of self-attention heads; together these determine the total parameter count.
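A rough back-of-the-envelope check of how L and H translate into a total parameter count (the config values below are the standard BERT-base ones, used here only as an example; A does not change the count because the heads just partition H):

```python
# Rough BERT parameter estimate from L (layers) and H (hidden size).
# Ignores biases, layer norms, and the pooler, so it slightly undercounts.
def approx_bert_params(L=12, H=768, vocab_size=30522, max_pos=512):
    embeddings = (vocab_size + max_pos + 2) * H      # token + position + segment embeddings
    per_layer = 4 * H * H + 2 * H * (4 * H)          # attention (Q,K,V,O) + feed-forward
    return embeddings + L * per_layer

print(f"~{approx_bert_params() / 1e6:.0f}M parameters")  # about 109M for BERT-base
```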
Generative pre-trained transformer - Wikipedia
AMD Ryzen AI CPUs & Radeon 7000 GPUs Can Run Localized Chatbots Using LLMs Just Like NVIDIA's Chat With RTX
LM Studio can be installed on Linux with an APU or GPU (though it looks like it may need an AI-capable CPU?) and can run LLMs locally. Install it on the laptop and test whether it works.
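One way to test it: LM Studio can start a local server that mimics the OpenAI API. The sketch below assumes the server is running on its default port 1234 with a model already loaded; the prompt and settings are just placeholders.

```python
# Quick smoke test against LM Studio's OpenAI-compatible local server.
# Assumes the server was started in LM Studio on http://localhost:1234.
import requests

resp = requests.post(
    "http://localhost:1234/v1/chat/completions",
    json={
        "messages": [{"role": "user", "content": "Say hello in one sentence."}],
        "temperature": 0.7,
    },
    timeout=60,
)
print(resp.json()["choices"][0]["message"]["content"])
```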
What is Epoch in Machine Learning?| UNext | UNext
Training and Validation Loss in Deep Learning | Baeldung on Computer Science
A Step-by-Step Guide to Model Evaluation in Python | by Shreya Singh | Medium
Most cost effective GPU for local LLMs? : LocalLLaMA
GGML quantized models. They let you leverage the CPU and system RAM instead of having to rely on a GPU's VRAM. This could save you a fortune, especially if you go for a used AMD Epyc platform. It could be more viable for the larger models, especially the 30B/65B-parameter ones, which would still push or exceed the VRAM on the P40.
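A minimal sketch of CPU-only inference with a quantized model via llama-cpp-python (the file path is a placeholder; newer builds of the library expect the GGUF successor of the GGML format):

```python
# Sketch: run a quantized model entirely on CPU with llama-cpp-python.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/llama-30b.q4_K_M.gguf",  # placeholder path to quantized weights
    n_ctx=2048,       # context window
    n_threads=16,     # CPU threads; tune to the Epyc core count
)

out = llm("Q: What is quantization? A:", max_tokens=128, stop=["Q:"])
print(out["choices"][0]["text"])
```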
Optimizing LLMs for Speed and Memory
7 Steps to Mastering Large Language Models (LLMs) - KDnuggets
A Step-by-Step Guide to Training Your Own Large Language Models (LLMs). | by Sanjay Singh | GoPenAI
GenAI Stack Exchange
7 steps to master large language models (LLMs) | Data Science Dojo
Up to date List of LLM Models
Large Language Models for Domain-Specific Language Generation: How to Train Your Dragon | by Andreas Mülder | Medium
Training a model like Llama with 2.7 billion parameters outperformed a larger model like Vicuna with 13 billion parameters. Especially when considering resource consumption, using a 7B foundation model instead of a full-blown ChatGPT can be a good alternative. The best price-to-performance base model for our use case turned out to be Mistral 7B: the model is compact enough to fit into an affordable GPU with 24 GB of VRAM and outperforms the other 7B-parameter models.
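A sketch of loading Mistral 7B in half precision so the weights (roughly 14 GB in fp16) fit a 24 GB GPU; the model id is the public mistralai checkpoint and the prompt is just a placeholder:

```python
# Sketch: load Mistral 7B in half precision on a single 24 GB GPU.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,   # half precision keeps VRAM usage under 24 GB
    device_map="auto",           # place layers on the available GPU
)

inputs = tokenizer("The best base model for our use case is", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=40)[0], skip_special_tokens=True))
```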
Yelp Review Classification. Using Embedding, CNN and LSTM | by Zhiwei Zhang | Medium
Simplest start with AI. Use the GitHub code linked in the article.
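In the spirit of that article, a minimal Keras sketch of an Embedding + CNN + LSTM text classifier; the vocabulary size, sequence length, and layer sizes are assumptions, not the values from the linked code:

```python
# Sketch: Embedding + Conv1D + LSTM binary classifier for review sentiment.
from tensorflow.keras import layers, models

vocab_size, max_len = 20000, 200
model = models.Sequential([
    layers.Embedding(vocab_size, 128, input_length=max_len),
    layers.Conv1D(64, 5, activation="relu"),
    layers.MaxPooling1D(4),
    layers.LSTM(64),
    layers.Dense(1, activation="sigmoid"),   # positive vs. negative review
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.summary()
```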
Replit — How to train your own Large Language Models
High-level only; talks about training a language model without implementation detail.
How to train a new language model from scratch using Transformers and Tokenizers
Describes how to train a new language model (for Esperanto) from scratch.
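A sketch of the first step described in that post, training a byte-level BPE tokenizer with the tokenizers library; the corpus path is a placeholder and the vocabulary size and special tokens follow the post's setup:

```python
# Sketch: train a byte-level BPE tokenizer on a raw Esperanto corpus.
from tokenizers import ByteLevelBPETokenizer

tokenizer = ByteLevelBPETokenizer()
tokenizer.train(
    files=["./data/esperanto_corpus.txt"],   # placeholder corpus path
    vocab_size=52_000,
    min_frequency=2,
    special_tokens=["<s>", "<pad>", "</s>", "<unk>", "<mask>"],
)
tokenizer.save_model("./esperberto-tokenizer")
```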