Mit wenigen Klicks zum eigenen KI-Chatbot: Warum ihr dieses Tool kennen solltet

[https://t3n.de/news/lokaler-ki-chatbot-lm-studio-1642969/] - 2024-08-28 06:03:49 - public:mzimmerm

ai, home, local, model, todo, train - 6 | id:1492821 -

Train and use my model

Who needs GitHub Copilot when you can roll your own AI code assistant at home • The Register

[https://www.theregister.com/AMP/2024/08/18/self_hosted_github_copilot] - 2024-08-26 16:07:40 - public:mzimmerm

ai, code, good, home, ide, intellij, local, model, todo, train - 10 | id:1492807 -

The Best GPUs for Deep Learning in 2023 — An In-depth Analysis

[https://timdettmers.com/2023/01/30/which-gpu-for-deep-learning/] - 2024-03-22 02:50:52 - public:mzimmerm

ai, good, gpu, learn, llm, todo, train - 7 | id:1490076 -

Training and Validation Loss in Deep Learning | Baeldung on Computer Science

[https://www.baeldung.com/cs/training-validation-loss-deep-learning] - 2024-03-09 23:23:38 - public:mzimmerm

ai, doc, error, evaluate, loss, train, validate - 7 | id:1489871 -

7 Steps to Mastering Large Language Models (LLMs) - KDnuggets

[https://www.kdnuggets.com/7-steps-to-mastering-large-language-models-llms] - 2024-03-04 19:35:57 - public:mzimmerm

ai, doc, highlevel, llm, train - 5 | id:1489799 -

A Step-by-Step Guide to Training Your Own Large Language Models (LLMs). | by Sanjay Singh | GoPenAI

[https://blog.gopenai.com/a-step-by-step-guide-to-training-your-own-llm-2d81ff810695] - 2024-03-04 19:34:25 - public:mzimmerm

ai, doc, highlevel, llm, train - 5 | id:1489798 -

7 steps to master large language models (LLMs) | Data Science Dojo

[https://datasciencedojo.com/blog/master-large-language-models/#] - 2024-03-04 19:25:57 - public:mzimmerm

ai, doc, highlevel, llm, model, train - 6 | id:1489796 -

LLM for a new language : MachineLearning

[https://www.reddit.com/r/MachineLearning/comments/12xu5ls/p_llm_for_a_new_language/] - 2024-03-04 19:15:48 - public:mzimmerm

ai, highlevel, llm, model, train - 5 | id:1489794 -

High level how to train a model

Introduction to Constructing Your Dataset | Machine Learning | Google for Developers

[https://developers.google.com/machine-learning/data-prep/construct/construct-intro] - 2024-03-04 11:38:48 - public:mzimmerm

ai, dataset, train - 3 | id:1489790 -

LLaMA 7B GPU Memory Requirement - Transformers - Hugging Face Forums

[https://discuss.huggingface.co/t/llama-7b-gpu-memory-requirement/34323/6] - 2024-03-04 10:10:38 - public:mzimmerm

ai, code, generate, llama, llm, model, newspeak, train - 8 | id:1489782 -

With the optimizers of bitsandbytes (like 8 bit AdamW), you would need 2 bytes per parameter, or 14 GB of GPU memory.

Large Language Models for Domain-Specific Language Generation: How to Train Your Dragon | by Andreas Mülder | Medium

[https://medium.com/@andreasmuelder/large-language-models-for-domain-specific-language-generation-how-to-train-your-dragon-0b5360e8ed76] - 2024-03-04 09:45:59 - public:mzimmerm

ai, article, code, doc, generate, llm, train - 7 | id:1489780 -

training a model like Llama with 2.7 billion parameters outperformed a larger model like Vicuna with 13 billion parameters. Especially when considering resource consumption, this might be a good alternative to using a 7B Foundation model instead of a full-blown ChatGPT. The best price-to-performance base model for our use case turned out to be Mistral 7b. The model is compact enough to fit into an affordable GPU with 24GB VRAM and outperforms the other models with 7B parameters.

google-research/bert: TensorFlow code and pre-trained models for BERT

[https://github.com/google-research/bert] - 2024-03-03 06:35:49 - public:mzimmerm

ai, bert, train - 3 | id:1489739 -

Simple Machine Learning Model in Python in 5 lines of code | by Raman Sah | Towards Data Science

[https://towardsdatascience.com/simple-machine-learning-model-in-python-in-5-lines-of-code-fe03d72e78c6] - 2024-03-02 11:26:11 - public:mzimmerm

ai, example, simple, todo, train - 5 | id:1489732 -

Yelp Review Classification. Using Embedding, CNN and LSTM | by Zhiwei Zhang | Medium

[https://medium.com/@zhiwei_zhang/yelp-review-classification-b2816d990429] - 2024-03-02 10:55:13 - public:mzimmerm

ai, doc, train, yelp - 4 | id:1489731 -

Simpliest start with ai. Use the Github code linked in

Fine-tune a pretrained model

[https://huggingface.co/docs/transformers/training] - 2024-03-02 10:39:40 - public:mzimmerm

ai, bert, code, example, good, huggingface, llm, notebook, progress, train, train-bert-on-yelp, tutorial - 12 | id:1489730 -

Use the Bert model to train on Yelp dataset

BigCode - Open and responsible development of LLMs for code

[https://www.bigcode-project.org/] - 2024-03-02 10:21:57 - public:mzimmerm

account, ai, computer, language, model, train - 6 | id:1489729 -

BigCode is an open scientific collaboration working on the responsible development and use of large language models for code

Replit — How to train your own Large Language Models

[https://blog.replit.com/llm-training] - 2024-03-02 10:18:28 - public:mzimmerm

ai, doc, language, llm, model, train - 6 | id:1489728 -

Hi level only talk about training for a language

How to train a new language model from scratch using Transformers and Tokenizers

[https://huggingface.co/blog/how-to-train] - 2024-03-02 09:48:13 - public:mzimmerm

ai, best, doc, good, language, llm, model, todo, train - 9 | id:1489725 -

Describes how to train a new language (desperanto) model.

Beyond Self-Attention: How a Small Language Model Predicts the Next Token | Shyam's Blog

[https://shyam.blog/posts/beyond-self-attention/] - 2024-02-17 17:32:12 - public:xxx

AI, LLM, machinelearning, ML, train, tutorials - 6 | id:1489566 -

yabs.io

Yet Another Bookmarks Service

Search

Results