BERT Transformers – How Do They Work? | Exxact Blog
[https://www.exxactcorp.com/blog/Deep-Learning/how-do-bert-transformers-work] - - public:mzimmerm
Excellent article about BERT transformer models and their parameters: L = number of layers; H = hidden size, i.e. the dimensionality of each token's vector representation; A = number of self-attention heads; and the resulting total parameter count.
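The relationship between L, H, A and the total parameter count can be sketched roughly as below. This is a back-of-the-envelope estimate (not the article's exact accounting): the vocabulary and position-table sizes are BERT's defaults, and small contributions like LayerNorm weights are ignored, so the result is approximate.

```python
def bert_params(L, H, A, vocab=30522, max_pos=512):
    """Rough parameter estimate for a BERT-style encoder.
    L = layers, H = hidden size, A = attention heads (must divide H)."""
    assert H % A == 0  # heads split H; A affects shape, not the count
    # token + position + segment embedding tables
    embed = vocab * H + max_pos * H + 2 * H
    # per layer: Q, K, V and output projections (H x H each) + biases
    per_layer = 4 * H * H + 4 * H
    # per layer: feed-forward H -> 4H -> H, with biases
    per_layer += 2 * (H * 4 * H) + 4 * H + H
    return embed + L * per_layer

# BERT-base (L=12, H=768, A=12) lands near the published ~110M figure
print(f"{bert_params(L=12, H=768, A=12):,}")
```

Doubling L roughly doubles the transformer-stack term but leaves the embedding term fixed, which is why H dominates the count for large models.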
Generative pre-trained transformer - Wikipedia
AMD Ryzen AI CPUs & Radeon 7000 GPUs Can Run Localized Chatbots Using LLMs Just Like NVIDIA's Chat With RTX
[https://wccftech.com/amd-ryzen-ai-cpus-radeon-7000-gpus-localized-chatbot-llms-like-nvidia-chat-with-rtx/] - - public:mzimmerm
LM Studio can be installed on Linux with an APU or GPU (it appears to require an AI-capable CPU, though?) to run LLMs. Install on the laptop and test whether it works.
A Step-by-Step Guide to Model Evaluation in Python | by Shreya Singh | Medium
[https://medium.com/@jscvcds/a-step-by-step-guide-to-model-evaluation-in-python-3a72dee92560] - - public:mzimmerm
How to train a new language model from scratch using Transformers and Tokenizers
[https://huggingface.co/blog/how-to-train] - - public:mzimmerm
Describes how to train a new language model (Esperanto) from scratch.