Search Results
google-research/bert: TensorFlow code and pre-trained models for BERT
BERT Transformers – How Do They Work? | Exxact Blog
Excellent document about BERT transformer models and their parameters:
- L = number of transformer layers
- H = hidden size, i.e. the dimensionality of the vector representing each token in the sentence
- A = number of self-attention heads
Together these determine the total parameter count.
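These values can be read straight off a checkpoint's config. A minimal sketch, assuming the Hugging Face transformers library (with PyTorch) and the small Google checkpoint listed next:

```python
from transformers import AutoConfig, AutoModel

name = "google/bert_uncased_L-4_H-256_A-4"
config = AutoConfig.from_pretrained(name)
print(config.num_hidden_layers)     # L = 4  (transformer layers)
print(config.hidden_size)           # H = 256 (size of each token vector)
print(config.num_attention_heads)   # A = 4  (self-attention heads)

# Total parameter count follows from L, H, A plus the embedding tables.
model = AutoModel.from_pretrained(name)
total = sum(p.numel() for p in model.parameters())
print(f"Total parameters: {total:,}")
```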
google/bert_uncased_L-4_H-256_A-4 · Hugging Face
Hugging Face hub entry for Google's released BERT models, including the small variants; this one (L=4, H=256, A=4) is a good model to start with for testing.
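A minimal smoke test for this checkpoint, assuming transformers and torch are installed; the example sentence is just illustrative:

```python
import torch
from transformers import AutoTokenizer, AutoModel

name = "google/bert_uncased_L-4_H-256_A-4"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name)

inputs = tokenizer("BERT is small but useful for testing.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# One 256-dimensional vector per input token: [batch, seq_len, hidden_size]
print(outputs.last_hidden_state.shape)
```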
Training Bert on Yelp - Copy of training.ipynb - Colaboratory
Getting Started w/BERT.ipynb - Colaboratory
Jupyter notebook to test BERT.
BERT 101 - State Of The Art NLP Model Explained
Best summary of Natural Language Processing and its terms:
- model: a language model, e.g. BertModel, which defines the encoder/decoder structure and its properties
- transformer: a specific neural network architecture based on the attention paper ("Attention Is All You Need")
- encoder: a stack of transformer layers applied to the input
- decoder: a stack of transformer layers that generates the output
BERT does NOT use a decoder; it is encoder-only. TensorFlow and PyTorch are both possible backends for the Transformers library. Summary: BERT is a highly complex and advanced language model that helps people automate language understanding.
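To illustrate the backend point, a minimal sketch (assuming transformers with both torch and tensorflow installed) that loads the same encoder-only checkpoint under either framework:

```python
from transformers import BertModel, TFBertModel

# The same pre-trained weights can back either framework.
pt_model = BertModel.from_pretrained("bert-base-uncased")    # PyTorch backend
tf_model = TFBertModel.from_pretrained("bert-base-uncased")  # TensorFlow backend

# Both are pure encoders: they map input tokens to hidden states and
# carry no decoder or generation head.
print(type(pt_model).__name__, type(tf_model).__name__)
```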
BERT vs GPT: A Tale of Two Transformers That Revolutionized NLP | by Tavva Prudhvith | Medium
Fine-tune a pretrained model
Hugging Face tutorial: use the BERT model to fine-tune on the Yelp reviews dataset.
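A condensed sketch of that tutorial's recipe, assuming the transformers and datasets packages; the subset sizes and output directory here are arbitrary choices for a quick run:

```python
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          Trainer, TrainingArguments)

dataset = load_dataset("yelp_review_full")          # 1-5 star reviews
tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")

def tokenize(batch):
    return tokenizer(batch["text"], padding="max_length", truncation=True)

tokenized = dataset.map(tokenize, batched=True)
train_small = tokenized["train"].shuffle(seed=42).select(range(1000))
eval_small = tokenized["test"].shuffle(seed=42).select(range(1000))

# One output label per star rating.
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-cased", num_labels=5)

args = TrainingArguments(output_dir="yelp_bert", num_train_epochs=1)
trainer = Trainer(model=model, args=args,
                  train_dataset=train_small, eval_dataset=eval_small)
trainer.train()
```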