Search

Results

google-research/bert: TensorFlow code and pre-trained models for BERT

[https://github.com/google-research/bert/] - 2024-03-11 04:44:09 - public:mzimmerm

ai, bert, github, home, llm, mini, model, tiny, transformer - 9 | id:1489883 -

BERT model home on github

BERT Transformers – How Do They Work? | Exxact Blog

[https://www.exxactcorp.com/blog/Deep-Learning/how-do-bert-transformers-work] - 2024-03-11 04:39:00 - public:mzimmerm

ai, bert, doc, good, llm, parameter, progress, todo, transformer - 9 | id:1489882 -

Excellent document about BERT transformers / models and their parameters: - L=number of layers. - H=size of the hidden layer = number of vectors for each word in the sentence. - A = Number of self-attention heads - Total parameters.

google/bert_uncased_L-4_H-256_A-4 · Hugging Face

[https://huggingface.co/google/bert_uncased_L-4_H-256_A-4] - 2024-03-11 04:19:21 - public:mzimmerm

ai, bert, huggingface, llm, model, parameter, small, todo - 8 | id:1489880 -

Repository of all Bert models, including small. Start using this model for testing.

Fine-tune a pretrained model

[https://huggingface.co/docs/transformers/training] - 2024-03-02 10:39:40 - public:mzimmerm

ai, bert, code, example, good, huggingface, llm, notebook, progress, train, train-bert-on-yelp, tutorial - 12 | id:1489730 -

yabs.io

Yet Another Bookmarks Service