Search
Results
deepseek-ai (DeepSeek)
[https://huggingface.co/deepseek-ai] - - public:mzimmerm
They have the 1.3B version!!! This may be the best to start with Newspeak. Should work train even on huggingcface
BERT 101 - State Of The Art NLP Model Explained
[https://huggingface.co/blog/bert-101] - - public:mzimmerm
Best summary of Natural Language Processing and terms - model (a language model - e.g. BertModel, defines encoder and decoder and their properties), transformer (a specific neural network based on attention paper), encoder (series of transformers on input), decoders (series of transformers on output). Bert does NOT use decoder. TensorFlow and PyTorch are possible backends to Transformers (NN). Summary: BERT is a highly complex and advanced language model that helps people automate language understanding.
How to train a new language model from scratch using Transformers and Tokenizers
[https://huggingface.co/blog/how-to-train] - - public:mzimmerm
Describes how to train a new language (desperanto) model.