Search
Results
deepseek-ai (DeepSeek)
[https://huggingface.co/deepseek-ai] - - public:mzimmerm
They have a 1.3B version! This may be the best one to start with for Newspeak. Training should work even on Hugging Face.
deepseek-ai/deepseek-coder-6.7b-instruct · Hugging Face
[https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct] - - public:mzimmerm
Another possible model. For coding capabilities, DeepSeek Coder achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks.
StarCoder: A State-of-the-Art LLM for Code
[https://huggingface.co/blog/starcoder] - - public:mzimmerm
The article includes a comparison with other code LLMs.
Fine-tune a pretrained model
[https://huggingface.co/docs/transformers/training] - - public:mzimmerm
Uses a BERT model, fine-tuned on the Yelp reviews dataset.