Search
Results
bigcode (BigCode)
[https://huggingface.co/bigcode] - - public:mzimmerm
Research community developing various code models, small and big. Models may not be instruct
WizardLM (WizardLM)
deepseek-ai (DeepSeek)
[https://huggingface.co/deepseek-ai] - - public:mzimmerm
They have the 1.3B version!!! This may be the best to start with Newspeak. Should work train even on huggingcface
Can Ai Code Results - a Hugging Face Space by mike-ravkine
[https://huggingface.co/spaces/mike-ravkine/can-ai-code-results] - - public:mzimmerm
Comparison of LLM models for coding
openchat/openchat-3.5-0106 · Hugging Face
[https://huggingface.co/openchat/openchat-3.5-0106] - - public:mzimmerm
Open source with lots of information. Uses Multiple undrelying models. Not sure how I would train for it
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face
[https://huggingface.co/blog/mixtral] - - public:mzimmerm
The Mixtral model is new, and seems to be good. Click on “Demo“ to test it
StarCoder: A State-of-the-Art LLM for Code
[https://huggingface.co/blog/starcoder] - - public:mzimmerm
Article has comparison with other code-LLM models
codellama (Code Llama) - Huggingface model for generating programs. Maybe can be used for Newspeak?
Fine-tune a pretrained model
[https://huggingface.co/docs/transformers/training] - - public:mzimmerm
Use the Bert model to train on Yelp dataset