bigcode (BigCode)
[https://huggingface.co/bigcode] - - public:mzimmerm
Research community developing code models of various sizes, small and large. The models may not be instruction-tuned.
WizardLM (WizardLM)
deepseek-ai (DeepSeek)
[https://huggingface.co/deepseek-ai] - - public:mzimmerm
They have a 1.3B version! This may be the best model to start with for Newspeak. Training should work even on Hugging Face.
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face
[https://huggingface.co/blog/mixtral] - - public:mzimmerm
The Mixtral model is new and seems to be good. Click on “Demo” to test it.
StarCoder: A State-of-the-Art LLM for Code
[https://huggingface.co/blog/starcoder] - - public:mzimmerm
The article includes a comparison with other code LLMs.
stabilityai (Stability AI) - Stable Diffusion running on Huggingface
[https://huggingface.co/stabilityai] - - public:mzimmerm
Chat and other models. Not open source, but instruction-tuned and relatively small (3B). The 3B instruct model may be the best to try on Newspeak.