Search
Results
Can Ai Code Results - a Hugging Face Space by mike-ravkine
Comparison of LLM models for coding
(26) Discord
openchat/openchat-3.5-0106 · Hugging Face
Open source with lots of information. Uses Multiple undrelying models. Not sure how I would train for it
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face
The Mixtral model is new, and seems to be good. Click on “Demo“ to test it
StarCoder: A State-of-the-Art LLM for Code
Article has comparison with other code-LLM models
huybery/Awesome-Code-LLM: An awesome and curated list of best code-LLM for research.
Hannibal046/Awesome-LLM: Awesome-LLM: a curated list of Large Language Model
Large language models and the rise of the AI code generators | InfoWorld
Review of LLM specialized for code generation
Large language model - Wikipedia
List of LLM models on Wikipedia
List of datasets for machine-learning research - Wikipedia
Mistral AI - Wikipedia
fast.ai – fast.ai—Making neural nets uncool again
OpenAI Codex - Wikipedia
Model which generates code for Python, Javascript, Go, Shell, Perl, Swifg, Ruby, PHP
codellama (Code Llama) - Huggingface model for generating programs. Maybe can be used for Newspeak?
Introducing Gemini: Google’s most capable AI model yet
Advanced coding Our first version of Gemini can understand, explain and generate high-quality code in the world’s most popular programming languages, like Python, Java, C++, and Go. Using a specialized version of Gemini, we created a more advanced code generation system, AlphaCode 2,
AI Code Tools: The Ultimate Guide in 2024
AI Code tools : Good summary. Does not talk about which pre-trained model they use. One is gemini (bard) -> alphacode2
Getting Started w/BERT.ipynb - Colaboratory
Jupyter notebook to test Bert
Introduction - Hugging Face NLP Course
Natural Languge processing - full course.
BERT 101 - State Of The Art NLP Model Explained
Best summary of Natural Language Processing and terms - model (a language model - e.g. BertModel, defines encoder and decoder and their properties), transformer (a specific neural network based on attention paper), encoder (series of transformers on input), decoders (series of transformers on output). Bert does NOT use decoder. TensorFlow and PyTorch are possible backends to Transformers (NN). Summary: BERT is a highly complex and advanced language model that helps people automate language understanding.
BERT vs GPT: A Tale of Two Transformers That Revolutionized NLP | by Tavva Prudhvith | Medium
google-research/bert: TensorFlow code and pre-trained models for BERT
Methods and tools for efficient training on a single GPU
Simple Machine Learning Model in Python in 5 lines of code | by Raman Sah | Towards Data Science
Yelp Review Classification. Using Embedding, CNN and LSTM | by Zhiwei Zhang | Medium
Simpliest start with ai. Use the Github code linked in
Fine-tune a pretrained model
Use the Bert model to train on Yelp dataset
BigCode - Open and responsible development of LLMs for code
BigCode is an open scientific collaboration working on the responsible development and use of large language models for code
Replit — How to train your own Large Language Models
Hi level only talk about training for a language
Hugging Face – The AI community building the future.
My Account and profile on Huggingface - the home of AI transformers, models, training sets
The 29 Best (And Free) ChatGPT And Generative AI Courses And Resources
How to train a new language model from scratch using Transformers and Tokenizers
Describes how to train a new language (desperanto) model.