Large Language Models (LLM)
Coding assistant
- ChatGPT Is An Extra-Ordinary Python Programmer
- Developers with AI assistants need to follow the pair programming model
- Hard Truths About Generative AI for Technology Leaders
- My colleague Julius
- Can LLMs write better code if you keep asking them to "write better code"?
- Large Chainsaw Model
- TabbyML: self-hosted AI coding assistant
- The 2025 AI Engineer Reading List
- LLM code generation workflow
- Aider: AI pair programming in your terminal
- codegen: Python SDK to Interact with Intelligent Code Generation Agents
- I'd rather read the prompt
- The Problem with "Vibe Coding"
- Emerging Patterns in Building GenAI Products
- The Hidden Cost of AI Coding
- AI code is legacy code from day one
- sourcebot: a self-hosted tool that helps you understand your codebase
- AI Is A Floor Raiser Not A Ceiling Raiser
- Design Partner
- In Praise Of Normal Engineers
- Building your own CLI Coding Agent with Pydantic-AI (a minimal sketch follows this list)
- Why Your Prompts Don't Belong in Git
- Designing agentic loops
- Why Generative AI Coding Tools and Agents Do Not Work For Me
- Micromanaged Driven Development: Build all your code with AI and keep full control
- Agentic Coding Intro
- Agentic Coding Handbook
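
The "Building your own CLI Coding Agent with Pydantic-AI" entry above lends itself to a concrete illustration. Below is a minimal sketch, assuming the `pydantic-ai` package is installed and an OpenAI API key is in the environment; the model name, system prompt, and file-reading tool are illustrative choices, not the article's exact code.

```python
# Minimal CLI coding-agent sketch with pydantic-ai.
# Assumptions: `pip install pydantic-ai`, OPENAI_API_KEY set; model and tool are illustrative.
from pydantic_ai import Agent

agent = Agent(
    "openai:gpt-4o",
    system_prompt="You are a careful pair programmer. Propose small, reviewable changes.",
)

@agent.tool_plain
def read_file(path: str) -> str:
    """Let the model inspect a file before suggesting edits."""
    with open(path, encoding="utf-8") as f:
        return f.read()

if __name__ == "__main__":
    result = agent.run_sync("Read main.py and suggest one refactor.")
    print(result.output)  # recent releases expose .output; older versions used .data
```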
Distillation
Evaluation
- Chatbot Arena: Benchmarking LLMs in the Wild
- DeepEval: Unit Testing for LLMs
- LLM Evaluation
- Auditing the Ask Astro LLM Q&A app
- giskard: Open-Source Evaluation & Testing for LLMs and ML models
- AI models collapse when trained on recursively generated data
- Beyond Traditional Testing: Addressing the Challenges of Non-Deterministic Software
- Introduction to Large Language Models
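
Tools like DeepEval and giskard above automate what is, at its core, a loop over test cases with a scoring function. Here is a bare-bones sketch of that loop, with a placeholder `ask_llm` and the crudest possible exact-match metric; real suites use semantic or model-graded scoring.

```python
# Simplest possible LLM evaluation loop: run fixed prompts, score the answers.
cases = [
    ("What is 2 + 2?", "4"),
    ("Capital of France?", "Paris"),
]

def ask_llm(prompt: str) -> str:   # placeholder: swap in a real model call
    return {"What is 2 + 2?": "4", "Capital of France?": "paris"}[prompt]

passed = sum(ask_llm(q).strip().lower() == a.lower() for q, a in cases)
print(f"{passed}/{len(cases)} cases passed")
```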
Explanation
- What is ChatGPT doing and why does it work
- Large language models, explained with a minimum of math and jargon
- Inside GPT: Understanding the text generation
- Understand how BERT constructs state-of-the-art embeddings
- From encoding to embeddings
- Large Language Models: Sentence-BERT
- Large Language Models: RoBERTa, a Robustly Optimized BERT Approach
- Generative AI exists because of the transformer: this is how it works
- All you need to know to Develop using Large Language Models
- A non-exhaustive but essential list of key papers that underpin text-to-video Deep Generative models like SORA
- Do large language models understand the world?
- Explaining generative language models to (almost) anyone
- The Rise of the LLM OS: From AIOS to MemGPT and beyond
- Will We Run Out of Data? Limits of LLM Scaling Based on Human-Generated Data
- Llama 2: Open Foundation and Fine-Tuned Chat Models
- How LLMs Work, Explained Without Math
- Foundations of Large Language Models
- Transformers and Large Language Models cheatsheet for Stanford's CME 295
- Dummy's Guide to Modern LLM Sampling
- A cheat sheet for why using ChatGPT is not bad for the environment
- The Cultural Divide between Mathematics and AI
- 36 Alternatives to LLM Context
- Boring is good
- The security paradox of local LLMs
- Transformer Explainer: Interactive Learning of Text-Generative Models
- Text Tokens As Image Tokens
- Spec Driven Development
Foundation models
Framework
- LangChain: Building applications with LLMs through composability
- Cheshire-Cat: Production ready AI assistant framework
- OLMo: a State-of-the-Art, Truly Open LLM and Framework
- DSPy: the framework for programming - not prompting! - foundation models (a minimal sketch follows this list)
- A programming framework for agentic AI
- Open Source Frameworks for Building Generative AI Applications
- LangChain for EDA: Build a CSV Sanity-Check Agent in Python
- Datapizza AI: a framework to build Gen AI agentic solutions
- Datapizza Buonanotte: Scientific Story Generator demonstrating DataPizza AI Agents
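
As a taste of the DSPy entry above, here is a minimal sketch written in the style of the DSPy 2.x quickstart; the model name is an assumption, and the configuration calls have shifted between releases.

```python
# Minimal DSPy sketch: declare a signature, let the framework handle prompting.
# Assumptions: `pip install dspy`, OPENAI_API_KEY set; follows the 2.x quickstart style.
import dspy

dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))

qa = dspy.Predict("question -> answer")          # a signature, not a hand-written prompt
prediction = qa(question="What does RAG stand for?")
print(prediction.answer)
```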
Implementation
- GPT in 60 Lines of NumPy
- privateGPT
- All You Need to Know to Build Your First LLM App
- How to Run LLMs Locally
- A Gentle Introduction to LLM APIs
- How to build a basic LLM GPT model from Scratch in Python
- Writing an LLM from scratch (a toy from-scratch sketch follows this list)
- Teach your LLM about me
- nanochat by Andrej Karpathy: a full-stack implementation of an LLM like ChatGPT in a single, clean, minimal, hackable, dependency-lite codebase
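
As a companion to the from-scratch entries above ("GPT in 60 Lines of NumPy", "Writing an LLM from scratch"), here is a deliberately tiny stand-in for a language model: a character-level bigram model trained by counting and sampled autoregressively. It shows the train-then-sample loop those resources build on; it is not a transformer.

```python
# Toy language model: count character bigrams, then sample from the learned probabilities.
import numpy as np

text = "hello world, hello large language models"
chars = sorted(set(text))
stoi = {c: i for i, c in enumerate(chars)}
itos = {i: c for c, i in stoi.items()}

# "Training": count transitions with add-one smoothing, normalise rows to probabilities.
counts = np.ones((len(chars), len(chars)))
for a, b in zip(text, text[1:]):
    counts[stoi[a], stoi[b]] += 1
probs = counts / counts.sum(axis=1, keepdims=True)

# "Inference": repeatedly sample the next character given the previous one.
rng = np.random.default_rng(0)
idx = stoi["h"]
out = ["h"]
for _ in range(40):
    idx = rng.choice(len(chars), p=probs[idx])
    out.append(itos[idx])
print("".join(out))
```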
Libraries
- StarChat Playground by Hugging Face
- DeclarAI: turning Python code into production-ready LLM tasks
- codellama
- Attention Sinks in LLMs for endless fluency
- OpenLLM Leaderboard
- LMQL: a programming language for large language models
- GPT-Engineer
- magentic: easily integrate Large Language Models into your Python code
- AlphaCodium: From Prompt Engineering to Flow Engineering
- Cohere For AI Launches Aya: an LLM Covering More Than 100 Languages
- Gemma: a new family of open models
- Gemma 2 optimized for your local machine
- Unsloth: Finetune Llama 3.1, Mistral, Phi and Gemma
- Open WebUI: user-friendly WebUI for LLMs
- LangDrive: train LLMs on private data
- Trace: AutoDiff for AI Systems and LLM Agents
- LitGPT: 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale
- guidance: a guidance language for controlling large language models
- litellm: Python SDK, proxy server to call LLM APIs using the OpenAI format (a usage sketch follows this list)
- guardrails: adding guardrails to large language models
- Burr: build applications that make decisions (chatbots, agents, simulations). Monitor, trace, persist, and execute on your own infrastructure
- ell: a language model programming library
- JIT Implementation: A Python Library That Implements Your Code at Runtime
- ChainLit: Build Conversational AI in minutes
- DataChain: AI-data warehouse to enrich, transform and analyze unstructured data
- Simplemind: Python client for AI providers
- Docling: parse documents and export them to the desired format with ease and speed
- Posting: the modern API client that lives in your terminal
- TabPFN: Foundation Model for Tabular Data
- agx: AI Powered Analytics App
- torchexplorer: interactively inspect module inputs, outputs, parameters, and gradients
- token-explorer: a simple tool to explore different possible paths that an LLM might sample
- CoRT (Chain-of-Recursive-Thoughts): make AI think harder by having it argue with itself repeatedly
- agenticSeek: fully Local Manus AI
- blast: browser-LLM Auto-Scaling Technology
- Basic Memory: AI conversations that actually remember
- elroy: An AI assistant that remembers and sets goals
- langextract: A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization
- LLMs to Alloy
- Ollama Web search
- Llamafile: Distribute and run LLMs with a single file
- Agent Lightning: The absolute trainer to light up AI agents
- FastAPI MCP: Expose your FastAPI endpoints as Model Context Protocol tools
- TOON: Token-Oriented Object Notation
- Openskills: Skills assessment platform
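
Of the libraries above, litellm is the one most likely to show up in day-to-day code: it exposes many providers behind the OpenAI chat-completions shape. A minimal sketch, assuming an OPENAI_API_KEY is set and using an illustrative model name.

```python
# Call a model through litellm's OpenAI-style interface.
# Assumptions: `pip install litellm`, OPENAI_API_KEY set; swap the model string for other providers.
from litellm import completion

response = completion(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Summarise what a context window is in one sentence."}],
)
print(response.choices[0].message.content)
```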
MCP
- ask-human MCP
- fastmcp: The fast, Pythonic way to build MCP servers and clients (a minimal server sketch follows this list)
- Google A2A: Agent2Agent Protocol
- MCP As An Accidentally Universal Plugin
- Python MCP Server: Connect LLMs to Your Data
- Claude Skills are awesome, maybe a bigger deal than MCP
- What If You Don't Need MCP
- No OAuth Required: MCP Client with AWS IAM
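
The fastmcp entry above reduces an MCP server to a few decorated functions. A minimal sketch, assuming `pip install fastmcp`; the server name and tool are illustrative.

```python
# Minimal MCP server with fastmcp: one tool, run over the default stdio transport.
from fastmcp import FastMCP

mcp = FastMCP("demo-server")

@mcp.tool()
def word_count(text: str) -> int:
    """Count the words in a piece of text."""
    return len(text.split())

if __name__ == "__main__":
    mcp.run()  # stdio by default; MCP clients spawn this as a subprocess
```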
Methods
- LLM sampling (a temperature/top-k sketch follows this list)
- Open Source LLMs To Power An LLM Application
- NLP tasks via LLM
- Methods For Improving Your Large Language Model
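
The "LLM sampling" entry above covers the knobs applied to the model's output distribution before a token is drawn. The sketch below shows two of them, temperature and top-k, on a toy logits vector; it is plain numpy, not any particular library's sampler.

```python
# Temperature + top-k sampling over a toy logits vector.
import numpy as np

def sample(logits, temperature=0.8, top_k=3, rng=np.random.default_rng(0)):
    logits = np.asarray(logits, dtype=float) / max(temperature, 1e-6)  # sharpen/flatten
    cutoff = np.sort(logits)[-top_k]                                   # k-th largest score
    logits = np.where(logits < cutoff, -np.inf, logits)                # drop the rest
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()                                               # softmax over survivors
    return rng.choice(len(probs), p=probs)

vocab = ["the", "a", "cat", "dog", "sat"]
print(vocab[sample([2.0, 1.5, 0.3, 0.2, -1.0])])
```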
Prompt engineering
- Pushing Prompt Engineering to the Limit
- Mastering Prompt Engineering
- The Prompt Engineering Playbook For Programmers
- Promptz.dev
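
Most of the playbooks above come down to structuring the prompt so the model can pattern-match on examples. Here is a tiny few-shot template in plain Python; the reviews and labels are made up, and `build_prompt` is ordinary string formatting, not any library's API.

```python
# Few-shot prompt template: the examples steer the model toward the expected output format.
FEW_SHOT = """Classify the sentiment of each review as positive or negative.

Review: "The battery died after two days."
Sentiment: negative

Review: "Setup took thirty seconds and it just works."
Sentiment: positive

Review: "{review}"
Sentiment:"""

def build_prompt(review: str) -> str:
    return FEW_SHOT.format(review=review)

print(build_prompt("The screen is gorgeous but the speakers crackle."))
```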
RAG
- GraphRAG: a modular graph-based Retrieval-Augmented Generation (RAG) system
- llmware: unified framework for building enterprise RAG pipelines with small, specialized models
- talkd/dialog: RAG LLM Ops App for easy deployment and testing
- A RAG from scratch to query the scikit-learn documentation (a bare-bones retrieve-then-generate sketch follows this list)
- Production RAG: what I learned from processing 5M+ documents
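
The "RAG from scratch" entry above boils down to: rank documents against the query, then stuff the top hits into the prompt. The sketch below fakes the embeddings with bag-of-words counts and stubs out the model call, so it runs as-is but is only the skeleton of a real pipeline.

```python
# Retrieve-then-generate skeleton: bag-of-words "embeddings", cosine ranking, stubbed LLM call.
from collections import Counter
import math

docs = [
    "RandomForestClassifier fits a number of decision trees on sub-samples.",
    "StandardScaler standardizes features by removing the mean and scaling to unit variance.",
    "train_test_split splits arrays into random train and test subsets.",
]

def vectorize(text):
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm or 1.0)

def retrieve(query, k=2):
    q = vectorize(query)
    return sorted(docs, key=lambda d: cosine(q, vectorize(d)), reverse=True)[:k]

def ask_llm(prompt):          # placeholder: swap in any chat-completion call
    return f"[model answer based on a prompt of {len(prompt)} chars]"

question = "How do I split my data into train and test sets?"
context = "\n".join(retrieve(question))
print(ask_llm(f"Answer using only this context:\n{context}\n\nQuestion: {question}"))
```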
Regulation
Vector databases
Visuals
- A Visual Guide to Mamba and State Space Models
- A Visual Guide to Quantization
- "Attention, Please!": A Visual Guide To The Attention Mechanism
- Official code repo for the O'Reilly Book "Hands-On Large Language Models"
- Understanding Transformers Using A Minimal Example
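
The attention guides above all end up drawing the same formula, softmax(QK^T / sqrt(d)) V. A minimal numpy rendering, with random matrices standing in for real query/key/value projections.

```python
# Scaled dot-product attention in numpy.
import numpy as np

def attention(Q, K, V):
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                   # similarity of each query to each key
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
    return weights @ V                              # weighted mix of the values

rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((4, 8)) for _ in range(3))
print(attention(Q, K, V).shape)  # (4, 8): one mixed value vector per query
```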