Pretraining

nanollama: Train Your Own Llama 3 From Scratch on Custom Data
Pretrain Llama 3 architecture models from raw text, export to GGUF, and run with llama.cpp. Forked from Karpathy's nanochat. 46M to 7B parameters.
Feb 23, 2026