Georgi Gerganov's CPU-and-Metal LLM inference engine in C/C++. Powers Ollama, LM Studio, and basically every 'local LLM' app on the planet.

From Wikipedia

llama.cpp is an open-source software library that performs inference on various large language models such as Llama. It is co-developed alongside the GGML project, a general-purpose tensor library.

Read on Wikipedia ↗

Open source ↗

01
Lv 1 · Browser0 pts
0 / 100 to Lv 2+1 / 200px scrolled
Theme
Display
Density