Component technology layer

Inference and serving runtimes

10 alternatives, 9 open source, 1 node kind, retrieved 2026-06-10

Local runtimes, inference servers, and LLM gateways.

GitHub stars among selected inference and serving alternatives.

Ranked alternatives

Popularity-ranked entries linked back to graph nodes and deterministic sources.

| Rank | Component (graph node) | Kind | Openness | GitHub stars (as of 2026-06-10) | Tasks | Source |
|------|------------------------|------|----------|---------------------------------|-------|--------|
| 1 | Ollama (software:ollama) | software | open source | 173,713 | local-inference, model-serving | source |
| 2 | llama.cpp (software:llama.cpp) | software | open source | 115,795 | local-inference, model-serving | source |
| 3 | vLLM (software:vllm) | software | open source | 82,361 | model-serving, inference | source |
| 4 | LiteLLM (software:litellm) | software | open core | 49,818 | model-routing, api-proxy | source |
| 5 | LocalAI (software:localai) | software | open source | 46,758 | local-inference | source |
| 6 | Ray (software:ray) | software | open source | 42,824 | distributed-inference | source |
| 7 | SGLang (software:sglang) | software | open source | 28,903 | model-serving | source |
| 8 | Text Generation Inference (software:text-generation-inference) | software | open source | 10,860 | model-serving | source |
| 9 | Triton Inference Server (software:triton-inference-server) | software | open source | 10,742 | model-serving | source |
| 10 | KServe (software:kserve) | software | open source | 5,561 | model-serving | source |
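
The summary counts at the top of the page (10 alternatives, 9 open source, 1 node kind) and the rank order can be rechecked from the table rows. The following is a minimal sketch, not the site's actual pipeline: the tuple layout and the helper names `rank_by_stars` and `summarize` are assumptions made for illustration, and the star counts are copied from the table as retrieved on 2026-06-10.

```python
from collections import Counter

# Rows copied from the ranked table: (component, node_id, kind, openness, github_stars).
# The tuple layout is an illustrative assumption, not the page's real schema.
ROWS = [
    ("Ollama", "software:ollama", "software", "open source", 173713),
    ("llama.cpp", "software:llama.cpp", "software", "open source", 115795),
    ("vLLM", "software:vllm", "software", "open source", 82361),
    ("LiteLLM", "software:litellm", "software", "open core", 49818),
    ("LocalAI", "software:localai", "software", "open source", 46758),
    ("Ray", "software:ray", "software", "open source", 42824),
    ("SGLang", "software:sglang", "software", "open source", 28903),
    ("Text Generation Inference", "software:text-generation-inference",
     "software", "open source", 10860),
    ("Triton Inference Server", "software:triton-inference-server",
     "software", "open source", 10742),
    ("KServe", "software:kserve", "software", "open source", 5561),
]


def rank_by_stars(rows):
    """Return the rows sorted by star count, highest first (hypothetical helper)."""
    return sorted(rows, key=lambda row: row[4], reverse=True)


def summarize(rows):
    """Recompute the header counts from the rows (hypothetical helper)."""
    openness = Counter(row[3] for row in rows)
    return {
        "alternatives": len(rows),
        "open_source": openness["open source"],
        "node_kinds": len({row[2] for row in rows}),
    }


ranked = rank_by_stars(ROWS)
summary = summarize(ROWS)

print(summary)
for rank, row in enumerate(ranked, start=1):
    print(f"{rank:>2}  {row[0]:<28} {row[4]:>9,}")
```

LiteLLM is the single "open core" entry, which is why the open source count is 9 rather than 10.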