Run and serve large language models: local inference, production serving, and model management.
Ranked by score. Updated weekly.
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
The official Python library for the OpenAI API (usage sketch after this list)
Model framework for state-of-the-art ML
LLM inference in C/C++
High-throughput LLM inference and serving engine (offline-generation sketch after this list)
SDK and proxy to call 100+ LLM APIs in OpenAI format (provider-swap sketch after this list)
Self-hosted AI interface for LLMs
Open-source AI engine: run any model locally
Build and share ML demo apps in Python (demo sketch after this list)
AI compute engine for ML workloads at scale
Wraps Gemini CLI, Antigravity, ChatGPT Codex, Claude Code, Qwen Code, and iFlow as an OpenAI/Gemini/Claude/Codex-compatible API service, letting you use the free Gemini 2.5 Pro, GPT-5, Claude, and Qwen models through an API (client sketch after this list)
Multi-type data labeling and annotation
Array framework for Apple silicon
Open source AI/ML lifecycle platform
LLM inference server with continuous batching and SSD caching for Apple Silicon, managed from the macOS menu bar.
Open Multi-Agent Interactive Classroom: get an immersive, multi-agent learning experience in just one click
Open source LLM engineering platform
ML experiment tracking
Local LLM interface with text, vision, and training
TensorRT-LLM provides an easy-to-use Python API to define large language models (LLMs) and supports state-of-the-art optimizations for efficient inference on NVIDIA GPUs. It also includes components for building Python and C++ runtimes that orchestrate inference execution efficiently. A sketch of the Python API follows this list.
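
For "The official Python library for the OpenAI API": the basic call is a chat completion. A minimal sketch, assuming the `openai` package is installed and `OPENAI_API_KEY` is set; the model name is illustrative.

```python
from openai import OpenAI

# Reads OPENAI_API_KEY from the environment by default.
client = OpenAI()

# Single-turn chat completion; the model name is an illustrative choice.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Summarize continuous batching in one sentence."}],
)
print(response.choices[0].message.content)
```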
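
For the high-throughput inference and serving engine (the description matches vLLM): offline batched generation is the core pattern. A sketch under that assumption; the model ID and sampling settings are illustrative.

```python
from vllm import LLM, SamplingParams

# Load the model once; the engine batches prompts internally for throughput.
llm = LLM(model="Qwen/Qwen2.5-0.5B-Instruct")  # illustrative model ID
params = SamplingParams(temperature=0.8, max_tokens=64)

outputs = llm.generate(["What is paged attention?", "Define KV cache."], params)
for out in outputs:
    print(out.outputs[0].text)
```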
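
For the SDK/proxy that calls 100+ LLM APIs in OpenAI format (the description matches LiteLLM): the point is that one call shape works across providers. A sketch under that assumption; the model names are illustrative and provider keys are expected in the environment.

```python
from litellm import completion

messages = [{"role": "user", "content": "Hello"}]

# Same call shape, different providers; keys come from env vars
# (e.g. OPENAI_API_KEY, ANTHROPIC_API_KEY). Model names are illustrative.
openai_resp = completion(model="gpt-4o-mini", messages=messages)
claude_resp = completion(model="anthropic/claude-3-5-sonnet-20240620", messages=messages)

print(openai_resp.choices[0].message.content)
print(claude_resp.choices[0].message.content)
```

Because the responses come back in OpenAI format, downstream code does not change when the provider does.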
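
For "Build and share ML demo apps in Python" (the description matches Gradio): the core pattern wraps a Python function in a web interface. A minimal sketch; the echo function is a stand-in for a real model call.

```python
import gradio as gr

def reply(prompt: str) -> str:
    # Stand-in for a real model call.
    return f"You said: {prompt}"

# Interface wires the function to web UI components; launch() serves it locally.
demo = gr.Interface(fn=reply, inputs="text", outputs="text")
demo.launch()
```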
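
For the CLI-wrapping proxy entry: since it exposes an OpenAI-compatible endpoint, a stock OpenAI client can target it by overriding the base URL. A sketch; the port, key, and model name are placeholders, not values taken from the project.

```python
from openai import OpenAI

# Point the stock OpenAI client at the local proxy.
# Base URL and key are illustrative placeholders; use whatever the proxy is configured with.
client = OpenAI(base_url="http://localhost:8080/v1", api_key="proxy-key")

response = client.chat.completions.create(
    model="gemini-2.5-pro",  # model name routed by the proxy; illustrative
    messages=[{"role": "user", "content": "Hello through the proxy"}],
)
print(response.choices[0].message.content)
```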
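
For the TensorRT-LLM entry: its high-level `LLM` Python API looks like the sketch below, assuming a supported NVIDIA GPU; the model ID is illustrative and details may vary by version.

```python
from tensorrt_llm import LLM, SamplingParams

# The LLM class builds or loads a TensorRT engine for the model behind the scenes.
llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")  # illustrative model ID
params = SamplingParams(max_tokens=64)

for output in llm.generate(["What does TensorRT optimize?"], params):
    print(output.outputs[0].text)
```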