
TensorRT-LLM
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
13.2k ★ · growth · Python · trending
The Lens
Analysis coming soon — this tool was recently discovered and is queued for editorial review.
Score
57/100 · C+
Adoption: 27/30
Maintenance: 20/25
Community: 5/20
License: 5/15
Analysis: 0/10
License: Other
Review license manually.
Commercial use: ✗ Restricted
About
- Owner: NVIDIA Corporation (Organization)
- Stars: 13,236
- Forks: 2,238