TensorRT: NVIDIA's inference optimizer that quantizes and fuses operations

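TensorRT optimizes deep learning inference on NVIDIA GPUs by quantizing weights and activations to lower precision (such as FP16 or INT8) and by fusing adjacent operations into single GPU kernels.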
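
To make the two ideas concrete, here is a minimal sketch in plain Python. It does not use the TensorRT API and is not how TensorRT is implemented. The function names (`quantize_int8`, `dequantize`, `fuse_affine`) and the sample values are hypothetical, chosen only for illustration. It assumes symmetric per-tensor INT8 quantization, and it stands in for fusion by folding a scale-and-shift step into the preceding linear step.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127]."""
    max_abs = max(abs(v) for v in values)
    # One scale for the whole tensor; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float values from the integer codes."""
    return [c * scale for c in codes]


def fuse_affine(weight, bias, gamma, beta):
    """Fold y = gamma * (weight * x + bias) + beta into one step y = w2 * x + b2."""
    return gamma * weight, gamma * bias + beta


# Quantization: four float weights become four small integers plus one scale.
weights = [0.5, -1.27, 0.02, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)

# Fusion: two arithmetic steps collapse into one with precomputed constants.
fused_w, fused_b = fuse_affine(weight=2.0, bias=1.0, gamma=0.5, beta=3.0)
```

The quantized codes take a quarter of the storage of 32-bit floats at the cost of a small rounding error, and the fused form computes the same result as the two original steps in a single pass.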

Image: BigRiz, CC BY-SA 3.0, via Wikimedia Commons
