ARM processors are the most widely used family of instruction set architectures
Image: kallerna, CC BY-SA 4.0, via Wikimedia Commons
ARM processors are the most widely used family of instruction set architectures
ARM Holdings licenses its RISC instruction set architectures to other companies for device production. These processors are integral to a vast array of devices, from smartphones to supercomputers. Their widespread adoption underscores ARM's significant impact on the technology industry.
Example
Fugaku, the world's fastest supercomputer from 2020 to 2022, uses ARM processors.
Understanding ARM's dominance helps grasp the importance of RISC architectures in modern computing.
a Triton @triton.jit decorator does: compiles a Python function into a GPU kernel
@triton.jit decorator compiles Python function into a GPU kernel
SASS is: the actual machine code that runs on NVIDIA GPU hardware
SASS: compiled machine code executing on NVIDIA GPU hardware
XLA does for TensorFlow/JAX: compiles computation graphs for TPU/GPU execution
XLA compiles computation graphs for TPU/GPU execution
torch.compile does in PyTorch 2.0: traces and optimizes the computation graph
torch.compile optimizes computation graph by tracing and compiling it for efficiency
instruction-level parallelism (ILP) achieves: multiple operations per clock cycle
Instruction-level parallelism (ILP) achieves: Multiple operations per clock cycle
CUDA
CUDA enables parallel computation on GPUs
One email a day: 5 concepts + the 5 stories that matter →
Swipe through 100 ML concepts daily
Open TickerNews