- controls how many requests an [[API]] can handle in a period of time (typically RPS / requests p/sec)