Skip to main content

AKU plans and scaling

Inkless uses Aiven Kafka Units (AKUs) to help you size Apache Kafka services based on throughput instead of hardware resources. An AKU represents the amount of traffic a service can handle. You estimate the expected throughput when creating the service. This estimate determines the initial AKU level and the scaling range.

How AKUs work

  • Each AKU corresponds to a specific throughput capacity. It represents the compute and memory resources required to meet that throughput.
  • The initial AKU level is derived from the expected throughput estimate provided during service creation.
  • The service monitors throughput over time.
  • When throughput remains above the threshold for the current AKU level for a period of time, the service scales up within your configured limits.
  • When throughput remains low for a sustained period, the service scales down.

Scaling changes the number of AKUs in use, which affects AKU-hour billing. Scaling actions do not affect topic configuration or data retention.

Throughput measurement

Inkless measures two types of traffic:

  • Ingress: Data written to topics by producers.
  • Egress: Data read from topics by consumers, connectors, and mirroring processes.

Both ingress and egress affect the number of AKUs required. You can track ingress and egress usage in the Service utilisation view, which also shows the AKU thresholds.

Autoscaling limits

Depending on your cloud provider and account, you can configure:

  • Minimum AKUs: The lowest capacity the service can scale down to.
  • Maximum AKUs: The highest capacity the service can scale up to.

Inkless scales automatically within these limits. Scaling occurs only when throughput remains above or below a threshold for a sustained period.

Storage and AKUs

Storage does not influence AKU scaling:

  • Diskless topics write directly to object storage.
  • Classic topics use local disk for recent data and move older segments to object storage through tiered storage.

Storage and compute scale independently, so you can adjust retention without changing AKU levels.

When to adjust AKU ranges

Adjust your AKU limits when:

  • Workload throughput increases for sustained periods.
  • Traffic spikes begin to persist for longer periods.
  • Reducing costs during low-traffic periods requires a lower maximum AKU.
  • The workload needs a guaranteed minimum level of throughput.

For details on how AKU usage affects billing, see Billing.

Related pages