Skip to content

High Availability

Ontul provides fault tolerance and high availability through multi-Master leader election, Worker health monitoring, and automatic failure recovery.

Master High Availability

Multiple Masters can run simultaneously in a leader/follower configuration:

  • Leader Election: Apache ZooKeeper (via Curator) elects a primary Master using a leader latch. The leader owns all write operations to the state store (RocksDB).
  • State Replication: The leader Master replicates catalog metadata, IAM policies, KMS keys, sessions, and connection credentials to follower Masters via the internal NIO protocol.
  • Automatic Failover: If the leader Master fails, ZooKeeper elects a new leader, which reloads persisted state from RocksDB and resumes operations.

All Masters can serve client queries — only write operations (catalog registration, IAM changes, etc.) are routed to the leader.

Worker Health Monitoring

The Master continuously monitors Worker health:

  • Scheduled health checks via the NIO protocol (configurable interval, default 10 seconds)
  • Configurable timeout (default 5 seconds) and failure threshold (default 3 consecutive failures)
  • Unhealthy Workers are automatically excluded from query planning
  • Recovered Workers are automatically re-included

Service Discovery

Masters and Workers register as ephemeral nodes in ZooKeeper. When a node joins or leaves the cluster, all other nodes are notified automatically. No manual configuration of cluster membership is required.

State Store

All cluster state is stored in embedded RocksDB — no external database is needed:

  • Catalog metadata and configurations
  • IAM users, groups, policies, and access keys
  • KMS encrypted keystore
  • Sessions and connection credentials
  • Job history and audit logs

Backup & Restore

Ontul supports cluster state backup and restore:

  • Backup creates a RocksDB checkpoint as a compressed archive
  • Backups can be stored locally or on S3
  • Restore loads a backup archive and rebuilds the state store