The company provides mission-critical airport gate management systems used worldwide. Their RabbitMQ deployment on Kubernetes was experiencing publish failures, cluster failover issues, and quorum queue synchronization problems that threatened system availability.
RabbitMQ was co-located with PostgreSQL on shared Kubernetes nodes, causing resource contention. Quorum queue synchronization was failing at 3,000 IOPS storage, and cluster partitions were causing zombie bindings and message publishing failures. The team needed to upgrade to RabbitMQ 4.2 while maintaining stability across Azure and VMware environments.
Kubernetes clusters across Azure and VMware, RabbitMQ with PostgreSQL co-location, Prometheus/Grafana monitoring, C and Node.js client applications.
AceMQ conducted a multi-week intensive assessment with daily sessions, performing disk benchmarking with FIO, analyzing RAFT consensus logs, and designing infrastructure isolation strategies. The team provided comprehensive documentation covering failover procedures, monitoring, backup/restore, and federation.
The client achieved stable RabbitMQ operations on Kubernetes with proper resource isolation, automated recovery procedures, and comprehensive monitoring — ensuring reliable messaging for airport systems worldwide.
Hardening RabbitMQ in Kubernetes environments with StatefulSet tuning, quorum queue optimization, storage isolation, and memory/network configuration.
Improving observability with Prometheus, Grafana, alerting, queue visibility, disk/memory thresholds, and retry metrics.
Whether you need architecture advisory, 24/7 support, or full managed services, AceMQ has the expertise to help.