Problem: The customer was running Zabbix 7.4 in a Docker-based environment and attempted to configure SAML authentication using Microsoft Entra ID. Although the SAML integration was partially configured and no critical errors were visible, the customer faced several issues. SAML authentication could not be selected as the default authentication method in the Zabbix UI. Users […]
Monitoring 8 Jan 2025 Resolving IPA Healthcheck Errors Due to Nonexistent ServersProblem: The client, a company using FreeIPA for identity management, encountered issues when running the ipa-healthcheck command. The system was returning errors related to non-existent servers, which had been decommissioned as part of a recent infrastructure migration. These errors were causing the ipa-healthcheck command to fail and reported old servers that no longer existed in […]
Security 6 Jan 2025 Optimizing Cassandra Cluster Performance on AzureProblem: The client was experiencing performance issues with a self-managed Cassandra database cluster hosted on Azure VMs. A recent surge in data traffic led to high CPU utilization, causing significant system slowness and increased latency. The environment utilized SSDs for storage, but earlier attempts at recommended SSD optimizations yielded no significant improvement. In light of […]
Database 3 Jan 2025 Resolving Indexing Failures in OpenSearch During High Availability TestingProblem: The client implemented a 4-node OpenSearch cluster to ensure high availability for their application. When all four nodes were operational, both indexing and searching worked seamlessly. However, during a high availability test where two nodes were intentionally turned off, the indexing process stalled, and no documents were processed. Indexing resumed only after the two […]
Data Analytics 1 Jan 2025 High Resource Utilization from Istio Sidecar ContainersProblem: The client, a FinTech company, managing thousands of microservices using Istio in sidecar proxy mode, faced high CPU and memory utilization. This was caused by the overhead from Istio sidecars, which were handling: Traffic encryption and decryption with mTLS. Traffic routing (rate limiting, retries) and policy management. Telemetry generation for monitoring and tracing tools. […]
Communication 30 Dec 2024 Resolving Timeout Issues for Internal Services in Istio-Managed EKS ClustersProblem: The client used Istio to manage service communication in a distributed microservices architecture. Centralized services, including Gitlab, Keycloak, Vault, and others, were hosted in an Amazon EKS cluster and accessed via a WireGuard-based VPN mesh (Netbird) from 10 external Kubernetes clusters. Despite having all services exposed through Istio ingress gateways, external clusters experienced frequent […]
Communication 27 Dec 2024 Optimizing DNS Resolution and Resolving Readiness Delays in Kubernetes with Istio and CrossplaneProblem: The client reported delays in the readiness of ingress virtual services and difficulty accessing services through DNS names. Despite using Istio for service-to-service communication and centralized services like Keycloak, Gitlab, Vault, and others, the setup was taking too long, especially when resolving DNS names for these services. The delay was primarily due to Crossplane […]
Communication 25 Dec 2024 Migration and Upgrade of Cassandra Cluster from On-Premises to AWSProblem: The client planned to migrate a 5-node Cassandra cluster from an on-premises environment (version 3.11.8) to AWS (target version 4.1.5). The client requested guidance on the best migration strategy that ensures no downtime. Additionally, the client requested information on backup and restore procedures for the migration. Solution: The expert recommended a step-by-step approach. First […]
Database 23 Dec 2024 Optimizing Performance in a Cassandra Cluster Experiencing High CPU UsageProblem: The client experienced issues with uneven data distribution across nodes after adding new nodes to an existing Cassandra cluster. Upon reviewing the “nodetool status” output, it was observed that new nodes were not receiving equal data compared to existing ones, resulting in significant data discrepancies between nodes. The client sought assistance in understanding why […]
Database