Problem: An official Apache NiFi Docker image (standard distribution) was scanned by a security tool and reported a high-severity finding: sonatype-2024-3350 (commons-collections version 3.2.2). The scanner identified the vulnerable class (SetUniqueList) inside the nifi-toolkit-assembly artifact (zip:bin, NiFi distribution version 2.8.0). The deployment context was a stock NiFi image pulled from the vendor registry — no […]
Case Studies Data Management and Analytics Data Analytics 3 Apr 2026 Expanding a KRaft Kafka Cluster by Two Brokers without Data LossProblem: Client requested a step‑by‑step procedure to add two new servers to an existing three‑node Kafka cluster running in KRaft mode, with the explicit requirement to avoid any data loss. The cluster’s existing configuration (server.properties) and current controller.quorum.voters format were provided for reference. The observable concern was that adding voters and brokers incorrectly could cause […]
Knowledge Base Case Studies Data Management and Analytics Data Analytics 16 Mar 2026 Debezium MySQL CDC: handling tables with spaces in their namesProblem: A MySQL table named with embedded spaces (schema-qualified as “dbo.Sourcing Id Master”) could not be used in the Debezium connector’s message.key.columns setting. The connector configuration contained the entry “message.key.columns”: “dbo.Sourcing Id Master:id” and the connector validation rejected it with an invalid-format error referencing the expected pattern for message.key.columns. Attempts to escape characters in the […]
Case Studies Data Management and Analytics Data Analytics 25 Feb 2026 Implementing Process-Group-Level RBAC in Apache NiFiProblem: A production team requested guidance to implement multi-tenancy and fine-grained RBAC in Apache NiFi so different users/groups would have isolated view and edit rights at the Process Group level. Requested capabilities included: allowing certain users to view and edit only a specific Process Group (no access to other canvas areas), defining Read‑Only vs Read/Write […]
Case Studies Data Management and Analytics Data Analytics 30 Dec 2025 Apache Airflow & Kubernetes: Job Creation Failures Caused by Stuck Deletions and FinalizersProblem: The client experienced repeated failures in two Apache Airflow DAGs responsible for launching Kubernetes Jobs. Each DAG followed a delete-and-recreate pattern using a fixed Kubernetes Job name. Although the Airflow task responsible for deleting the Job reported success, the subsequent Job creation consistently failed with a Kubernetes conflict error indicating that the Job already […]
Data Analytics 3 Nov 2025 Apache Kafka Broker Timeout and Message Loss IncidentProblem: The client, reported repeated network timeouts and instability in their Apache Kafka environment, resulting in missing Call Detail Records for specific time periods. The issue manifested as persistent connectivity errors between Kafka brokers and 100% CPU utilization on the node hosting critical services (UMS, App, and ODF pods). Process: Step 1: Initial Identification The […]
Data Analytics 1 Oct 2025 Prometheus Sidecar Vulnerability Leads to Compliance FailuresProblem: A Prometheus deployment failed a security compliance scan when Prisma Cloud flagged a high-severity vulnerability in the sidecar container image kiwigrid/k8s-sidecar:1.30.9. The issue was linked to the pip package, which was identified as vulnerable to arbitrary code execution. This raised concerns about both compliance and the security of the monitoring stack. Process: Step 1: […]
Data Analytics 5 Sep 2025 Auth Image is Not Updating in Airflow Scheduler PodProblem: The client reported that their Apache Airflow Scheduler pod was not automatically detecting updates from a Persistent Volume Claim (PVC) named airflow-dags. Meanwhile, a related airflow-triggerer pod using the same PVC was updating files as expected. To temporarily resolve the issue, the client had to manually restart (“bounce”) the scheduler pod to force it to recognize new […]
Data Analytics 2 Jun 2025 Implementing SSL and High Availability in a Multi-Node OpenSearch ClusterProblem: A financial services client encountered critical SSL-related errors while deploying a two-node OpenSearch 1.3.6 cluster for high availability. Despite both nodes appearing operational, accessing indices or interacting with the cluster through a Java application resulted in errors such as: SSLHandshakeException: Insufficient buffer remaining for AEAD cipher fragment CertPathBuilderException: unable to find valid certification path […]
Data Analytics