A
Airflow
Azure Cloud Partner
AWS Cloud Partner
Oracle Cloud Partner
Google GCP include Partner
Airflow scheduler executes your tasks on an array of workers while following the specified dependencies.
ABY
Apache Airbyte
Azure Cloud Partner
AWS Cloud Partner
Google GCP include Partner
Data Management and Analytics
Apache Airbyte is an open-source data integration engine that helps you consolidate your data in your data warehouses, lakes and databases.
ARF
Apache Airflow
Azure Cloud Partner
AWS Cloud Partner
Oracle Cloud Partner
Google GCP include Partner
The Apache Airflow scheduler executes your tasks on an array of workers while following the specified dependencies.
AHD
Apache Hadoop
Apache Hadoop is an open-source distributed processing framework that is used to process and analyze large datasets. It is based on the MapReduce programming model and can be deployed on clusters of commodity hardware.
MAP
Apache Hadoop MapReduce
Azure Cloud Partner
Data Management and Analytics
The Apache Hadoop MapReduce platform enables distributed data processing across large-scale clusters by dividing workloads into map and reduce tasks, delivering reliable and scalable batch data computation.
AHI
Apache Hive
Data Management and Analytics
Apache Hive is an open-source data warehouse software project built on top of Apache Hadoop. It provides a SQL-like interface for querying data stored in HDFS.
AIB
Apache Iceberg
Azure Cloud Partner
Data Management and Analytics
Apache Iceberg is an open table format for huge analytic datasets, enabling high-performance SQL querying on data lakes.
API
Apache Ignite
Azure Cloud Partner
Apache Ignite is an open-source in-memory data grid that can store and process data in memory for high performance and scalability.
KAF
Apache Kafka
Azure Cloud Partner
AWS Cloud Partner
Apache Kafka is a distributed event store and stream-processing platform.. The project aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds.
KYL
Apache Kylin
Azure Cloud Partner
AWS Cloud Partner
Data Management and Analytics
Apache Kylin is primarily needed for organizations that require real-time analytics and business intelligence (BI) on large-scale datasets.