AHD
Apache Hadoop
Apache Hadoop is an open-source distributed processing framework that is used to process and analyze large datasets. It is based on the MapReduce programming model and can be deployed on clusters of commodity hardware.
MAP
Apache Hadoop MapReduce
Azure Cloud Partner
Data Management and Analytics
The Apache Hadoop MapReduce platform enables distributed data processing across large-scale clusters by dividing workloads into map and reduce tasks, delivering reliable and scalable batch data computation.
YAR
Apache Hadoop YARN
Azure Cloud Partner
Data Management and Analytics
The Apache Hadoop YARN platform enables efficient resource management and job scheduling across Hadoop clusters, serving as the architectural center for scaling and executing distributed data processing applications.
AHB
Apache Hbase
Azure Cloud Partner
Apache HBase is an open-source NoSQL database that is built on top of Hadoop. It is a distributed, scalable, and fault-tolerant database that is well-suited for storing large amounts of unstructured data.
HDFS
Apache HDFS
Azure Cloud Partner
Infrastructure and Network
Apache HDFS is an open-source distributed file system that is part of the Apache Hadoop ecosystem. It is designed to store large amounts of data across clusters of commodity hardware.
AHI
Apache Hive
Data Management and Analytics
Apache Hive is an open-source data warehouse software project built on top of Apache Hadoop. It provides a SQL-like interface for querying data stored in HDFS.
AHS
Apache HTTP Server
Azure Cloud Partner
DevOps
Apache HTTP Server is an open-source web server that is one of the most popular web servers in the world. It is known for its stability, security, and performance.
AIB
Apache Iceberg
Azure Cloud Partner
Data Management and Analytics
Apache Iceberg is an open table format for huge analytic datasets, enabling high-performance SQL querying on data lakes.
API
Apache Ignite
Azure Cloud Partner
Apache Ignite is an open-source in-memory data grid that can store and process data in memory for high performance and scalability.
KAF
Apache Kafka
Azure Cloud Partner
AWS Cloud Partner
Apache Kafka is a distributed event store and stream-processing platform.. The project aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds.