Redis

By |2022-07-04T14:29:38+00:00February 1st, 2018|Informative, Redis, Technologies|

Overview Of Redis Architecture Redis is a in-memory, key-value data store. Redis is the most popular key-value data store. Redis is used by all big IT brands in this world. Amazon Elastic Cache supports [...]

Apache Tez

By |2024-07-31T13:22:34+00:00February 1st, 2018|Apache Tez, Informative, Technologies|

Apache Tez Introduction The Apache TEZ® project is aimed at building an application framework which allows for a complex directed-acyclic-graph of tasks for processing data. It is currently built atop Apache Hadoop YARN. The 2 [...]

Apache Drill

By |2022-07-04T14:43:17+00:00February 1st, 2018|Apache Drill, Informative, Technologies|

Apache Drill: Drill is an Apache open-source SQL query engine for Big Data exploration. Apache Drill is designed from the ground up to support high-performance analysis on the semi-structured and rapidly evolving data [...]

Presto

By |2022-07-04T14:38:41+00:00February 1st, 2018|Informative, Presto, Technologies|

WHAT IS PRESTO? Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. Presto was designed and written from [...]

Apache Storm 

By |2024-07-31T13:21:13+00:00February 1st, 2018|Apache Storm, Informative, Technologies|

Apache Storm OVERVIEW A system for processing streaming data in real time Apache™ Storm adds reliable real-time data processing capabilities to Enterprise Hadoop. Storm on YARN is powerful for scenarios requiring real-time analytics, machine [...]

Apache Cassandra

By |2024-07-31T12:56:48+00:00February 1st, 2018|Apache Cassandra, Informative, Technologies|

What is Apache Cassandra™? Apache Cassandra™, a top level Apache project born at Facebook and built on Amazon’s Dynamo and Google’s BigTable, is a distributed database for managing large amounts of structured data across many commodity servers, while [...]

Spark SQL 

By |2022-07-04T14:47:26+00:00January 31st, 2018|Informative, SparkSQL, Technologies|

Apache Spark is a lightning-fast cluster computing framework designed for fast computation. It is of the most successful projects in the Apache Software Foundation. Spark SQL is a new module in Spark which integrates relational processing with [...]

Apache Hive 

By |2024-07-31T13:08:10+00:00January 31st, 2018|Apache Hive, Informative, Technologies|

What does Apache Hive do? The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage and queried using SQL syntax. Built on top of Apache Hadoop™, Hive provides the following [...]

CONTACT US