ThirdEye Data Logo - For Official Use

Apache HBase

February 1st, 2018 (updated 2024-07-31) | Uncategorized

What is Apache HBase? Apache HBase is a popular, highly efficient column-oriented NoSQL database built on top of the Hadoop Distributed File System (HDFS) that supports real-time read/write operations on large datasets [...]

Apache Ignite

February 1st, 2018 (updated 2024-07-31) | Uncategorized

What is Apache Ignite? Apache Ignite is a memory-centric distributed database, caching, and processing platform for transactional, analytical, and streaming workloads, delivering in-memory speeds at petabyte scale. Durable Memory: Ignite's durable memory component treats RAM [...]

Apache Oozie 

February 1st, 2018 (updated 2022-07-04) | Uncategorized

OVERVIEW The blueprint for Enterprise Hadoop includes Apache™ Hadoop’s original data storage and data processing layers and also adds components for services that enterprises must have in a modern data architecture: data integration and [...]

MongoDB

February 1st, 2018 (updated 2022-07-04) | Uncategorized

MongoDB is an open-source document database that provides high performance, high availability, and automatic scaling. Document Database: A record in MongoDB is a document, which is a data structure composed of field and value [...]

Apache Drill

February 1st, 2018 (updated 2022-07-04) | Uncategorized

Apache Drill: Drill is an Apache open-source SQL query engine for Big Data exploration. Apache Drill is designed from the ground up to support high-performance analysis on the semi-structured and rapidly evolving data [...]

Presto

February 1st, 2018 (updated 2022-07-04) | Uncategorized

What is Presto? Presto is an open-source distributed SQL query engine for running interactive analytic queries against data sources of all sizes, ranging from gigabytes to petabytes. Presto was designed and written from [...]

Apache Storm 

February 1st, 2018 (updated 2024-07-31) | Uncategorized

Apache Storm Overview: A system for processing streaming data in real time. Apache™ Storm adds reliable real-time data processing capabilities to Enterprise Hadoop. Storm on YARN is powerful for scenarios requiring real-time analytics, machine [...]

Spark Streaming 

February 1st, 2018 (updated 2022-07-04) | Uncategorized

Spark Streaming is an extension of the core Spark API that enables scalable, high-throughput, fault-tolerant stream processing of live data streams. Spark Streaming can be used to stream live data and processing can happen [...]

Apache Cassandra

February 1st, 2018 (updated 2024-07-31) | Uncategorized

What is Apache Cassandra™? Apache Cassandra™, a top-level Apache project born at Facebook and built on Amazon’s Dynamo and Google’s Bigtable, is a distributed database for managing large amounts of structured data across many commodity servers, while [...]

Apache Hive 

January 31st, 2018 (updated 2024-07-31) | Uncategorized

What does Apache Hive do? The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage and queried using SQL syntax. Built on top of Apache Hadoop™, Hive provides the following [...]
