About Dj Das

This author has not yet filled in any details.
So far Dj Das has created 396 blog entries.

Amazon EMR

By |2026-02-23T14:02:04+00:00February 1st, 2018|AWS, Technologies|

What Is Amazon EMR? Amazon EMR is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and analyze vast amounts of data. By using these [...]

Apache Hbase

By |2024-07-31T10:17:16+00:00February 1st, 2018|Uncategorized|

What is Apache HBase? Apache Hbase is a popular and highly efficient Column-oriented NoSQL database built on top of Hadoop Distributed File System that allows performing read/write operations on large datasets in real time [...]

Apache Ignite

By |2024-07-31T13:09:00+00:00February 1st, 2018|Uncategorized|

What is Apache Ignite? Apache Ignite is a memory-centric distributed database, caching, and processing platform for transactional, analytical, and streaming workloads delivering in-memory speeds at petabyte scaleDurable Memory Ignite's durable memory component treats RAM [...]

Hadoop MapReduce

By |2022-07-04T14:23:30+00:00February 1st, 2018|Uncategorized|

MapReduce Tutorial: Introduction In this MapReduce Tutorial blog, I am going to introduce you to MapReduce, which is one of the core building blocks of processing in Hadoop framework. Before moving ahead, I would [...]

Apache Oozie 

By |2022-07-04T13:34:33+00:00February 1st, 2018|Uncategorized|

OVERVIEW The blueprint for Enterprise Hadoop includes Apache™ Hadoop’s original data storage and data processing layers and also adds components for services that enterprises must have in a modern data architecture: data integration and [...]

Mongo DB 

By |2022-07-04T13:51:55+00:00February 1st, 2018|Uncategorized|

MongoDB is an open-source document database that provides high performance, high availability, and automatic scaling. Document Database A record in MongoDB is a document, which is a data structure composed of field and value [...]

TensorFlow

By |2022-07-04T13:40:18+00:00February 1st, 2018|Uncategorized|

TensorFlow Architecture We designed TensorFlow for large-scale distributed training and inference, but it is also flexible enough to support experimentation with new machine learning models and system-level optimizations. This document describes the system architecture [...]

Apache Kudu

By |2022-07-04T14:35:39+00:00February 1st, 2018|Uncategorized|

Introducing Apache Kudu Kudu is a columnar storage manager developed for the Apache Hadoop platform. Kudu shares the common technical properties of Hadoop ecosystem applications: it runs on commodity hardware, is horizontally scalable, and [...]

Redis

By |2026-02-23T09:10:22+00:00February 1st, 2018|Open Source, Technologies|

Overview Of Redis Architecture Redis is a in-memory, key-value data store. Redis is the most popular key-value data store. Redis is used by all big IT brands in this world. Amazon Elastic Cache supports [...]

Apache Tez

By |2026-03-02T13:25:29+00:00February 1st, 2018|Open Source, Technologies|

Apache Tez Introduction The Apache TEZ® project is aimed at building an application framework which allows for a complex directed-acyclic-graph of tasks for processing data. It is currently built atop Apache Hadoop YARN. The 2 [...]

CONTACT US