About Dj Das

This author has not yet filled in any details.
So far Dj Das has created 395 blog entries.

Cassandra Range Query Made Simple

By |2019-01-02T08:56:51+00:00November 28th, 2018|Pranab Ghosh|

In Cassandra, rows are hash partitioned  by default. If you want to data sorted by some attribute, column name sorting feature of Cassandra is usually exploited. If you look at the Cassandra slice range [...]

Hive Plays Well with JSON

By |2018-11-28T11:08:04+00:00November 28th, 2018|Pranab Ghosh|

Hive Plays Well with JSON Hive is an abstraction on Hadoop Map Reduce. It provides a SQL like interface for querying HDFS data, whch accounts for most of it’s popularity.  In Hive, table structured data [...]

Data Normalization with Spark

By |2019-01-02T07:34:58+00:00November 27th, 2018|Pranab Ghosh|

Data Normalization with Spark Data normalization is a required data preparation step for many Machine Learning algorithms. These algorithms are sensitive to the relative values of the feature attributes. Data normalization is the process of bringing all the [...]

Anomaly Detection with Robust Zscore

By |2019-07-17T13:47:16+00:00November 27th, 2018|Pranab Ghosh|

Anomaly Detection with Robust Zscore Anomaly detection with with various statistical modeling based techniques are simple and effective. The Zscore based technique is one among them. Zscore is defined as the absolute difference between [...]

Ruling with Drools Rule Engine

By |2018-11-27T09:45:21+00:00November 22nd, 2018|Pranab Ghosh|

In a project several years ago I built a rule engine from scratch.  In a recent project, which needed a rule engine, I decided to take different route. I decided to give  Drools rule engine [...]

CONTACT US