Hive Plays Well with JSON

By |2018-11-28T11:08:04+00:00November 28th, 2018|Uncategorized|

Hive Plays Well with JSON Hive is an abstraction on Hadoop Map Reduce. It provides a SQL like interface for querying HDFS data, whch accounts for most of it’s popularity.  In Hive, table structured data [...]

Data Normalization with Spark

By |2019-01-02T07:34:58+00:00November 27th, 2018|Uncategorized|

Data Normalization with Spark Data normalization is a required data preparation step for many Machine Learning algorithms. These algorithms are sensitive to the relative values of the feature attributes. Data normalization is the process of bringing all the [...]

CONTACT US