Apache Spark 

By |2026-03-12T08:26:37+00:00January 30th, 2018|Technologies|

What is Apache Spark? Apache Spark is a fast and general engine for large-scale data processing. Speed Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk. Apache Spark [...]

Apache Pig

By |2026-03-12T08:23:31+00:00January 30th, 2018|Technologies|

What is Apache Pig? Apache Pig is a high-level language platform developed to execute queries on huge datasets that are stored in HDFS using Apache Hadoop. It is similar to SQL [...]

Apache Hadoop

By |2026-03-12T08:17:57+00:00January 29th, 2018|Technologies|

Apache Hadoop The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters [...]

CONTACT US