Free software Archives

Apache Tez

By Dj Das | 2022-07-04T13:45:33+00:00 | February 1st, 2018 | Apache Tez, Informative, Technologies

Introduction: The Apache Tez® project aims to build an application framework that supports a complex directed acyclic graph (DAG) of tasks for processing data. It is currently built atop Apache Hadoop YARN. The 2 main design [...]

Apache Drill

By Dj Das | 2022-07-04T14:43:17+00:00 | February 1st, 2018 | Apache Drill, Informative, Technologies

Apache Drill: Drill is an Apache open-source SQL query engine for Big Data exploration. Apache Drill is designed from the ground up to support high-performance analysis on the semi-structured and rapidly evolving data [...]

Presto

By Dj Das | 2022-07-04T14:38:41+00:00 | February 1st, 2018 | Informative, Presto, Technologies

What is Presto? Presto is an open-source distributed SQL query engine for running interactive analytic queries against data sources of all sizes, ranging from gigabytes to petabytes. Presto was designed and written from [...]

Apache Hive

By Dj Das | 2022-07-04T13:41:40+00:00 | January 31st, 2018 | Apache Hive, Informative, Technologies

The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage and queried using SQL syntax. Built on top of Apache Hadoop™, Hive provides the following features: Tools to enable easy [...]

Apache Flink

By Dj Das | 2022-07-04T14:01:44+00:00 | January 31st, 2018 | Apache Flink, Informative, Technologies

Introduction to Apache Flink®: Below is a high-level overview of Apache Flink and stream processing. Topics: Continuous Processing for Unbounded Datasets; Features: Why Flink?; Flink, the streaming model, and bounded datasets; The [...]

Cloudera Impala

By Dj Das | 2022-07-04T14:21:52+00:00 | January 31st, 2018 | Cloudera Impala, Informative, Technologies

Cloudera Impala provides fast, interactive SQL queries directly on your Apache Hadoop data stored in HDFS or HBase. In addition to using the same unified storage platform, Impala also uses the same metadata, SQL [...]

Apache Kafka

By Dj Das | 2022-07-04T14:18:59+00:00 | January 30th, 2018 | Apache Kafka, Informative, Technologies

We think of a streaming platform as having three key capabilities: It lets you publish and subscribe to streams of records. In this respect it is similar to a message queue or enterprise messaging [...]

Apache Spark

By Dj Das | 2022-07-04T14:31:47+00:00 | January 30th, 2018 | Apache Spark, Informative, Technologies

Apache Spark is a fast and general engine for large-scale data processing. Speed: Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk. Apache Spark has an advanced DAG [...]

Apache Pig

By Dj Das | 2022-07-04T13:49:15+00:00 | January 30th, 2018 | Apache Pig, Informative, Technologies

Apache Pig is a high-level language platform for executing queries on large datasets stored in HDFS using Apache Hadoop. It is similar to the SQL query language but applied [...]

Apache Hadoop

By Dj Das | 2022-07-04T14:34:41+00:00 | January 29th, 2018 | Apache Hadoop, Informative, Technologies

The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers [...]
