An Idea about Technology: Big Data Tool- Hive: A simple overview

In this blog I am here introduce some basic about Hive data warehouse system. The main goal is to understand hive and then working with hive. When we talk about hive then first question comes that is:

What is Hive? So ans is:

Hive is a data warehousing package/infrastructure built on top of Hadoop.It provides an SQL like dialect, called Hive Query Language(HQL) for querying data stored in a Hadoop cluster.HQL is the Hive query language. Like all SQL dialects in widespread use, it doesn’t fully conform to any particular revision of the SQL standard. It is perhaps closest to MySQL’s dialect, but with significant differences. Hive offers no support for rowlevel inserts, updates, and deletes. Hive doesn’t support transactions.So we can't compare it with RDBMS. Hive adds extensions to provide better performance in the context of Hadoop and to integrate with custom extensions and even external programs. It is well suited for batch processing data like: Log processing, Text mining, Document indexing, Customer-facing business intelligence, Predictive modeling, hypothesis testing etc.
We cannot compare it with traditional database system and it is not designed for online transaction processing and does not offer real-time queries.

Newer version of Hive community is trying to provide functionality of insert, update, and delete in Hive with full ACID support. You can check it here. Adding ACID to Apache Hive or you can see here also for more detail.
HIVE-5317 - Implement insert, update, and delete in Hive with full ACID support

An Idea about Technology

Timbakto

March 2, 2014

Big Data Tool- Hive: A simple overview

No comments:

Post a Comment

Thedeseart