In this post I introduce some basics of the Hive data warehouse system. The main goal is to understand Hive and then start working with it. When we talk about Hive, the first question that comes up is:
What is Hive?
Hive is a data warehousing package/infrastructure built on top of Hadoop. It provides an SQL-like dialect, called Hive Query Language (HQL), for querying data stored in a Hadoop cluster. Like all SQL dialects in widespread use, HQL doesn't fully conform to any particular revision of the SQL standard. It is perhaps closest to MySQL's dialect, but with significant differences: Hive offers no support for row-level inserts, updates, and deletes, and it doesn't support transactions, so we can't compare it directly with an RDBMS. Hive adds extensions to provide better performance in the context of Hadoop and to integrate with custom extensions and even external programs.
It is well suited to batch processing of data, for example log processing, text mining, document indexing, customer-facing business intelligence, predictive modeling, and hypothesis testing.
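To make the log-processing use case concrete, here is a minimal sketch of the kind of SQL-style aggregate query that HQL is meant for. It is not run against Hive: it assumes no Hadoop cluster is available and uses Python's built-in in-memory SQLite database as a stand-in. The table name `access_logs`, its columns, and the sample rows are all hypothetical. The query text itself sticks to basic constructs (SELECT, COUNT, GROUP BY, ORDER BY) that HQL also provides.

```python
import sqlite3

# Hypothetical sample data: (ip, url, status) rows standing in for a web access log.
SAMPLE_LOG_ROWS = [
    ("10.0.0.1", "/index.html", 200),
    ("10.0.0.2", "/index.html", 200),
    ("10.0.0.1", "/missing", 404),
    ("10.0.0.3", "/login", 500),
    ("10.0.0.2", "/about.html", 200),
]

# Batch-style aggregate: count requests per HTTP status code.
# Only basic SQL constructs are used, so the same query text is the
# sort of thing one would write in HQL (assumption: a table with the
# same name and columns exists there).
STATUS_COUNT_QUERY = """
    SELECT status, COUNT(*) AS hits
    FROM access_logs
    GROUP BY status
    ORDER BY hits DESC, status ASC
"""


def count_hits_by_status(rows):
    """Load rows into an in-memory SQLite table and run the aggregate query."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE access_logs (ip TEXT, url TEXT, status INTEGER)")
        # Row-level INSERT is SQLite setup only; as the post explains,
        # Hive is oriented toward bulk-loaded data, not row-level writes.
        conn.executemany("INSERT INTO access_logs VALUES (?, ?, ?)", rows)
        return conn.execute(STATUS_COUNT_QUERY).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for status, hits in count_hits_by_status(SAMPLE_LOG_ROWS):
        print(status, hits)
```

SQLite answers this instantly because the data is tiny and local. On Hive the same style of query would be executed as a batch job over files in the cluster, which is why it suits large offline workloads rather than interactive lookups.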
Hive is not a replacement for a traditional database system: it is not designed for online transaction processing and does not offer real-time queries.
The Hive community is working on insert, update, and delete functionality with full ACID support for newer versions of Hive. For more detail, see "Adding ACID to Apache Hive" and the following JIRA issue:
HIVE-5317 - Implement insert, update, and delete in Hive with full ACID support