Transactions in Hive

In this blog post, we have explained about the row-level transactions available in Hive. This post will provide you a good idea of how to implement the row-level transactions on the Hive table. Before beginning with the transactions in Hive, let’s look at the ACID properties, which are vital for any transaction. What is ACID? […]

Bucketing in Hive

In our previous post, we have discussed on the concept of Partitioning in Hive. In this post, we will be discussing the concept of Bucketing in Hive, which gives a fine structure to Hive tables while performing queries on large datasets. As we all know, Partition helps in increasing the efficiency when performing a query […]

Partitioning In Hive

Introduction to partitioning: Hive has been one of the preferred tool for performing queries on large datasets, especially when full table scan is done on the datasets. In the case of tables which are not partitioned, all the files in a table’s data directory is read and then filters are applied on it as a […]

Working With Hive Complex Data Types

In this blog, we will discuss the working of complex data types in Hive. Before we move ahead you can go through the below link blogs to gain more knowledge on Hive and its working. Beginners Guide For Hive Perform Word Count Job Using Hive Pokemon Data Analysis Using Hive Bucketing in Hive – Let’s […]

How to Run Hive Scripts?

Being a Data Warehousing package built on top of Hadoop, Apache Hive is increasingly getting used for data analysis, data mining and predictive modeling. In this post, let’s look at how to run Hive Scripts. In general, we use the scripts to execute a set of statements at once. Hive Scripts are used pretty much […]

What Is Hive View?

Hive View For writing queries, we sometimes find it difficult to frame query when it is nested and complex. This scenario may often occur in the case of joins and we will be covering the same in this blog to simplify our way of querying in HIVE with help of VIEWS. When a query becomes […]

How to Write Script File in Hive?

In this blog, we will learn how to execute Script File in HiveWHY HADOOP?. Hive is a critical component of Hadoop and your expertise in Hive can land you top-paying jobs! Three ways to start Hive. 1. Hive shell: Command line interface 2. The Hive Web Interface is an alternative to using the Hive command line […]

Hive Use Case – Pokemon Data Analysis

In this post, we will be performing certain Hive queries to perform data analysis on Pokémon Go characters. So, what is Pokémon Go? Pokémon Go is a free-to-play, location-based augmented reality game developed by Niantic for iOS and Android devices. It was released only in July 2016 and only in selected countries. You can download […]

Beginner’s Guide for Hive

Why was Hive introduced? Few years ago, Hadoop came into existence for solving queries on huge structured, Semi-structured, and unstructured datasets. Hadoop is considered to be the bestsolution to store and process those huge datasets, because of its advantages like scalability, less infrastructure costs(commodity hardware), data security(replication factor) and MapReduce, which is the best programming […]