Steps to Create UDF in Apache Pig

This post contains the necessary step required to  create UDF in Apache Pig.  All UDF should extend a Filter function and has to contain a method called exec, which contains a Tuple. The logic applied here is that if the Tuple is null or zero, it will give you a Boolean value: True or False. […]


This post is about the operators in Apache Pig. Let’s take a quick look at what Pig and Pig Latin is and the different modes in which they can be operated, before heading on to Operators. What is Apache Pig? Apache Pig is a high-level procedural language for querying large data sets using Hadoop and […]

Pig Script in Local Mode

Pig Script in Local Mode Step1: Writing a Script Open an editor (e.g. gedit) in your Cloudera Demo VM environment. Write the following command to create ‘sample.pig’ file inside the home directory of cloudera user: Command:  gedit sample.pig Let’s write few PIG commands in the sample script! Let’s say our task is to read data from a […]