What is Apache Pig?

Pig is a Hadoop Extraction Transformation Load (ETL) Tool. people in the IT industry are using it for Big Data Log Analysis, If you know Python, R, Scala Programming Language then Apache Pig will be very  very easy to learn.



Pig Programming is easier than Java and Apache Hive Queries on Hadoop Distributed File system, Because Pig Process the Large Data sets and makes them into small data sets before we actually print them or store into any other systems for example ( HDFS, Hive, Hbase…etc).

Since it is Very easy to Learn and Program, Real world Hadoop developer prefer to use Pig than JAVA Map Reduce or Hive Programming.

Apache Pig Useful Link

  • Apache Pig Video Tutorials 
  • Apache Pig Reference Manual links
  • Apache Pig Tools are here 

Apache Pig Books:

  • Pig Programming
  • Pig Design Patterns
  • Hadoop Definitive Guide