If you want to Learn Big Data and Hadoop Technologies [Spark, Scala, Hive, Pig, Sqoop, Unix, Java, Python]
Please contact me at [email protected]
Apache Spark Learning
Apache Spark Intro :
- Apache Spark Introduction and Installation
- How to setup Spark environment using Eclipse
- Spark Scala Shell [ REPL ] using short cut keys
- How to Schedule Spark Jobs on UNIX CRONTAB
Apache Spark with HIVE :
In this section you will learn how to use Apache SPARK with HIVE.
- ETL Example program using Apache Spark, Scala and Hive
- How to process JSON Data and store the results into Hive Partitions
- Store the data into Hive Partitioned table using SPARK Data Frame
- How to write Spark UDF in Scala to check the Blank lines in Hive
- Parsing a part of long Text as keys using Spark
- Reading the Data from Text file using STRUCT TYPE
Apache Spark with Data Frame :
- Creating the Data Frame by Reading CSV File using Spark Session.
- Creating Spark Data Frame using Scala CASE Class
- SELECT in Spark Data Frame
- FILTER in Spark Data Frame
- GROUP BY in Spark Data Frame
- REGISTER TEMP TABLE on Spark Data Frame
- Spark SQL Data frame to load Hive table for Tableau Reports
Apache Spark with Oracle :
the following tutorials will help you to avoid SQOOP and you can directly work with Oracle data using Spark.
- Connecting to Oracle using Apache Spark
- Inserting hive data into Oracle tables using Spark and Scala
- Sqoop from ORACLE from 3 Data center’s and inserting into Hive.
- Automate Spark Daily Data Ingest from Relational Databases
- Data Ingestion from Oracle to Hadoop using Spark scala