January 2016 - Talend Expert
contact@talendexpert.com US 1-877-283-2691 UK 020 7692 4827

Coursera Assignment: Simple Join in Spark

Coursera Assignment: Simple Join in Spark SIMPLE JOIN 1)  If you’ve made it this far in the course, you’ll have a couple of files in your HDFS directory. We need these files to perform our simple join. To make use of these files, you need to load them as an RDD in Spark, which can be achieved with this code: fileA = sc.textFile(“input/join1_FileA.txt”) 2)  Since RDDs are evaluated lazily, we should do a quick spot check to see if the…

Read More >

Top 10 ETL Tools Reviews

Top 10 ETL Tools Reviews Top 10 ETL Tools Reviews : ETL is the short form of Extract, Transform and Load. These are key tools required when handling Database and Data Warehousing.  Also, if you are using any cloud storage services, these ETL tools are very of great importance. The main functions of these tools is to migrate the data from the source database to the required database through cloud computing or Data warehouses. These ETL tools together combine a…

Read More >