Talend 6 : Query natively on Hadoop using Impala with Cloudera

Function tImpalaConnection opens a connection to an Impala database.
Purpose This component allows you to establish an Impala connection to be reused by other Impala components in your Job.

https://help.talend.com/display/KB/How+to+profile+data+natively+on+Hadoop+using+Impala+with+Cloudera+-+Talend+v5.6+features

 

In this job we query Impala , the result are sent to HDFS, then We run Spark jon that uses the HDFS file

Spark Impala HDFS hadoop

Spark Impala HDFS hadoop

Any question ? please contact us on the contact page or send an email to : contact@Talendexpert.com

 

Download template job : Talend Spark Impala HDFS hadoop

 

Leave a Reply