Which OpenSource ETL tool is easier to use & more agile? Pentaho Kettle/Jitterbit/Talend/Clover/Jasper/Rhino?

Please notice this analysis is “vendor independent” and based on multiple clients experience

Which OpenSource ETL tool is easier to use & more agile? Pentaho Kettle/Jitterbit/Talend/Clover/Jasper/Rhino?

talend

  1. Talend and Pentaho offers some of the most deployed Open Source ETL tools and used in several critical implementations.
  2. Talend and Pentaho have strong community support  ( 3 million downloads for Talend ), well known companies. Open Source Business Intelligence is  growing fast and real world applications are widespread.
  3. Talend Open Studio and Pentaho Kettle are  both user friendly, well documented and have a strong community support.  Talend Open Studio requires more initial effort to get started however  its great potential is highly appreciated from the beginning.
  4. The interpreted nature of Kettle makes it sometimes slower in some tasks compared to Talend.
  5. Talend is a single threading code generator Java (or Python for Mapreduce or Scala for Spark ) , Kettle uses a metadata driven multi-threaded engine. So  it’s your choice: either debug generated Java code (Talend) or debug a  graphical data flow (Kettle). Generated code is more preferable by most developers.
  6. Talend is very easy to use . Talend has Data Fabric software that groups all the products you need in one ( ETL, MDM , ESB , Big data ..etc )
  7. Talend is fastest Big data ETL (x100 Pentaho, This is because can generate native Spark code if you are using an HDFS cluster with Spark )
  8. In term of Cost, Talend cost less than Pentaho
  9. Pentaho has an additional reporting tool for BI, but it’s far from competing with big players i.e Tableau, Qlikview
  10. Both have subscription model ( annual )

Leave a Reply