Which OpenSource ETL tool is easier to use & more agile? Pentaho Kettle/Jitterbit/Talend/Clover/Jasper/Rhino?
Please notice this analysis is “vendor independent” and based on multiple clients experience
Which OpenSource ETL tool is easier to use & more agile? Pentaho Kettle/Jitterbit/Talend/C
- Talend and Pentaho offers some of the most deployed Open Source ETL tools and used in several critical implementations.
- Talend and Pentaho have strong community support ( 3 million downloads for Talend ), well known companies. Open Source Business Intelligence is growing fast and real world applications are widespread.
- Talend Open Studio and Pentaho Kettle are both user friendly, well documented and have a strong community support. Talend Open Studio requires more initial effort to get started however its great potential is highly appreciated from the beginning.
- The interpreted nature of Kettle makes it sometimes slower in some tasks compared to Talend.
- Talend is a single threading code generator Java (or Python for Mapreduce or Scala for Spark ) , Kettle uses a metadata driven multi-threaded engine. So it’s your choice: either debug generated Java code (Talend) or debug a graphical data flow (Kettle). Generated code is more preferable by most developers.
- Talend is very easy to use . Talend has Data Fabric software that groups all the products you need in one ( ETL, MDM , ESB , Big data ..etc )
- Talend is fastest Big data ETL (x100 Pentaho, This is because can generate native Spark code if you are using an HDFS cluster with Spark )
- In term of Cost, Talend cost less than Pentaho
- Pentaho has an additional reporting tool for BI, but it’s far from competing with big players i.e Tableau, Qlikview
- Both have subscription model ( annual )