Talend and Spark review

Talend and Spark review

An acceleration engine for your integration platform

As the first data integration platform built on Apache Spark, Talend delivers unprecedented speed, scale, and agility to bring advanced analytics to the data-driven enterprise.

  • Speed performance with in-memory processing
  • A single platform across batch, real-time, and streaming data sources
  • Put advanced analytics and data science into production
  • Future-proof your integration infrastructure


Speed performance with in-memory processing

If your success depends on acting fast with insight from the latest data, then Talend and Spark are the answer. Process at the speed and scale of the Internet of Things with high-speed messaging and high-speed processing to capture and deliver millions of events per second.

  • Talend Big Data jobs running Spark are 5x faster than MapReduce* to deliver real-time results.
  • Talend visual tools enable you to build Spark jobs 10x faster than hand coding to run on Hadoop, standalone, or in the cloud.
  • Talend optimized connectors and components combine in-memory analytics, machine learning and caching components delivering high performance jobs without tuning Spark by hand.

* Validated by independent TPC-H integration benchmarks.




The platform will leverage over 100 Spark components and is designed to allow companies to convert Big Data streaming or IoT sensor information into actionable insights.

Existing Talend customers will be able to convert MapReduce jobs to Spark, which will lead to a 5x performance increase and improve developer productivity 10x, as compared to hand-coding, the company said.

This comes as a result of the prebuilt Spark components and design interface which comes with an automated, ‘no coding required’ Spark code generation.

The company is also providing a built-in Lambda Architecture that offers a single environment for working with bulk and batch, real-time, streaming and IoT data.

Talend and Spark review