Talend DI Benchmark PDF
Talend DI Benchmark PDF
Talend DI Benchmark PDF
© Talend 2011 1
Configuration
Hardware
• CPU: 2 x Intel Xeon E5320 (8 Cores), 1.8 Ghz
• RAM: 14 Gb
• HD: 1 Tb, 7200 Rpm
Software
• Operating System: Red Hat 64bits
• JVM: Sun 1.6
• MySQL 5.0 with MyISAM engine
• Edition: Talend Integration Suite 4.2 MPx
Data
• Files have been automatically (randomly)
generated
© Talend 2011 2
Scenario 1
File Input Delimited > File Output Delimited
Reading X lines from a delimited file and writing them in a delimited file without performing transformations.
© Talend 2011 3
Scenario 2 & 3
File Input Delimited > MySQL
Loading X lines from a delimited file into a MySQL database (parallelized).
DB to flat file
+108K rows/s!
© Talend 2011 5
Scenario 5
File Input Delimited > Transform > File Output Delimited
Reading X lines from a delimited file and writing them into a delimited file after some transformations.
Transformations list:
• the `rate` field is multiplied by 100
• the new field `name` is a concatenation of fields (`firstname`+ « » +`lastname`)
• the `address` field is converted to uppercase
© Talend 2011 6
Scenario 6
File Input Delimited > Sort > File Output Delimited
Reading X lines from a delimited file and writing a sorted delimited file.
The file is sorted ascending according to the `age` (integer) and `firstname` (string) fields.
© Talend 2010 7
Scenario 7
MySQL table > Transformation & lookup > MySQL table
Reading X lines from a MySQL table, performing transformations & lookup and
writing the result in a MySQL table leveraging the ELT mode.
+ 150K rows/s
5,000,000 8.09 61,759
20,000,000 13.16 151,941
© Talend 2011 8
Scenario 8
File Input Delimited > Sort > File Output Delimited
Reading X lines from a delimited file and writing a sorted delimited file.
The file is sorted ascending according to the `age` (integer) and `firstname` (string) fields.
© Talend 2011 9
Scenario 8 / MPx
File Input Delimited > Sort > File Output Delimited
Sorting a X lines delimited file leveraging MPx technology.
The file is sorted ascending according to the `age` (integer) and `firstname` (string) fields.
350K rows/s!
1,000,000 2.81 354,862
5,000,000 13.60 367,512
20,000,000 57.22 349,485
© Talend 2011 10
Scenario 9
File Input Delimited > Aggregate > File Output Delimited
Reading X lines from a delimited file, making an aggregation and writing the result in a delimited file.
The aggregation is based on the `age` field; Operations: COUNT, SUM(rate), AVG(rate), MIN(rate), MAX(rate)
© Talend 2011 11
Scenario 10
File Input Delimited > File Output Delimited
Reading X lines from a delimited file, making a lookup on a 1 million rows file, filtering and writing results in
different delimited files.
© Talend 2011 12