At the bend of a project, we often find ourselves confronted with issues related to migration or data, or a basic cleaning, or the integration of data between applications, is still in extracting qualitative for a summary reporting.
The commonly used approach is reflected by the term ETL (Extract Transform and Load). This approach is schematically connectors used to import or export data in databases or applications and transformations or mapping to handle them.
I have recently used the open source Talend Open Studio . This is a graphic workshop, on an Eclipse, allowing construction jobs. The approach is very intuitive and quality documentation. The diagram below illustrates an example of using the workshop to a problem of data cleansing:
Also very interesting, you can choose between Perl and Java for the generation of jobs, making it possible to insert operations directly in these languages in the transformation process.
In short, a product to have on the elbow at all times!



