In data integration, filtering incoming data from a source is a requirement that comes up regularly. The ETL process needs a lot of conditioning and filtering to overcome data quality issues, and doing all of this processing step by step can take a considerable amount of time.

Pentaho provides a built-in step suited to exactly this purpose: Filter Rows.
The Filter Rows step filters rows based on conditions and comparisons, giving us the desired output according to the conditions applied.
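To make the idea concrete, here is a minimal Python sketch of what such a filter does conceptually: each row is tested against a condition and lands in either a "true" stream or a "false" stream. This is only an illustration, not Pentaho code; the function name `filter_rows`, the sample fields, and the condition are assumptions made up for this example.

```python
from typing import Any, Callable, Dict, Iterable, List, Tuple

Row = Dict[str, Any]


def filter_rows(
    rows: Iterable[Row], condition: Callable[[Row], bool]
) -> Tuple[List[Row], List[Row]]:
    """Split rows into (true_stream, false_stream) according to the condition."""
    true_stream: List[Row] = []
    false_stream: List[Row] = []
    for row in rows:
        # Rows that satisfy the condition go one way, all others go the other way.
        if condition(row):
            true_stream.append(row)
        else:
            false_stream.append(row)
    return true_stream, false_stream


# Hypothetical sample data, invented for this illustration.
rows = [
    {"id": 1, "country": "IN", "amount": 250.0},
    {"id": 2, "country": "US", "amount": 75.0},
    {"id": 3, "country": "IN", "amount": 40.0},
]

# Hypothetical condition: country = "IN" AND amount >= 100
matched, rejected = filter_rows(
    rows, lambda r: r["country"] == "IN" and r["amount"] >= 100
)
```

Here `matched` holds the rows that satisfy the condition and `rejected` holds the rest, so no row is silently dropped.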

The following demo shows how to use Filter Rows in Pentaho:

1. After combining the incoming data into one Dummy step, add a Filter Rows step after it.


2. Then apply the required conditions in the Filter Rows step.


3. You can then split the stream by sending the rows that meet the condition (true) to a Select Values step and the rows that do not (false) to a Dummy step.


4. Preview the transformation and check that the result matches your requirement.
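The four steps above can be mimicked outside Pentaho with a short Python sketch. It is only an analogy for the data flow, not how the transformation is actually built; the two sources, the field names, the condition, and the helper `select_values` are all hypothetical.

```python
from typing import Any, Dict, List

Row = Dict[str, Any]


def select_values(rows: List[Row], fields: List[str]) -> List[Row]:
    """Keep only the named fields, in the spirit of a Select Values step."""
    return [{field: row[field] for field in fields} for row in rows]


def condition(row: Row) -> bool:
    """Hypothetical filter condition: email IS NOT NULL AND age >= 18."""
    return row["email"] is not None and row["age"] >= 18


# Step 1: combine two hypothetical sources into one stream (the Dummy step).
source_a: List[Row] = [{"id": 1, "email": "a@example.com", "age": 34}]
source_b: List[Row] = [
    {"id": 2, "email": None, "age": 51},
    {"id": 3, "email": "c@example.com", "age": 17},
]
combined = source_a + source_b

# Steps 2 and 3: apply the condition, then route the true rows to a
# Select Values equivalent and leave the false rows untouched (Dummy step).
true_rows = select_values([r for r in combined if condition(r)], ["id", "email"])
false_rows = [r for r in combined if not condition(r)]

# Step 4: preview both outputs to check the result.
for row in true_rows:
    print("TRUE ", row)
for row in false_rows:
    print("FALSE", row)
```

Keeping the false rows in their own stream, rather than discarding them, makes it easy to inspect what was rejected when the preview does not match expectations.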

Visit Nevpro Business Solutions Pvt. Ltd. to know more.


