The criteria made inside create_data_filter is just applied to train dataset while validation and test dataset just left as it is. Any idea what should we do to change it?