Learning Multiclassifiers with Predictive Features that Vary with Data Distribution