PMPP proving itself as training objective, good kernel signal rewards are rewarding