From my prospective rl just is a more tolerable way of saying synthetic data that everyone was not a fan of two years ago when i started doing rejection sampling to make Hermes 1. Synthetic data (including semi synthetic data) has been the present since ChatGPT came out.