This is hard to wrap my head around and this explanation is correct: Just as you train language in an LLM, you can literally train WASM in an LLM and have code execute inside the LLM without tool calling. Crazy