Trendaavat aiheet
#
Bonk Eco continues to show strength amid $USELESS rally
#
Pump.fun to raise $1B token sale, traders speculating on airdrop
#
Boop.Fun leading the way with a new launchpad on Solana.

hardmaru
Building Collective Intelligence @SakanaAILabs 🧠
hardmaru kirjasi uudelleen
1 decade ago: Reinforcement Learning Prompt Engineer in Sec. 5.3 of «Learning to Think …» [2]. Adaptive Chain of Thought! An RL net learns to query another net for abstract reasoning & decision making. Going beyond the 1990 World Model for millisecond-by-millisecond planning [1].
[2] J. Schmidhuber (JS, 2015). «On Learning to Think: Algorithmic Information Theory for Novel Combinations of RL Controllers and Recurrent Neural World Models.» ArXiv 1210.0118
[1] JS (1990). “Making the world differentiable: On using fully recurrent self-supervised neural networks for dynamic reinforcement learning and planning in non-stationary environments.» TR FKI-126-90, TUM. (This report also introduced artificial curiosity and intrinsic motivation through generative adversarial networks.)

23,37K
hardmaru kirjasi uudelleen
If you are thinking about world models or neural sims and don't know where to start, check out the OG paper on world models from @hardmaru and @SchmidhuberAI
- It is super clear to read and get the basics
- you can reproduce it on your Mac or any local machine
- you can steadily upgrade it be even more powerful
If Karpathy would teach world models this is the paper he would do deep dive into.
13,51K
Johtavat
Rankkaus
Suosikit
Ketjussa trendaava
Trendaa X:ssä
Viimeisimmät suosituimmat rahoitukset
Merkittävin