Today, we’re launching 4D generation, powered by our Cube Foundation Model. Creators can build experiences that let players create interactive 3D objects like cars, planes, and more. This is just the beginning, see what's next with AI-powered creation. 1/4
In our research lab, we are building “real-time dreaming” - the ability to generate fully playable video worlds prompted from any text or image. Our real-time, action conditioned world model (currently running internally at 16fps at 832x480p) is trained on a combination of data, including proprietary Roblox 3D avatar/world interaction data. World models are different from multiplayer engines in that they store state and memory in video latents. Roblox is multiplayer, and we are actively researching optimal ways to simultaneously store state for thousands of players, and keep them in sync with their environment. Our world model leverages database technology which stores all user interactions on Roblox in a vector format that can be used to re-render video and interaction from any camera angle. We see several immediate uses for our Roblox world model. We will use it side-by-side text, image and video prompts as a way to launch auto-generation of immersive worlds. In Roblox Studio, a creator could walk around and use prompts to “paint” a world and then convert it into a 3D representation or direct to Roblox native as a way for many people to play simultaneously. All of this comes alive as we explore the notion of a “Dream Theater” - where one user is dreaming, while others watch and prompt them. 2/4
To support massive, high-fidelity worlds, our engine implements an efficient LOD strategy with a cloud transcoding system that streams meshes and textures, even on lower end mobile devices. We are expanding this framework to AI upsampling of 3D worlds from prompts. In this preview video below, with a single user prompt, our 3D upsampling technology updates the geometry and texture to transform the classic Roblox game Crossroads into a fantasy world with rich organic detail. 3/4
We have 13B hours of player interaction on the platform monthly. This data enables us to train intelligent NPCs that can reason and interact in 3D worlds. Our training goes beyond videos of gameplay and simple WASD actions, utilizing our full data model for a more detailed representation of human interactions. Our video below shows Roblox NPCs figuring out how to build a campfire by reasoning backwards to find an axe, cut down a tree, and bring the wood to the firepit. This is still early research but we imagine a future where intelligent NPCs could play alongside real players. 4/4
108