Loading The VR School
Loading The VR School
The Student Has Always Been the Agent
On the functional taxonomy of world models — and the K-12 school that proved them in human development before the field found the vocabulary to describe it.
The word “learning” has an overloading problem. A lecture is learning. A textbook is learning. A lab explosion — the accidental kind — is learning. A difficult conversation with someone who disagrees with everything you believe is learning. Failure, humiliation, discovery, mastery: we call them all by the same name, fund them through the same channels, and measure them on identical standardized tests. This category error has cost humanity more than any algorithm error in the history of computing.
A parallel problem has emerged in artificial intelligence. The term “world model” has accumulated so many meanings across so many disciplines — computer vision, robotics, reinforcement learning, generative AI — that it risks becoming a vessel for whatever the speaker wishes to pour into it. A video model generating physically implausible fire is called a world model. A physics engine simulating combustion with Newtonian precision is called a world model. A language model improvising the rules of a game as it plays is called a world model. They are not the same thing.
What follows borrows a framework from that conversation — the three functional modes of world models: Renderers, Simulators, and Planners — not to import it wholesale into education, but to illuminate something the field of learning has been fumbling toward for decades without the vocabulary to describe it. The argument is straightforward: genuine education requires all three. And one school, built in a pandemic, accredited without state funding, serving students in twenty countries, has been quietly assembling all three for six years.
This is not a coincidence. It is an architecture. And it is an invitation.
In AI: Produces observations — visual fidelity, appearance, the world as seen
In education: Lectures, textbooks, videos — knowledge made visible but not inhabitable
In AI: Produces state — geometry, physics, dynamics, structure that holds under inspection
In education: Labs, life simulations, crisis scenarios — understanding built through consequence
In AI: Produces actions — what the agent does next given observations and goals
In education: Agency, project-building, world construction — the student as decision-maker
A Renderer, in the language of spatial AI, produces observations. Its primary metric is visual fidelity. The buildings in a generated drone shot may look architecturally flawless from above, but drive through the city below and they collapse — geometry without physics, appearance without structure. The Renderer excels at making things visible. It was never designed to make them real.
The lecture is education’s Renderer. It converts knowledge into light and sound with extraordinary human efficiency — a gifted teacher can render the French Revolution, the Krebs cycle, and the Pythagorean theorem in a single morning. The content is accurate. The delivery is sometimes beautiful. And the student sits, receiving it, in the same posture as someone watching a film. Passive. Exterior. Unable to enter.
For two centuries, this has been accepted as the normal condition of schooling. The lecture renders knowledge into the air, and we measure how much of it students can re-render back on an exam. The test is a rendering of a rendering. We have built an entire civilization on the hope that this chain of representations eventually connects to genuine understanding. For many students, it never does. Research consistently shows that lecture-based instruction produces retention rates between 5% and 10% after 24 hours. We have known this for decades. We have continued lecturing.
“You cannot teach a student to swim by showing them the ocean in high resolution. The fidelity of the image is not the problem. The image is the problem.”
— On rendering in education
The VR School’s answer to this was not to build a better lecture. It was to build a better world to be in. When a student at The VR School studies cellular biology, they do not watch a video of a mitochondrion. They walk inside one — through the inner membrane, past the cristae, to the ATP synthase complexes spinning at the pace of actual enzymatic reaction. This is still rendering: the environment is visual, immersive, generated. But it is rendering at a fundamentally different resolution of experience. The student is inside the observation, not outside it.
Marble — World Labs’ spatial intelligence platform — makes this possible at scale. Six campus worlds, each a navigable, inhabitable environment, each designed not for aesthetic wonder alone but for epistemological entry. The student does not observe the world. They occupy it. And occupation changes everything the Renderer, alone, cannot change: it anchors knowledge to space, to motion, to the felt sense of being somewhere real. This is why spatial memory — the cognitive system that encodes position and environment — is among the most durable forms of human memory. We remember where we were. The Marble campus teaches students to be somewhere worth remembering.
Rendering alone is not enough. It never was. The world is not a picture to be seen. It is a system to be understood.
— The Functional Taxonomy of Learning
A Simulator is categorically different from a Renderer. Where the Renderer optimizes for appearance, the Simulator optimizes for state — the complete description of what is happening in a world at a given moment, including every object, every position, every velocity, every property. Geometry that holds under inspection. Physics that respects Newton’s laws. Dynamics that behave the way the world actually behaves. The Simulator’s primary metric is not beauty. It is fidelity to consequence.
Simulation receives the least public attention in the world model conversation but proves the most consequential. The same is true in education. The lab — the genuine, hands-on, break-it-and-see-what-happens lab — is the rarest and most underfunded element of schooling. One hundred years of pedagogical research, from Piaget’s constructivism to the most recent neuroscience of learning, converges on the same finding: humans do not learn by receiving information. They learn by building internal models of how things work, testing those models against reality, and updating them when reality surprises them. This is simulation. This is what understanding actually is.
A student who sees a cup understands its appearance. A student who picks up the cup — who feels its weight, adjusts their grip, misjudges the lip and spills — understands the cup. The cup is not learned through rendering. It is learned through the physics of interaction. The collision mesh. The moment before it falls.
“Ninety-one percent of our students demonstrate math proficiency above grade level. Not because we gave them better textbooks. Because we gave them a world where math has consequences.”
— Dr. Freedom Cheteni · The VR School
The VR School’s most consequential programs are simulators. The Life Lab — a multi-phase decision engine that places students inside financial crises, ethical dilemmas, and career crossroads — does not present these scenarios as stories to read. It presents them as systems to navigate. A student managing the Budget Crisis Command Center is not analyzing a case study. They are the crisis manager. Every decision propagates through the simulation: cut the budget and watch the downstream effects cascade; defer the decision and watch the window close. The math is real. The consequences are felt. The learning is irreversible.
This is also why Marble’s collision mesh output matters so much to The VR School beyond aesthetics. A world that looks correct but cannot be physically inhabited is a Renderer. A world whose geometry holds under inspection — that can be navigated, bumped into, reshaped — is a Simulator. When students build worlds in Marble and those worlds have structural integrity, they are learning something a textbook cannot teach: that the world has rules, those rules have weight, and understanding means internalizing those rules until they become intuition.
The cup is not learned by seeing it.It is learned by holding it. By dropping it. By understanding, in the body, what “fragile” actually means.
— On simulation in education
The Planner is the most intellectually audacious of the three categories — and the most nascent. Where Renderers produce observations and Simulators produce state, Planners produce actions. They take what they observe, build an internal model of the world, and decide what to do next. This is the inverse of the Renderer: instead of translating state into appearance, it translates appearance into action. It is also, in the language of cognition, the definition of agency.
Traditional schooling produces almost no Planners. Students are trained to receive — to absorb the Renderer’s output, to rehearse it, to return it. Even the most sophisticated educational systems mistake comprehension for agency. A student who can explain photosynthesis has rendered their understanding back to you. A student who can simulate a failing ecosystem has understood it. A student who can design an intervention — who can plan, within a model of the ecosystem, which variables to change and in what sequence to restore equilibrium — has mastered it. The Planner is the master.
SofAI — The VR School’s 24/7 adaptive intelligence system — is itself a Planner. It does not deliver a predetermined curriculum. It observes each student’s behavior — what they attempt, where they struggle, how long they persist, which concepts they skip — and produces the next-best-action: the precise question, the targeted explanation, the challenge pitched at exactly the right difficulty to trigger the next stage of growth. This is not personalization as a marketing term. This is the POMDP loop applied to individual human development.
“A student who can only render what they know has memorized. A student who can simulate it has understood. A student who can plan within it — who can act, adapt, and rebuild — has mastered it. We built a school for mastery.”
— Dr. Freedom Cheteni
But the most important Planners at The VR School are not the AI systems. They are the students themselves. The five-module Spatial Intelligence curriculum is, in its entirety, a Planner-training program. Students begin by learning to render: spatial prompt writing, scene composition, the grammar of 3D narrative. They advance to simulation: scene architecture, narrative physics, the structural rules that make a world inhabitable rather than merely visible. And they arrive, in the final module, as genuine Planners: building, deploying, and expanding Marble worlds that encode their understanding of a subject — not as a description of it, but as a navigable, consequential environment others can enter.
A student who builds a Marble world about the civil rights movement is not writing an essay about it. They are designing the conditions under which another human being will understand something the student themselves had to fight to understand. That is not just learning. That is teaching. And teaching — the act of building a model coherent enough that another mind can inhabit it — is the highest expression of planning we know.
The Architecture of Learning
In reinforcement learning, the POMDP loop describes how an agent interacts with a partially observable world. In education, it describes how a mind becomes one.
The POMDP framework — the partially observable Markov decision process — describes how an agent navigates a world it cannot see completely. The agent takes an action. The action affects world state. The world returns an observation. The agent updates its model and chooses the next action. Around and around. This is not a theory of artificial intelligence. It is a description of how every human being has ever learned anything worth knowing.
A child learning to walk is running a POMDP. She observes her posture (Render), builds a model of balance and momentum (Simulate), decides to shift her weight (Plan), falls, observes the new state, updates her model. Three hundred iterations later, she walks. No lecture was required. No rubric was administered. The loop ran until it converged.
The VR School is, structurally, a POMDP loop for human development. The Marble campus provides the observation layer: rich, spatial, inhabitable environments that give students something real to observe. SofAI provides the modeling layer: a system that watches how students move through those environments, builds a model of each student’s understanding, and surfaces the precise next challenge. The Spatial Intelligence curriculum provides the action layer: students do not only navigate worlds, they build them. They act. Their actions create new worlds for others to observe. The loop closes. Then it runs again, deeper.
This is why the VR School’s outcomes diverge so sharply from traditional schooling: 3× learning retention, 91% math proficiency, 89% science proficiency, 86% ELA proficiency — not because the school is more rigorous, but because it is more complete. It does not run half the loop. It runs all of it.
91%
Math Proficiency
89%
Science Proficiency
3×
Retention vs. Lecture
402+
Students · 20 Countries
370+
UC A-G Courses
WASC
Fully Accredited
In the world model conversation, the most significant pattern of the current moment is the collapsing of boundaries between the three categories. Systems once designed purely to render are becoming action-conditioned. Simulators are becoming more editable, more generative, more driven by prompt than code. Planners are beginning to incorporate pretrained visual models as their understanding backbone. The three categories are not merging into something blurry. They are converging toward something unified — a single architecture capable of switching between output modalities depending on what the downstream consumer needs.
In education, this convergence is not a future state. It is the present practice of the best teachers who have ever lived. The great teacher does not choose between rendering, simulating, and planning. She moves fluidly between all three in a single lesson, reading the room, adjusting the input — now showing, now letting students explore, now asking what they would do next. The teacher is already the unified world model. The institutional tragedy is that we built schools that permit only the Renderer mode. We wrote lesson plans. We standardized lectures. We tested reception.
The VR School was built to be the unified model. Marble is its rendering layer and its simulation substrate simultaneously — the same platform that generates a visually stunning Gaussian-splat world also generates the collision meshes that make it physically navigable. SofAI is its planning intelligence — observing, modeling, acting — at both the macro level (curriculum design across cohorts) and the micro level (the next question for this student, in this moment). The Spatial Intelligence curriculum trains students to be unified models themselves: learners who can render their understanding into explorable worlds, simulate the consequences of what they build, and plan the next iteration from what they observe when others enter.
The logical endpoint of the world model research program is a single foundation model that renders, simulates, and plans — switching between output modalities as needed. The logical endpoint of the education research program is a learner who has internalized all three modes so deeply that the loop runs automatically, unconsciously, in every domain they encounter. These two endpoints are the same person. They are the same architecture. The VR School is where they meet.
The world model research program has made its central wager explicit: that sufficiently rich world models will enable agents to see, build, and act in worlds. This wager has driven decades of work in computer vision, robotics, and generative AI. It is now driving an entire research generation simultaneously, with billions of dollars flowing toward the convergence point.
What has not been articulated — and what The VR School represents — is that education is the highest-stakes deployment of this technology in human history. Not because schools are a large market, though they are: 1.6 billion students worldwide, a multi-trillion-dollar sector. But because the data generated by genuine spatial learning — by human minds navigating, building, and planning within world models — is categorically different from any training data currently available.
The internet gave AI abundant visual data. It gave it very little data about how human minds learn to understand physical reality. The VR School generates exactly that data, at institutional scale, under conditions of genuine pedagogical rigor. Four hundred students in twenty countries, building Marble worlds, navigating simulated crises, planning interventions in complex systems. Each session is a window into how human spatial intelligence actually develops.
Students building and navigating Marble worlds generate the most valuable training signal in spatial AI: human spatial cognition in action. Every world built, every navigation path taken, every moment of confusion and breakthrough — this is the data that trains better world models.
WASC accredited. UC A-G approved. 370+ courses. This is not a research prototype. It is the world’s first operational spatial intelligence school, with genuine students, genuine outcomes, and genuine institutional standing.
258 million children are out of school globally. Spatial intelligence technology is the first educational technology capable of reaching them without the infrastructure of a building, a teacher roster, or a textbook supply chain. The VR School is the proof of concept.
Better world models → richer learning environments → more students → more spatial learning data → better world models. This is not a pitch for a market. It is a description of a compounding system. The VR School is the seed crystal.
The VR School is not a proposal. It is a running system: 402 students, 20 countries, 6 Marble campus worlds, SofAI adaptive tutoring, Spark.js browser rendering, and a five-module Spatial Intelligence curriculum already delivering measurable outcomes.
The VR School is the distribution layer for spatial intelligence technology into K-12 education globally. Every student who graduates having built Marble worlds is an ambassador. Every partner district is a lighthouse. The school is not a customer. It is a channel.
“Language gave machines a way to talk about the world. World models are how machines will finally come to understand, imagine, reason, and interact with it. Education is how humans have done this for a hundred thousand years. The convergence of these two things is not a trend. It is the next frontier of human development.”
— On the convergence of world models and education
The world model research program began with a simple wager: that a machine with a rich enough model of the world could see it, build it, and act within it. Thirty years of work later, the wager is paying off. The boundaries between rendering, simulating, and planning are collapsing. The unified endpoint is within reach.
Education made the same wager first. Every great teacher who ever lived knew, intuitively, that showing students the world was not enough. That they had to be in it. That they had to break things and rebuild them. That they had to plan. The teacher was running the loop before the loop had a name.
The VR School was built to run the loop for every child on earth. Not the children with access to great teachers and well-funded labs and schools that can afford to break things. All 258 million of the ones who have been waiting. The ones for whom a Marble world — navigable, consequential, built by a student who learned it well enough to construct it — may be the only physics lab they ever enter.
The machine that can render the world, simulate it, and plan within it is the most powerful educational technology in history. We have been building it. We have been building the institution that deploys it. Come build the next chapter with us.
Dr. Freedom Cheteni
Founder & Superintendent, The VR School
Creator, Movement Thinking · #RelationalIntelligence
Stanford, California · 520 Lasuen Mall · June 2026
A UC A-G honors elective for students in grades 9–12. Learn to render, simulate, and plan within spatial environments — using Marble, SofAI, and the same functional taxonomy powering the next generation of AI. Eighteen weeks. Five modules. One irreversible shift in how you understand the world.