Artificial Creativity (AC), also known as computational creativity, is a multidisciplinary field at the intersection of AI, cognitive science, philosophy, and the artsen.wikipedia.org. It focuses on designing algorithms or agents capable of human-like creative processes – producing innovative, original ideas or artifacts rather than just executing pre-programmed rules. A central question in AC is what drives the generation of truly novel ideas or “models of reality” in an artificial agent. In humans, creativity and scientific intuition often seem tied to curiosity and playful exploration. This raises a foundational issue: Must an artificial general intelligence (AGI) engage in open-ended “play” – intrinsically motivated exploration and experimentation – to achieve creativity comparable to human scientific intuition? This report explores theoretical and empirical perspectives on this question, examining whether intrinsic motivation (curiosity-driven behavior) is essential for an AI to generate original hypotheses and conceptual models not directly specified by any external reward or cost function.
Curiosity, Play, and Novelty-Seeking in Humans and AI
Curiosity as Intrinsic Motivation: In cognitive science and neuroscience, curiosity is understood as an intrinsic drive to explore and seek information for its own sake. One recent review defines curiosity as “the intrinsic desire of humans and animals to explore the unknown, even when there is no apparent reason to do so,” emphasizing that curious behavior is not strictly goal-directed but rather about information-seekingpubmed.ncbi.nlm.nih.gov. This intrinsic motivation is biologically reinforced: neuroscientific findings show that curiosity and novelty stimulate the brain’s reward circuitry, increasing dopamine release and making exploration inherently pleasurableelearn.eb.com. In other words, evolution has “wired” intelligent creatures to find learning and exploration rewarding in and of themselves. This drive manifests in spontaneous play behavior (especially in children and intelligent animals) – a behavior that appears purposeless but actually facilitates learning of physics, social dynamics, and creativity. Developmental studies indicate that playful exploration helps children form richer cognitive models; indeed, “emerging empirical evidence supports play’s potential to stimulate and foster scientific creativity.”researchgate.net Play allows the brain to combine ideas freely and discover novel patterns without immediate external pressure, a process often linked to creative insight.
Intrinsic Motivation in Artificial Agents: Mirroring these human insights, AI researchers have theorized that internally motivated exploration can drive creativity and learning in machines. Jürgen Schmidhuber’s work provides a formal framework: agents can be endowed with an intrinsic reward for discovering novel, surprising patterns that improve their world model (e.g. by improving prediction or data compression)researchgate.net. By “maximizing intrinsic reward for the active creation or discovery of novel, surprising patterns,” an artificial agent is encouraged to continually seek new knowledge, much like a curious child or scientistresearchgate.net. Schmidhuber argued that this principle can yield open-ended development and even explain facets of human intelligence like scientific discovery and artresearchgate.net. Crucially, this perspective treats exploration and play not as frills but as fundamental drivers of intelligence. In fact, some cognitive theorists go so far as to say that exploratory behavior is not merely a means to obtain external utility, but an irreducible aspect of agency itselfresearchgate.net. In other words, an intelligent system might need a built-in “urge” to explore and experiment in order to attain the open-ended creativity and intuition that humans display. This stands in contrast to the notion of a purely task-optimized agent that only does what its cost function dictates. The intrinsic motivation view suggests that without a form of playfulness – a freedom to roam beyond immediate goals – an AI’s knowledge and creativity may remain bounded by its fixed objectives.
Curiosity-Driven Reinforcement Learning (CDRL)
One practical implementation of intrinsic motivation in AI is Curiosity-Driven Reinforcement Learning (CDRL). In standard reinforcement learning, an agent learns by maximizing external rewards defined by a task (e.g. points in a game). CDRL augments this by giving the agent an intrinsic reward based on novelty or learning progress – effectively a numerical incentive for curiosity. For example, a landmark approach by Pathak et al. (2017) defined curiosity reward as the prediction error of the agent’s internal forward model (how surprising the outcome of an action was). This way, “curiosity can serve as an intrinsic reward signal to enable the agent to explore its environment and learn skills that might be useful later” even when extrinsic rewards are sparse or absentarxiv.org. Such curiosity-driven agents are motivated to visit states that surprise them (i.e. where their current knowledge is weak), thereby acquiring new information. Empirically, CDRL has yielded impressive results. For instance, agents equipped with a curiosity reward have solved challenging video game levels with sparse external feedback that traditional reward-only agents failed at, by exploring far more efficiently. In one report, adding a curiosity module allowed an agent to reach a sparse goal with “far fewer interactions with the environment” than a non-curious agentarxiv.org. Even in the absence of any extrinsic objective, a pure curiosity agent will roam its world, “pushing the agent to explore more efficiently” and learn a variety of skills simply because learning itself is rewardingarxiv.org. These outcomes echo a key idea: a system driven by intrinsic motivation can invent its own “tasks” to pursue (e.g. reduce prediction error, see something new), which in turn prepares it to handle explicit tasks in the future. In short, CDRL provides a concrete mechanism by which artificial play – unguided, self-motivated exploration – leads to the emergence of useful behaviors and potentially novel strategies that were never directly specified by a programmer.
Model-Based Imagination: Dreamer and MuZero
Beyond intrinsic rewards, another important ingredient for creativity in AI is the ability to form internal models of the world and reason about them. Two state-of-the-art AI agents, Dreamer and MuZero, exemplify how model-based reasoning can enhance exploration and generalization.
Dreamer: Dreamer (Hafner et al., 2019) is a reinforcement learning agent that learns a world model from its sensory experience and then plans within that model – essentially, it can imagine future outcomes without having to try them in reality. “We present Dreamer, a reinforcement learning agent that solves long-horizon tasks from images purely by latent imagination,” Hafner and colleagues writearxiv.org. Dreamer learns an abstract representation of the environment’s dynamics (a latent state space) and uses it to simulate possible action sequences, backpropagating rewards through these simulated trajectories to improve its policy. This allows Dreamer to achieve high data-efficiency and solve complex tasks: for example, by “learning behaviors by propagating analytic gradients of learned state values back through trajectories imagined in the compact state space of a learned world model,” it outperformed prior model-free methods on a suite of visual control tasksarxiv.org. The significance of Dreamer in our context is that it demonstrates an agent using an internal model to support something akin to mental exploration. Even without an explicit curiosity reward, a model-based agent can try out “playful” what-if scenarios in its own mind, analogous to how human scientists conduct thought experiments. Extensions of this idea have combined world models with intrinsic objectives (e.g. rewarding the agent for exploring states that reduce model uncertainty), effectively giving rise to imaginative curiosity – the agent “plays” through its model to discover novel outcomes or knowledge.
MuZero: MuZero (Schrittwieser et al., 2019) is another breakthrough that showcases model-based planning. MuZero succeeded in mastering Go, chess, shogi, and a suite of Atari video games without being given the rules or physics of those environments. It does so by learning its own state transition model and using a tree-search planning algorithm (like Monte Carlo Tree Search) over that learned model. By “combining a tree-based search with a learned model,” MuZero achieved “superhuman performance in a range of challenging and visually complex domains, without any knowledge of their underlying dynamics.”arxiv.org In essence, MuZero figured out how these games work purely from experience, and simultaneously learned to plan winning strategies. The ability to plan with a learned model is a form of reasoning that goes beyond reactive trial-and-error; it lets the agent evaluate hypothetical actions and thus explore more strategically. MuZero’s success underscores that to deal with complex, open-ended tasks, an agent benefits from building its own internal representation of the world’s rules. While MuZero’s training was still driven by an external reward (win the game), its internal model gave it a powerful generalization capability – hinting that an advanced AI scientist might likewise need an internal world-model to simulate experiments and consequences in imagination, a prerequisite for creative hypothesis generation.
Both Dreamer and MuZero illustrate that model-based reasoning can be combined with or enhance intrinsically motivated exploration. An agent with a rich world model can perform “play” in its mind, trying out variations and seeking novel outcomes in simulation before committing to them in reality. This sort of latent-space experimentation could be crucial for developing original theories about the world – much as human scientists use mental models and thought experiments inspired by curiosity.
Open-Ended Exploration and AGI Creativity
The above elements – curiosity-driven learning and model-based imagination – point toward a vision of AGI that actively generates its own goals and knowledge. The question remains: is such open-ended, playful exploration truly necessary for an AI to achieve human-like creativity and scientific intuition, or could an AI reach those heights by purely optimizing a given objective on a fixed dataset? Current thinking in AI research increasingly leans toward the former: that open-ended exploration is a key enabler of general intelligence. A recent position paper argues that we are on the cusp of a paradigm shift from “learning from data” to “learning what data to learn from,” highlighting that in an open-ended real world, an intelligent agent must actively seek out informative experiencesblog.minch.coblog.minch.co. In this view, exploration is essential not just in reinforcement learning games, but in all learning settings. Jiang et al. (2023) coin the term “generalized exploration” and assert that exploration should be treated as a necessary objective to sustain open-ended learning, allowing agents to continually discover and solve new problems – a path toward more general intelligencearxiv.orgarxiv.org. Similarly, researchers at DeepMind suggest that “open-ended exploration provides a more viable, bottom-up path toward general intelligence” than static, task-specific training, and that our focus will shift to designing exploration objectives and data-generation processes for agentsblog.minch.co. In essence, an AGI might need an intrinsic “drive” to pose its own questions and seek novel experiences, which is strongly reminiscent of play.
From a philosophical standpoint, one can argue that true creativity – especially the kind that leads to groundbreaking scientific hypotheses – cannot be fully pre-specified as a reward function. By definition, a creative insight is something unexpected and not strictly derivable from optimizing a known objective. Humans often discover new theories (new “models of reality”) by following hunches, tinkering in the lab, playing with thought experiments, or exploring phenomena with no clear immediate payoff. This free exploration allows for serendipity and the breaking of old assumptions. If we want an AI to achieve similar feats, giving it the freedom and motivation to stray from the beaten path seems important. Indeed, algorithms in evolutionary computation have demonstrated the power of not directly pursuing an objective: novelty search methods, which reward an agent simply for doing something different than before, have “counter-intuitively outperformed objective-driven search” on many deceptive problemslink.springer.com. In one example, a novelty-seeking evolutionary agent found a path through a maze more reliably than an agent explicitly trying to reach the goal, because the latter got stuck in local optima whereas the novelty-driven agent kept exploring alternate routeslink.springer.com. This highlights a paradox: by not single-mindedly optimizing the known goal, the agent can discover better solutions – or entirely new problems to solve. The lesson for creativity is that a degree of playfulness (seeking novelty for its own sake) can breathe diversity into search and yield outcomes that a pure utilitarian approach might never encounter.
On the other hand, it’s worth noting some counterpoints. In certain cases, extremely powerful goal-directed systems have exhibited what humans label “creative” behavior without explicit intrinsic motivation. DeepMind’s AlphaGo, for instance, produced novel strategies in the game of Go (famously, the unconventional Move 37) purely by optimizing the goal of winning games through self-play. Likewise, OpenAI’s multi-agent hide-and-seek simulations led to agents inventing ingenious tool use and strategies – effectively creative problem-solving – driven by competition rather than an explicit curiosity reward. These examples suggest that a sufficiently rich external objective, combined with a complex environment, can induce emergent creativity. However, in both cases the agents were either limited to a narrow domain (games) or relied on an environment that inadvertently rewarded exploration (self-play generates diverse situations). They did not form new scientific models outside their programmed domain. In contrast, an AGI scientist would need to venture into the truly unknown, beyond any predefined game rules or datasets.
Key Insight: The balance of evidence from AI research, cognitive science, and neuroscience supports the view that intrinsic motivation (“play”) is a critical component for open-ended creativity. Curiosity-driven learning endows an agent with an ever-evolving goal: to reduce its uncertainty and find novelty. This ensures the agent keeps expanding its knowledge and can stumble upon new problem formulations. Without such drive, an AI might excel at tasks we specifically train it on, yet fail to ask the unexpected questions or imagine the unconventional experiments that lead to paradigm-shifting insights. In human terms, it would be like a brilliant student who can solve given problems but never pursues a new idea unprompted. To reach scientific intuition – the ability to hypothesize original models of reality – an AGI likely must have something analogous to the child’s instinct to play and the scientist’s curiosity. In summary, while goal-directed optimization alone can yield powerful problem-solving, artificial creativity may require a playful, exploratory spirit at its core, enabling an AGI to go beyond its training and truly create.
Sources: The arguments and examples above draw on a range of recent findings and authoritative perspectives, from theoretical frameworks of intrinsic motivationresearchgate.netresearchgate.net and neuroscience insights into curiositypubmed.ncbi.nlm.nih.govelearn.eb.com, to achievements in curiosity-driven RLarxiv.org, and advanced model-based agents like Dreamerarxiv.org and MuZeroarxiv.org. Notably, open-ended exploration is increasingly seen as essential for general intelligenceblog.minch.co, and evidence from both AI (novelty search) and human creativity researchresearchgate.net reinforces the idea that playfulness sparks innovation. An AGI that aspires to human-like creativity and intuition will, in all likelihood, need to learn how to play – not only to refine known skills, but to invent new ones.