In Search of the Ingredients of Open-Endedness: Replicating Picbreeder with Large Vision-Language Models
Quick Take
This study replicates Picbreeder using Vision Language Models (VLMs) to explore AI's capacity for open-ended creativity. The results show significant qualitative differences from human-generated outputs, influenced by factors like exploratory noise and agent diversity. The code is available for further research.
Key Points
- Replicated Picbreeder by replacing human users with advanced Vision Language Models.
- Observed qualitative differences in outputs compared to historical human-generated images.
- Investigated factors like exploratory noise and behavioral diversity affecting creativity.
- Metrics used included phylogenetic complexity and visual novelty.
- Code available on GitHub for community access and experimentation.
Article Content
From source RSS / original summaryarXiv:2605. 23908v1 Announce Type: new Abstract: We are in the midst of large-scale industrial and academic efforts to automate the processes of scientific, technological and creative production through AI-driven assistants. Historically, a fundamental property of these processes in their human form has been their open-endedness: their capacity for generating a seemingly endless supply of novel and meaningful new forms. Do artificial agents have any capacity for such fruitful unguided discovery?
To answer this question, we turn to Picbreeder, the canonical exemplar of human-driven open-ended search, in which users collaboratively generated a diverse library of images through interactive evolution of small neural networks. We replicate Picbreeder, replacing human users with frontier Vision Language Models (VLMs).
We observe clear qualitative differences between the output of our system and the historical human baseline, and attempt to characterize them using metrics of phylogenetic complexity and visual and semantic salience and novelty. In an effort to identify some of the causal factors contributing these differences, we study the addition of exploratory noise to the agents' selection process, of behavioral diversity between agents, and of narrative momentum in the form of memory of past actions.
We make our code available at https://github. com/smearle/picbreeder-vlm.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.AI
See more →The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane
The Redpanda Agentic Data Plane (ADP) introduces out-of-band metadata channels to enhance the safety of autonomous AI agents, ensuring secure data access and tamper-proof audit trails. This architecture mitigates risks associated with unpredictable AI behavior by enforcing governance throughout the agent lifecycle, demonstrated in a multi-agent trading system with strict data scoping and approval thresholds.