Back to Articles
The old multimedia stack was tools. The new one is endpoints. agents.md is the contract The pipeline: one photo, two Spaces The twist: the user runs the pipeline too Why this is the shape of things Try it An agent built a 3D figurine studio, and now anyone can mint a collectible from one photo. No creative software was opened at any point.
I built an interactive studio that turns any photo into a collectible 3D figurine. I never opened Photoshop. I never opened Blender. Not once during the build, and not once when a figurine is actually generated. Every image is produced by one Hugging Face Space and every 3D model by another, both driven by an agent reading their agents.md.
Here it is, live, sign in and bring your own face:
This is the third post in a small series about treating multimedia models as callable building blocks. The first two built galleries. This one is different: the studio is a real app, and the end user runs the exact same pipeline the agent did, with their own prompt and their own photo.











