That's clever. I guess the next step will be generating 3D models from prompts. That will probably be more difficult, as 3D is less forgiving due to the high degree of accuracy required. Actually, I don't have a good understanding of how text-to-image works. I imagine some sort of intelligent collage is created, similar to how matte painters incorporate photos seamlessly into their work.