/r/ImagenAI
Imagen and Imagen Video are text-to-image and text-conditioned video AI models by Google
For general things that don't push the boundaries, Imagen 3 doesn't really have an equal right at this moment. It is even significantly better than Ideogram 2.0 for text.
But the biggest thing is that most of the time I can get nearly 100% prompt adherence, even for complicated prompts like this picture of an extruded plastic ice cream cone sign from the 1970s that is illuminated from the inside. It literally looks like a product from the 1970s.
The only drawback with Imagen is the censorship, which can get annoying, especially when generating normal or common subject matter.
I'm paying $20 a month for Gemini for chat and another $30 a month for Midjourney because it has a good enough image generator for my needs. I could use Gemini alone for both chat and AI image generation if it weren't so restrictive. For my needs, Gemini is perhaps better because it gives a good, consistent style with minimal prompting. But Gemini very often says that it won't make an image because it doesn't know whether it's against the policy, even though everything I'm making is very safe and normal. It's quite frustrating and I hope they fix this soon.
I've been trying to get Imagen 3 to draw images of people in a variety of styles. So far the only way I can do it is to say it is a cartoon or a 3D cartoon. It doesn't matter whether these are real people or well-known fictional people. Asking for a pencil sketch or a watercolour or an oil painting or literally anything else will not work. I cannot generate an image of Sherlock Holmes even though that character is public domain. I cannot create an image of Robinson Crusoe or any other famous literary figure because it apparently trips the filter for photorealistic people. I understand why that's there, and safety features are important. But come on, when I can't even generate images of people in an obviously non-realistic style, what can I do? Just like with the rest of Gemini, it's far too cautious about this stuff.
TL;DR: has anyone found any prompts that can help me?
Has anyone attempted to make a sprite sheet with transparency that has the sprite locations locked so you could generate themed sheets that could be used in a game? Is this even possible?
Google changed Gemini's image generator from Imagen 2 to Imagen 3 today and I feel like it's a terrible downgrade. Is there a way to access Imagen 2 again?