Mother Algorithm

One of the things I love most about AI art generation is how unexpected the results can be.

Recently, I was experimenting with how an AI generated initial image affects the output alongside the same prompt.

The idea of using an initial image is that it gives the AI a starting point from which to generate from. I was particularly interested to see if using another AI generated image made a difference to the accuracy of the output because using an ‘organic’ initial image can produce weird or very bias results.

I opted to use the prompt ‘Last selfie on earth’ because I love anything dystopian (aside from real life ironically now that I’m getting older) and I knew it would produce interesting results based on its popularity.

However, what I didn’t expect was what came next. Otherwise I would have considered not running the prompt at 2am.

Using NightCafe Studio, I selected two pictures of AI generated women rendered in a digital art style.

Picture 1 featured a kind of alien anime girl that I nicknamed ‘Galaxy Gazer’:

‘Galaxy Gazer’ – available for digital download now

While picture 2 featured a digital drawing of a haunted girl with blood on her face which I will refer to as ‘Ellen’ for the rest of this post:

‘Ellen’ – a digital painting of an Irish girl

The idea was that the AI would generate outputs featuring female depictions of the ‘last selfie on earth’ but with different ‘feels’, colour schemes and slightly different art styles. But that wasn’t what happened. This is what happened:

Reader, I don’t think I need to tell you how it feels when it’s 2am and something unexpected like that happens. Especially if you’re a stupid mole person like me who works in darkness with a Spotify playlist that contains more ambient and downtempo music than you’d like to admit.

After the brief spike in emotion, I was fascinated. I had used two different initial images with the same prompt and somehow, on both occasions, it resulted in an almost identical batch output.

While there was some variants between the individual outputs, the AI was clear. The last selfie on earth would depict a man with facial hair, olive skin and be no older than his early 40s.

The output produced from ‘Ellen’ particularly looked like a kaleidoscope of different outcomes for the man in question. In some outputs, he’s young, scared and defensive, as if facing a threat he can see or in the middle of conflict. In others, he’s tired and old, sometimes seeming oblivious or intoxicated, sometimes looking scared and powerless to change his fate from an unseen threat.

The output from ‘Galaxy Gazer’ was more ambiguous with more variance on the physical features of the man, making him appear to be more like different individuals. However, as mentioned before, both batch outputs are linked by common features like facial hair, olive skin and age range.

So now, the question remains, why did NightCafe’s stable diffusion AI art generation algorithm do this?

The conspiracy theorists amongst us might say it’s because AI knows something we don’t. It’s thoughts like that which make AI art generation so irresistible to keep picking back up and experimenting with.

However, the more boring explanation lies with how the AI has been trained to process information and how this affects the output.

For example, the ‘Ellen’ initial image uses a dark and almost grayscale color scheme and her face is fairly distinctively high angled. When the AI is fed this image, this is the main information that it takes away from it to apply to the output.

The AI then takes the information and weighs up how to apply it to the prompt.

The AI possesses biases or in media communication terms ‘conventions’ that help it consistently produce work that the user expects to see.

In the case of the prompt ‘Last selfie on earth’ , the conventions that AI has decided to apply is that the subject should be male, have facial hair (there are never razors during the apocalypse) and be olive/tan skinned (There also isn’t any shade or sunscreen).

Reader, I could write an entire book about the ethics of conventions, why they exist and if they should even exist but it’s a well trodden niche that plenty of other great content creators are currently exploring. My job as a content creator is to simply point out what the conventions are and why they are present in the image in front of you.

So in this instance, the AI has judged the subject to be exclusively male because they are almost exclusively the hero in the action and adventure genres.

This is particularly true for the dystopian sub-genre where the protagonist is a male trying to overcome isolation and a hostile environment that doesn’t cut him a break. So he must become stronger and tougher to survive, and maybe recapture his essential humanity along the way.

As mentioned before, the man’s appearance is supposed to be shaped by said hostile environment. He’s been wandering a wasteland with the sun beating down on him so he’s got a strong tan. There are no razors or grooming kits because personal grooming is a very low priority for a human living through the apocalypse. So he can’t cut his hair or his facial hair very often or very well.

The resulting output is a mashup of the main details of ‘Ellen’ and the conventions that the AI holds about the prompt ‘Last selfie on earth’. This is how you end up with a batch output of 9 very similar looking men (because Ellen’s face is distinctively high angled) with a dark color scheme (again because of Ellen) and a disheveled and generally morose appearance (conventions).

In summary, take caution when using AI image generation at 2am, even if the company who produces it is appropriately called NightCafe studio. You might get a shock you didn’t expect.

However, if you do get an output that shocks you, don’t panic, AI probably doesn’t know something you don’t. Most outputs are just a heady mix of the initial image you provide and the conventions that AI expects you to see based on your prompt.

Lately you might have heard a lot about AI-based programs in the news. From Chat GPT’s amazing ability to effortlessly generate countless genres of text, to DALL-E’s ability to render your ideas into awe-inspiring art.

But when it comes to AI generated images produced by the likes of DALL-E , Midjourney and NightCafe Studio, you might find yourself wondering is it art? And if the person directing the AI decides to sell the results, are they an artist and should you really support them?

Today we’re going to explore 4 reasons why you shouldn’t be afraid to buy AI-generated art and why it’s worth supporting this new creative genre.

1.They Used to Say That Digital Art Would Ruin Art

Unbelievable isn’t it? If you go to any art community online nowadays, you will find countless examples of gorgeously rendered digital art which was lovingly crafted across many hours. If you asked, most people would say that digital art is a legitimate art medium that deserves just as much attention as traditional ones.

Yet, when digital art first emerged, critics denounced the medium because they said it didn’t require as much technique and skill as traditional art mediums. They bemoaned a future where new digital artists would cause traditional artists to perish and replace their hard work with ‘artificial renderings’ that didn’t take several months and hundreds of pounds (or dollars) in materials to make.

In other words, they saw digital art as cheating and they didn’t like it one bit.

But now, just over 20 years later, we can see that the world has room for both digital and traditional art forms. You might be a fan of one or both and it hasn’t affected the art world one bit. Artists have even found ways to mix both mediums to take their talents to new heights.

AI-generated art is just another moment like that. A new genre of digital art has emerged and once again, critics are adamant that it will destroy everything when in reality, it doesn’t have to be like that at all.

2. AI Image Generation Is a Valuable Tool for Creative People

If you’re creative, have you ever experienced the horrible feeling of having a really good picture of an idea in your mind but not quite knowing how to put it onto paper? How about the annoying quest of trying to find the perfect reference image and coming up empty?

AI-Image generation can help with that.

When it comes to AI-image generation, your only limit is what you can prompt it to do, which is almost anything.

Are you a furniture designer who is struggling with their strawberry chair concept? DALL-E can help give you some pointers.

DALL-E’s take on what a ‘strawberry chair’ could be

What about a fantasy fiction writer who can’t quite visualize the world their characters inhabit? Midjourney or Stable Diffusion can generate a series of scenic images to help you.

For a digital artist like myself, AI-image generation algorithms can provide royalty free source images for my work to remix with other sources.

It can also help me experiment with colour and texture combinations that I never would have thought of before, which in turn helps inspire me in new directions at a much faster pace than ever before.

3. It Might Not Be as Easy as You Think

Some of you might think that AI art generation is so easy that it requires no skill at all because the AI does all the work.

However, you might be surprised to know that’s actually a common misconception.

In order for you to get the AI to work, you need to craft the right text input or ‘prompt’ to tell the AI what to make.

While some might choose to make their prompt a sentence, it’s not necessarily how the AI processes information because it is a machine.

The really interesting results come from combinations of keywords and phrases. Too few keywords and you might not end up with the concept you’re going for.

On the other hand, if you use too many keywords, you risk ‘overcooking’ the output and rendering the design useless.

Therefore, there is definitely a steep learning curve to crafting the perfect prompt and developing skill, just like any other artistic medium.

4. It’s Weirdly Collaborative

While AI generated images are impressive, they are not perfect. Particularly when it comes to generating animals and people. Much like human artists, some of the most common issues revolve around hands, hair and feet.

This is because the AI is just a series of algorithms and doesn’t always completely understand how something is ‘supposed’ to look. This can lead to some hilarious or at times, downright nightmare fuel inducing outputs

A Stable Diffusion rendering of Virgin Mary & Jesus with way too many arms

And that’s where I come in. Even the best outputs rely on me using my knowledge of photo editing to fix errors and ‘polish’ it into the best image it can be.

In that sense, I have a collaborative relationship with AI where I come up with an initial concept which AI helps me realise before I take the output and complete the final steps. It’s like being part of my own personal art collective, only with a machine instead of a group of humans.

Conclusion

AI Image Generation is a brand new artistic medium. While some critics might claim that it is just for talentless hacks who want to destroy ‘real’ creativity, I believe that it is a valid medium that could offer new opportunities for artists to grow and collaborate with machines, leading to exciting possibilities.

If you believe in the evolution of art through the ages, support the most cutting edge medium in the art world today.