Foreword: Below are four variations of the same prompt, which I used to create my first-ever Midjourney images. The only thing that varies across the prompts is the aesthetic stylisation requested. I then chose four images in total (one from each set of four) that follow the prompt most closely. I explain below why I chose these over the others, in what respects and to what degree the AI image generator surprised me by delivering the unexpected, and where it did not meet my expectations for the prompts I provided.
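As a rough sketch of how the four prompts relate, the shared scene description and parameter string can be held fixed while only the style clause changes. The names used here (SCENE, PARAMS, STYLES, build_prompt) are illustrative assumptions and not part of Midjourney; the script only assembles the text that is then pasted into Midjourney. All strings are copied from the prompts in this document, original spellings included.

```python
# Illustrative sketch: rebuilds the four prompts from their shared parts.
# SCENE, PARAMS, STYLES and build_prompt are hypothetical names, not a Midjourney API.

# Scene description shared by all four prompts (spelling kept as in the prompts).
SCENE = (
    "A science classroom underwater with fatigued students "
    "doing math problems on infinite stair cases"
)

# Parameters shared by all four prompts; --c 25 is the chaos setting discussed below.
PARAMS = "--ar 16:9 --c 25 --style raw --s 250"

# The only part that varies: the aesthetic stylisation clause.
STYLES = {
    "Image 1": "in the style of Japanese lo-fi 64 bit colour, use lighting and shadows",
    "Image 2": (
        "in the style of a 3d unity render, Octance render, 3d sculpture, "
        "raytrace, 4K, dramatic lighting, focused, detailed"
    ),
    "Image 3": (
        "painted by Georges Seurat, use Pointillist brush technique, "
        "vibrant colours, epic"
    ),
    "Image 4": (
        "in the style of still photographs taken with high contrast "
        "Illford HP5 balck and white film, using a 35mm lens, dramatic, "
        "Chiaroscuro lighting effect, deep shadows, intense"
    ),
}


def build_prompt(style, scene=SCENE, params=PARAMS):
    """Join scene, style clause and parameters into one prompt string."""
    return f"{scene}, {style} {params}"


# One full prompt per image set, keyed by the labels used in this document.
PROMPTS = {name: build_prompt(style) for name, style in STYLES.items()}

if __name__ == "__main__":
    for name, prompt in PROMPTS.items():
        print(f"{name}: {prompt}")
```

Keeping the scene and parameters in one place makes it easier to see that any difference between the four sets comes from the style clause (and from the variation that the chaos setting introduces).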
Image 1: Prompt: A science classroom underwater with fatigued students doing math problems on infinite stair cases, in the style of Japanese lo-fi 64 bit colour, use lighting and shadows --ar 16:9 --c 25 --style raw --s 250 - @MAT 255 (fast)
1. Does the image meet your expectations?
It does something beyond meeting expectations. In this first-ever encounter with an AI image generator like Midjourney, I realised what it means for the AI to read and synthesise linguistic cues and semiotic signifiers into imagined signifieds, where the rules of Saussurean semiotics, as they apply in the real world, do not hold. Prompt-based image generation goes beyond a structuralist reality to reveal the potentialities of free association between signifiers and signifieds -- one that taps into the loop that feeds the AI engine and lets machinic permutations foster imagined semiotic connections. In short, although the image tries to follow the aesthetic parameters in the prompt as closely as possible, the scene description I requested, given its complexity, has been largely evaded. Even though the rendition of the underwater classroom is fairly accurate, with its use of lo-fi image style and lighting, the infinite nature of the staircase is nowhere near what I expected.
2. To what degree does your text query influence the generated image?
Insofar as the aesthetic stylisation is concerned, the image style and use of colour requested in the text have influenced the image and rendered the Japanese lo-fi, 64-bit aesthetic quality that I was looking for. I feel the generated image did make sense given the text I had input; however, it was not the direction that I was personally envisioning.
3. What is the style of the image, and why do you think it has produced that?
Based on available examples of Japanese lo-fi art in 64-bit colour, the AI was able to produce the aesthetic quality I was looking for. However, in terms of the scene I had asked for -- in other words, in terms of its diegetic description -- it has somehow failed to render the infinite nature of the staircases. I was hoping it would be flatter, like an illustration from a book, and significantly more whimsical in its rendition of the space.
4. Any thoughts about how the visual elements in the image are organized?
The visual elements, namely the underwater atmosphere of the classroom, do seem to be organized in line with the prompt I attempted. Midjourney did, however, leave out some of the things I asked for, namely the infinite stairs. The variation between images, caused by the chaos parameter being set to 25, is also clearly noticeable and admirable.
5. How would you change the query? To achieve what difference?
I would perhaps revise this query to be more specific about the elements of the scene rather than the aesthetic style. I would specify the kind of infinite staircase I want the AI to depict (e.g. the Penrose Stairs).
6. Any other comments?
Starting from this query, I did make significant changes to try other styles but was still unable to achieve exactly what I wanted, which unfortunately led me to stray further from, and partly abandon, this idea.
7. On a scale of 5 - from 5 being GREAT to 1 being LOW your rating of the result
3.5/5
Image 2: Prompt: A science classroom underwater with fatigued students doing math problems on infinite stair cases, in the style of a 3d unity render, Octance render, 3d sculpture, raytrace, 4K, dramatic lighting, focused, detailed --ar 16:9 --c 25 --style raw --s 250 - @MAT 255 (fast)
1. Does the image meet your expectations?
This is one of the rare instances in which the images came closer to my expectations than the others did. The aesthetic parameters in my prompt were followed more or less accurately. The images resemble the style of photo-realistic 3D computer/PlayStation games, which sometimes use Unity 3D rendering and ray tracing. Unlike the rest of the examples here, the scenes depicted in these images can be said to be taking place underwater. Here again, however, the scene description I requested, given its complexity, has been largely evaded. The infinite nature of the staircase is nowhere near what I expected.
2. To what degree does your text query influence the generated image?
Insofar as the aesthetic stylisation is concerned, the 3D Unity render, Octane render, 3D sculpture, ray tracing, and dramatic lighting effects requested in the text have influenced the image and given it its aesthetic particularity. I feel the generated images did make sense given the text I had input, and they are much akin to the video games on which I based the style.
3. What is the style of the image, and why do you think it has produced that?
The style of the image is much like that of images from computer games. Based on available examples of effects generated by 3D Unity render, Octane render and ray tracing, the AI was able to render the 3D sculpture-like aesthetic quality. However, in terms of the scene I had asked for -- in other words, in terms of its diegetic description -- it has somehow failed to render the infinite nature of the staircases. Images 2 and 4 do not seem to show any staircases at all!
4. Any thoughts about how the visual elements in the image are organized?
This is an example where the visual elements, namely the underwater atmosphere of the classroom, seem to be quite well organized in line with the prompt I attempted. Midjourney did, however, leave out some of the things I asked for, namely the infinite stairs. The variation between images, caused by the chaos parameter being set to 25, is also clearly noticeable and admirable.
5. How would you change the query? To achieve what difference?
I would perhaps revise this query to be more specific about the elements of the scene rather than the aesthetic style. I would specify the kind of infinite staircase I want the AI to depict (e.g. the Penrose Stairs).
6. Any other comments?
Starting from this query, I did make significant changes to try other styles but was still unable to achieve exactly what I wanted, which unfortunately led me to stray further from, and partly abandon, this idea.
7. On a scale of 5 - from 5 being GREAT to 1 being LOW your rating of the result
4/5
Image 3: Prompt: A science classroom underwater with fatigued students doing math problems on infinite stair cases, painted by Georges Seurat, use Pointillist brush technique, vibrant colours, epic --ar 16:9 --c 25 --style raw --s 250 - @MAT 255 (fast)
1. Does the image meet your expectations?
As I have mentioned above, not quite. Here, even the aesthetic parameters seem to have gone awry. The images do not quite follow Georges Seurat's use of Pointillist brush strokes. The last image in the set of four curiously resembles Vincent van Gogh's The Starry Night (June 1889) in its quasi-post-impressionist rendition of the sky. Again, the scene description I requested, given its complexity, has been largely evaded. The rendition of the underwater classroom is inaccurate, just as the infinite nature of the staircase is nowhere near what I expected.
2. To what degree does your text query influence the generated image?
Insofar as the aesthetic stylisation is concerned, the image style and use of colour requested in the text have influenced the image and rendered the painting-like aesthetic quality, though the style of painting followed was not exactly what I had asked for. I feel the generated image did make sense given the text I had input; however, it was not the direction that I was personally envisioning.
3. What is the style of the image, and why do you think it has produced that?
Based on available examples of vibrantly coloured paintings (not necessarily Pointillist or Post-impressionist), the AI was able to render the painting-like aesthetic quality. However, in terms of the scene I had asked for -- in other words, in terms of its diegetic description -- it has somehow failed to render the infinite nature of the staircases as well as the request for it to be underwater. I was hoping it would follow Seurat's style and his use of the Pointillist technique more accurately.
4. Any thoughts about how the visual elements in the image are organized?
The visual elements, namely the underwater atmosphere of the classroom, do not seem to be organized in line with the prompt I attempted. Midjourney also left out some of the things I asked for, namely the infinite stairs. The variation between images, caused by the chaos parameter being set to 25, is also clearly noticeable and admirable.
5. How would you change the query? To achieve what difference?
I would perhaps revise this query to be more specific about the elements of the scene rather than the aesthetic style. I would specify the kind of infinite staircase I want the AI to depict (e.g. the Penrose Stairs).
6. Any other comments?
Starting from this query, I did make significant changes to try other styles but was still unable to achieve exactly what I wanted, which unfortunately led me to stray further from, and partly abandon, this idea.
7. On a scale of 5 - from 5 being GREAT to 1 being LOW your rating of the result
2.5/5
Image 4: Prompt: A science classroom underwater with fatigued students doing math problems on infinite stair cases, in the style of still photographs taken with high contrast Illford HP5 balck and white film, using a 35mm lens, dramatic, Chiaroscuro lighting effect, deep shadows, intense --ar 16:9 --c 25 --style raw --s 250 - @MAT 255 (fast)
1. Does the image meet your expectations?
Not quite. The aesthetic parameters in my prompt were not quite followed. Although the images are in fact quite high-contrast, thereby trying to emulate the Chiaroscuro lighting effect with intense, deep shadows, they are not exactly black and white or neutral monochromes, as one would expect from Ilford HP5 black and white film stock (the lensing is accurate, however). While image 3 has a hint of warm hues in the highlights, image 4 is in subdued colour (almost like a neo-noir film such as David Fincher's Se7en (1995, USA)). Again, the scene description I requested, given its complexity, has been largely evaded. The rendition of the underwater classroom is inaccurate (image 3, for instance, better fits the description of a flooded staircase than an underwater classroom). As always, the infinite nature of the staircase is nowhere near what I expected.
2. To what degree does your text query influence the generated image?
Insofar as the aesthetic stylisation is concerned, the high-contrast Chiaroscuro lighting effect requested in the text has influenced the image and given it its high-contrast lighting quality, though the photographic style followed was not exactly what I had asked for. I feel the generated image did make sense given the text I had input; however, it was not the direction that I was personally envisioning.
3. What is the style of the image, and why do you think it has produced that?
Based on available examples of high-contrast photographs (with or without the Chiaroscuro lighting effect), the AI was able to render the photographic aesthetic quality. However, in terms of the scene I had asked for -- in other words, in terms of its diegetic description -- it has somehow failed to render the infinite nature of the staircases as well as the request for it to be underwater. I was hoping everything would at least be submerged underwater.
4. Any thoughts about how the visual elements in the image are organized?
The visual elements, namely the underwater atmosphere of the classroom, do not seem to be organized in line with the prompt I attempted. Midjourney also left out some of the things I asked for, namely the infinite stairs. The variation between images, caused by the chaos parameter being set to 25, is also clearly noticeable and admirable.
5. How would you change the query? To achieve what difference?
I would perhaps revise this query to be more specific about the elements of the scene rather than the aesthetic style. I would specify the kind of infinite staircase I want the AI to depict (e.g. the Penrose Stairs). I might also use the phrase "submerged underwater" instead of just "underwater" so that the result follows my expectations more closely.
6. Any other comments?
Starting from this query, I did make significant changes to try other styles but was still unable to achieve exactly what I wanted, which unfortunately led me to stray further from, and partly abandon, this idea.
7. On a scale of 5 - from 5 being GREAT to 1 being LOW your rating of the result
2.5/5
Finally, it is important to note the blue bias in some of the images in the first two sets and the last set of four (i.e. the fourth image of the last, or fourth, set), something that was not part of my prompt. Nevertheless, below are four images, one chosen from each set of four above, that come more or less close to my expectations, not just in rendering the aesthetic style in the query as closely as possible, but more so in retaining most of the visual elements asked for in the scene in the first place. These are:
Image (A)
Image (B)
Image (C)
Image (D)
