The quality of text-to-image generation is continuously improving, yet the boundaries of its applicability are still unclear. In particular, refinement of the text input with the objective of achieving better results – commonly called prompt engineering – so far seems to have not been geared towards work with preexisting texts. We investigate whether text-to-image generation and prompt engineering could be used to generate basic illustrations of popular fairytales. Using Midjourney v4, we engage in action research with a dual aim: to attempt to generate 5 believable illustrations for each of 5 popular fairytales, and to define a prompt engineering process that starts from a pre-existing text and arrives at an illustration of it. We arrive at a tentative 4-stage process: i) initial prompt, ii) composition adjustment, iii) style refinement, and iv) variation selection. We also discuss three reasons why the generation model struggles with certain illustrations: difficulties with counts, bias from stereotypical configurations and inability to depict overly fantastic situations. Our findings are not limited to the specific generation model and are intended to be generalisable to future ones.

Grimm in Wonderland: Prompt Engineering with Midjourney to Illustrate Fairytales / M. Ruskov (CEUR WORKSHOP PROCEEDINGS). - In: IRCDL 2023 : Information and Research Science Connecting to Digital and Library Science 2023 / [a cura di] A. Bardi, A. Falcon, S. Ferilli, S. Marchesin, D. Redavid. - Aachen : CEUR-WS, 2023 Mar 28. - pp. 180-191 (( Intervento presentato al 19. convegno Information and Research Science Connecting to Digital and Library Science tenutosi a Bari nel 2023.

Grimm in Wonderland: Prompt Engineering with Midjourney to Illustrate Fairytales

M. Ruskov
Primo
2023

Abstract

The quality of text-to-image generation is continuously improving, yet the boundaries of its applicability are still unclear. In particular, refinement of the text input with the objective of achieving better results – commonly called prompt engineering – so far seems to have not been geared towards work with preexisting texts. We investigate whether text-to-image generation and prompt engineering could be used to generate basic illustrations of popular fairytales. Using Midjourney v4, we engage in action research with a dual aim: to attempt to generate 5 believable illustrations for each of 5 popular fairytales, and to define a prompt engineering process that starts from a pre-existing text and arrives at an illustration of it. We arrive at a tentative 4-stage process: i) initial prompt, ii) composition adjustment, iii) style refinement, and iv) variation selection. We also discuss three reasons why the generation model struggles with certain illustrations: difficulties with counts, bias from stereotypical configurations and inability to depict overly fantastic situations. Our findings are not limited to the specific generation model and are intended to be generalisable to future ones.
No
English
text-to-image generation; prompt engineering; action research; fairytales
Settore INF/01 - Informatica
Intervento a convegno
Esperti anonimi
Ricerca applicata
Pubblicazione scientifica
   Values across Space and Time (VAST)
   VAST
   EUROPEAN COMMISSION
   H2020
   101004949
IRCDL 2023 : Information and Research Science Connecting to Digital and Library Science 2023
A. Bardi, A. Falcon, S. Ferilli, S. Marchesin, D. Redavid
Aachen
CEUR-WS
28-mar-2023
180
191
12
3365
Volume a diffusione internazionale
Diamond
Information and Research Science Connecting to Digital and Library Science
Bari
2023
19
Convegno nazionale
Intervento inviato
https://ceur-ws.org/Vol-3365/paper6.pdf
orcid
Aderisco
M. Ruskov
Book Part (author)
open
273
Grimm in Wonderland: Prompt Engineering with Midjourney to Illustrate Fairytales / M. Ruskov (CEUR WORKSHOP PROCEEDINGS). - In: IRCDL 2023 : Information and Research Science Connecting to Digital and Library Science 2023 / [a cura di] A. Bardi, A. Falcon, S. Ferilli, S. Marchesin, D. Redavid. - Aachen : CEUR-WS, 2023 Mar 28. - pp. 180-191 (( Intervento presentato al 19. convegno Information and Research Science Connecting to Digital and Library Science tenutosi a Bari nel 2023.
info:eu-repo/semantics/bookPart
1
Prodotti della ricerca::03 - Contributo in volume
File in questo prodotto:
File Dimensione Formato  
paper6.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 19.09 MB
Formato Adobe PDF
19.09 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1013468
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact