This paper presents a practical guide to machine learning-assisted visual content analysis for social scientists. Combining machine automation with human expertise and reflexivity, the proposed methodological framework bridges the gap between computer vision and social research. Our custom approach combines inductive, deductive, and abductive logics of scientific inquiry and consists of three complementary steps: (a) Pattern exploration-employing unsupervised learning to explore visual patterns within image datasets; (b) Theory-driven image classification-utilizing supervised learning with convolutional neural networks to systematically label visual content; and (c) Context-sensitive interpretation-to provide critical and creative engagement with the patterns identified in the previous steps. We illustrate these three steps, and their various combinations, through empirical examples from a study of visuality in digital diplomacy, and critically discuss the epistemological implications of using machine learning as a method in visual social research.

With eyes of a machine: A three-step guide for applying machine learning to visual content analysis in social research / A.H. Kvist Møller, M. Airoldi. - In: BIG DATA & SOCIETY. - ISSN 2053-9517. - 12:2(2025 Jun), pp. 20539517251343860.1-20539517251343860.13. [10.1177/20539517251343860]

With eyes of a machine: A three-step guide for applying machine learning to visual content analysis in social research

M. Airoldi
Ultimo
2025

Abstract

This paper presents a practical guide to machine learning-assisted visual content analysis for social scientists. Combining machine automation with human expertise and reflexivity, the proposed methodological framework bridges the gap between computer vision and social research. Our custom approach combines inductive, deductive, and abductive logics of scientific inquiry and consists of three complementary steps: (a) Pattern exploration-employing unsupervised learning to explore visual patterns within image datasets; (b) Theory-driven image classification-utilizing supervised learning with convolutional neural networks to systematically label visual content; and (c) Context-sensitive interpretation-to provide critical and creative engagement with the patterns identified in the previous steps. We illustrate these three steps, and their various combinations, through empirical examples from a study of visuality in digital diplomacy, and critically discuss the epistemological implications of using machine learning as a method in visual social research.
AI; Computer vision; machine learning; mixed methods; visual content analysis; visual methods
Settore GSPS-06/A - Sociologia dei processi culturali e comunicativi
Settore GSPS-05/A - Sociologia generale
giu-2025
Article (author)
File in questo prodotto:
File Dimensione Formato  
kvist-moller-airoldi-2025-with-eyes-of-a-machine-a-three-step-guide-for-applying-machine-learning-to-visual-content.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Licenza: Creative commons
Dimensione 772.72 kB
Formato Adobe PDF
772.72 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1169356
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex 0
social impact