Link Prediction with Text in Online Social Networks: The Role of Textual Content on High-Resolution Temporal Data / M. Dileo, C.T. Ba, M. Zignani, S. Gaito (LECTURE NOTES IN COMPUTER SCIENCE). - In: Discovery Science / edited by P. Pascal, D. Ienco. - [s.l.] : Springer, 2022 Oct. - ISBN 978-3-031-18839-8. - pp. 212-226 (Paper presented at the 25th International Conference on Discovery Science, held in Montpellier in 2022) [10.1007/978-3-031-18840-4_16].

Link Prediction with Text in Online Social Networks: The Role of Textual Content on High-Resolution Temporal Data

M. Dileo (first author); C.T. Ba (second author); M. Zignani (penultimate author); S. Gaito (last author)
2022

Abstract

Machine learning-based solutions for link prediction in Online Social Networks (OSNs) have been the subject of many research efforts. While most of them focus on the global and local properties of the graph structure surrounding links, a few also take into account additional contextual information, such as the textual content produced by OSN accounts. In this paper we focus on the latter solutions to i) evaluate the role of textual data in improving performance on the link prediction task in OSNs; and ii) identify the strengths and weaknesses of different machine learning approaches when dealing with properties extracted from text. We evaluated several methods, from well-established ones such as logistic regression or ensemble methods to more recent deep learning architectures for graph representation learning, on a novel dataset gathered from an emerging blockchain online social network. This dataset is a valuable testbed for link prediction evaluation since it offers high-resolution temporal data on link creation and textual data for each account. Our findings show that combining structural and textual features improves the prediction performance of traditional models. Deep learning architectures outperform the traditional ones and can also benefit from the addition of textual features. However, some textual attributes can reduce the predictive power of some deep architectures. In general, deep learning models are promising solutions for the link prediction task even with textual content, but may suffer from the introduction of structured properties inferred from the text.
Online social network; Link prediction; Graph neural networks; Temporal dataset
Sector INF/01 - Computer Science
Oct-2022
Book Part (author)
Files in this item:
2022_LinkPredictionWithContextualInformation_DS2022.pdf - Type: Pre-print (manuscript submitted to the publisher); restricted access; Adobe PDF; 340.25 kB
978-3-031-18840-4_16.pdf - Type: Publisher's version/PDF; restricted access; Adobe PDF; 337.06 kB
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/956573