Machine learning-based solutions for link prediction in Online Social Networks (OSNs) have been the subject of many research efforts. While most of them focus mainly on the global and local properties of the graph structure surrounding links, a few also take into account additional contextual information, such as the textual content produced by OSN accounts. In this paper we focus on the latter solutions to i) evaluate the role of textual data in enhancing performance in the link prediction task on OSNs; and ii) identify the strengths and weaknesses of different machine learning approaches when dealing with properties extracted from text. We evaluated several tools, from well-established methods such as logistic regression and ensemble methods to more recent deep learning architectures for graph representation learning, on a novel dataset gathered from an emerging blockchain online social network. This dataset is a valuable playground for link prediction evaluation since it offers high-resolution temporal data on link creation and textual data for each account. Our findings show that combining structural and textual features enhances the prediction performance of traditional models. Deep learning architectures outperform the traditional ones and can also benefit from the addition of textual features. However, some textual attributes can also reduce the predictive power of some deep architectures. In general, deep learning models are promising solutions even for the link prediction task with textual content, but may suffer from the introduction of structured properties inferred from the text.
Link Prediction with Text in Online Social Networks: The Role of Textual Content on High-Resolution Temporal Data / M. Dileo, C.T. Ba, M. Zignani, S. Gaito (LECTURE NOTES IN COMPUTER SCIENCE). - In: Discovery Science / edited by P. Pascal, D. Ienco. - [s.l] : Springer, 2022 Oct. - ISBN 978-3-031-18839-8. - pp. 212-226 (Paper presented at the 25th International Conference on Discovery Science, held in Montpellier in 2022) [10.1007/978-3-031-18840-4_16].
Link Prediction with Text in Online Social Networks: The Role of Textual Content on High-Resolution Temporal Data
M. Dileo; C.T. Ba; M. Zignani; S. Gaito
2022
File | Type | Access | Size | Format
---|---|---|---|---
2022_LinkPredictionWithContextualInformation_DS2022.pdf | Pre-print (manuscript submitted to the publisher) | Restricted access | 340.25 kB | Adobe PDF
978-3-031-18840-4_16.pdf | Publisher's version/PDF | Restricted access | 337.06 kB | Adobe PDF