The exponential growth of data-intensive research in the Life Sciences and other scientific domains has led to an increasing demand for computational resources across Europe. While major EU initiatives have supported the development of large-scale infrastructures, researchers still face significant challenges in accessing and integrating heterogeneous computing environments. Fragmentation in both technical architectures and policy frameworks continues to hinder seamless interoperability and efficient resource sharing. To address these challenges, we present the European Pulsar Network, a distributed computing architecture built upon the Galaxy workflow management system and the Pulsar job execution service. The network enables transparent job offloading from Galaxy servers to remote computing clusters, allowing resource scaling without requiring direct user intervention or awareness. This architecture supports a flexible and scalable approach to workload distribution, ensuring improved performance and resource availability across different infrastructures. A key component of the network is the Open Infrastructure framework, which facilitates the streamlined deployment of both Galaxy servers and Pulsar endpoints by compute providers. This is achieved through automation tools such as Terraform and Ansible, eliminating the need for manual reconfiguration of existing systems. This approach promotes simplified integration and ease of adoption by new sites. To date, the European Pulsar Network comprises six national Galaxy endpoints, other than the central European instance, and thirteen Pulsar endpoints distributed across Europe. This collaborative effort is actively enhancing the scalability, resilience, and interoperability of the Galaxy ecosystem in support of FAIR and reproducible data analysis.

Scaling Scientific Workflows in Europe: Architecture and Deployment of the Galaxy-Pulsar Computational Network / M. Antonio Tangaro, S. Nicotri, B. Grüning, S. Kumar Srikakulam, A. Dadras, O. Kaiser, M. Kuntz, A. Bretaudeau, P. De Geest, S. Luna-Valero, M. Chavero Díez, J. María Fernández González, S. Capella-Gutierrez, J. Lluís Gelpí, J. Astalos, B. Jurič, M. Ruda, Ł. Opioła, H. Bayındır, S. Gioiosa, G. De Sanctis, F. Zambelli. - (2025 Aug 07). [10.5281/zenodo.16761932]

Scaling Scientific Workflows in Europe: Architecture and Deployment of the Galaxy-Pulsar Computational Network

F. Zambelli
Ultimo
2025

Abstract

The exponential growth of data-intensive research in the Life Sciences and other scientific domains has led to an increasing demand for computational resources across Europe. While major EU initiatives have supported the development of large-scale infrastructures, researchers still face significant challenges in accessing and integrating heterogeneous computing environments. Fragmentation in both technical architectures and policy frameworks continues to hinder seamless interoperability and efficient resource sharing. To address these challenges, we present the European Pulsar Network, a distributed computing architecture built upon the Galaxy workflow management system and the Pulsar job execution service. The network enables transparent job offloading from Galaxy servers to remote computing clusters, allowing resource scaling without requiring direct user intervention or awareness. This architecture supports a flexible and scalable approach to workload distribution, ensuring improved performance and resource availability across different infrastructures. A key component of the network is the Open Infrastructure framework, which facilitates the streamlined deployment of both Galaxy servers and Pulsar endpoints by compute providers. This is achieved through automation tools such as Terraform and Ansible, eliminating the need for manual reconfiguration of existing systems. This approach promotes simplified integration and ease of adoption by new sites. To date, the European Pulsar Network comprises six national Galaxy endpoints, other than the central European instance, and thirteen Pulsar endpoints distributed across Europe. This collaborative effort is actively enhancing the scalability, resilience, and interoperability of the Galaxy ecosystem in support of FAIR and reproducible data analysis.
Settore BIOS-01/D - Biologia farmaceutica
Settore INFO-01/A - Informatica
   leveraging the European compute infrastructures for data-intensive research guided by FAIR principles
   EuroScienceGateway
   European Commission
   Horizon Europe Framework Programme
   101057388
7-ago-2025
https://zenodo.org/records/16761933
File in questo prodotto:
File Dimensione Formato  
European_Pulsar_Network_manuscript.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Licenza: Creative commons
Dimensione 3.34 MB
Formato Adobe PDF
3.34 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1178836
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact