The exponential growth of data-intensive research in the Life Sciences and other scientific domains has led to an increasing demand for computational resources across Europe. While major EU initiatives have supported the development of large-scale infrastructures, researchers still face significant challenges in accessing and integrating heterogeneous computing environments. Fragmentation in both technical architectures and policy frameworks continues to hinder seamless interoperability and efficient resource sharing. To address these challenges, we present the European Pulsar Network, a distributed computing architecture built upon the Galaxy workflow management system and the Pulsar job execution service. The network enables transparent job offloading from Galaxy servers to remote computing clusters, allowing resource scaling without requiring direct user intervention or awareness. This architecture supports a flexible and scalable approach to workload distribution, ensuring improved performance and resource availability across different infrastructures. A key component of the network is the Open Infrastructure framework, which facilitates the streamlined deployment of both Galaxy servers and Pulsar endpoints by compute providers. This is achieved through automation tools such as Terraform and Ansible, eliminating the need for manual reconfiguration of existing systems. This approach promotes simplified integration and ease of adoption by new sites. To date, the European Pulsar Network comprises six national Galaxy endpoints, other than the central European instance, and thirteen Pulsar endpoints distributed across Europe. This collaborative effort is actively enhancing the scalability, resilience, and interoperability of the Galaxy ecosystem in support of FAIR and reproducible data analysis.
Scaling Scientific Workflows in Europe: Architecture and Deployment of the Galaxy-Pulsar Computational Network / M. Antonio Tangaro, S. Nicotri, B. Grüning, S. Kumar Srikakulam, A. Dadras, O. Kaiser, M. Kuntz, A. Bretaudeau, P. De Geest, S. Luna-Valero, M. Chavero Díez, J. María Fernández González, S. Capella-Gutierrez, J. Lluís Gelpí, J. Astalos, B. Jurič, M. Ruda, Ł. Opioła, H. Bayındır, S. Gioiosa, G. De Sanctis, F. Zambelli. - (2025 Aug 07). [10.5281/zenodo.16761932]
Scaling Scientific Workflows in Europe: Architecture and Deployment of the Galaxy-Pulsar Computational Network
F. ZambelliUltimo
2025
Abstract
The exponential growth of data-intensive research in the Life Sciences and other scientific domains has led to an increasing demand for computational resources across Europe. While major EU initiatives have supported the development of large-scale infrastructures, researchers still face significant challenges in accessing and integrating heterogeneous computing environments. Fragmentation in both technical architectures and policy frameworks continues to hinder seamless interoperability and efficient resource sharing. To address these challenges, we present the European Pulsar Network, a distributed computing architecture built upon the Galaxy workflow management system and the Pulsar job execution service. The network enables transparent job offloading from Galaxy servers to remote computing clusters, allowing resource scaling without requiring direct user intervention or awareness. This architecture supports a flexible and scalable approach to workload distribution, ensuring improved performance and resource availability across different infrastructures. A key component of the network is the Open Infrastructure framework, which facilitates the streamlined deployment of both Galaxy servers and Pulsar endpoints by compute providers. This is achieved through automation tools such as Terraform and Ansible, eliminating the need for manual reconfiguration of existing systems. This approach promotes simplified integration and ease of adoption by new sites. To date, the European Pulsar Network comprises six national Galaxy endpoints, other than the central European instance, and thirteen Pulsar endpoints distributed across Europe. This collaborative effort is actively enhancing the scalability, resilience, and interoperability of the Galaxy ecosystem in support of FAIR and reproducible data analysis.| File | Dimensione | Formato | |
|---|---|---|---|
|
European_Pulsar_Network_manuscript.pdf
accesso aperto
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Licenza:
Creative commons
Dimensione
3.34 MB
Formato
Adobe PDF
|
3.34 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




