Managing legal documents, particularly court judgments, can pose a significant challenge due to the extensive amount of data involved. Traditional methods of document management are no longer adequate as the data volume continues to grow, necessitating more advanced and efficient systems. To tackle this issue, a proposed infrastructure aims to establish a structured repository of textual documents and enhance them with annotations to facilitate various subsequent tasks. The framework is designed with sustainability in mind, allowing for multiple services and applications of the annotated document repository while taking into account the limited availability of annotated data. By employing a combination of machine learning and syntactic rules, a set of Natural Language Processing (NLP) services pre-processes and iteratively annotates the documents. This approach ensures that the resulting annotations align with the organizational processes utilized in Italian courts. The solution’s feasibility was demonstrated through experiments that employed different low-resource methods and solutions, effectively integrating these approaches in a meaningful manner.

A Service Infrastructure for Management of Legal Documents / V. Bellandi, S. Castano, A. Ferrara, S. Montanelli, D. Riva, S. Siccardi (CEUR WORKSHOP PROCEEDINGS). - In: ITADATA 2023 : Italian Conference on Big Data and Data Science 2023 / [a cura di] N. Bena, B. Di Martino, A. Maratea, A. Sperduti, E. Di Nardo, A. Ciaramella, R. Montella, C.A. Ardagna. - [s.l] : CEUR-WS, 2023. - pp. 1-8 (( Intervento presentato al 2. convegno Italian Conference on Big Data and Data Science, ITADATA 2023 tenutosi a Napoli nel 2023.

A Service Infrastructure for Management of Legal Documents

V. Bellandi;S. Castano;A. Ferrara;S. Montanelli;D. Riva;S. Siccardi
2023

Abstract

Managing legal documents, particularly court judgments, can pose a significant challenge due to the extensive amount of data involved. Traditional methods of document management are no longer adequate as the data volume continues to grow, necessitating more advanced and efficient systems. To tackle this issue, a proposed infrastructure aims to establish a structured repository of textual documents and enhance them with annotations to facilitate various subsequent tasks. The framework is designed with sustainability in mind, allowing for multiple services and applications of the annotated document repository while taking into account the limited availability of annotated data. By employing a combination of machine learning and syntactic rules, a set of Natural Language Processing (NLP) services pre-processes and iteratively annotates the documents. This approach ensures that the resulting annotations align with the organizational processes utilized in Italian courts. The solution’s feasibility was demonstrated through experiments that employed different low-resource methods and solutions, effectively integrating these approaches in a meaningful manner.
Concept Extraction; Legal Document Annotation; Named Entity Recognition; Zero-Shot Learning
Settore INF/01 - Informatica
2023
https://ceur-ws.org/Vol-3606/paper42.pdf
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
paper42.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 1.07 MB
Formato Adobe PDF
1.07 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1027681
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact