Enforcing legal information extraction through context-aware techniques: The ASKE approach

Castano, S.; Ferrara, A.; Furiosi, E.; Montanelli, S.; Picascia, S.; Riva, D.; Stefanetti, C.

doi:10.1016/j.clsr.2023.105903

To cope with the growing volume, complexity, and articulation of legal documents as well as to foster digital justice and digital law, increasing effort is being devoted to legal knowledge extraction and digital transformation processes. In this paper, we present the ASKE (Automated System for Knowledge Extraction) approach to legal knowledge extraction, based on a combination of context-aware embedding models and zero-shot learning techniques into a three-phase extraction cycle, which is executed a number of times (called generations) to progressively extract concepts representative of the different meanings of terminology used in legal documents chunks. A graph-based data structure called ASKE Conceptual Graph is initially populated through a data preparation step, and it is continuously enriched at each ASKE generation with results of document chunk classification, new extracted terminology, and newly derived concepts. A quantitative evaluation of ASKE knowledge extraction and document classification is provided by considering the EurLex dataset. Furthermore, we present the results of applying ASKE to a real case-study of Italian case law decisions with qualitative feedback from legal experts in the framework of an ongoing national research project.

Enforcing legal information extraction through context-aware techniques: The ASKE approach / S. Castano, A. Ferrara, E. Furiosi, S. Montanelli, S. Picascia, D. Riva, C. Stefanetti. - In: COMPUTER LAW & SECURITY REVIEW. - ISSN 2212-4748. - 52:(2024 Apr), pp. 105903.1-105903.14. [10.1016/j.clsr.2023.105903]

Enforcing legal information extraction through context-aware techniques: The ASKE approach

S. Castano^Primo;A. Ferrara^Secondo;Emanuela Furiosi;S. Montanelli;S. Picascia;D. Riva^Penultimo;C. Stefanetti^Ultimo

2024

Abstract

To cope with the growing volume, complexity, and articulation of legal documents as well as to foster digital justice and digital law, increasing effort is being devoted to legal knowledge extraction and digital transformation processes. In this paper, we present the ASKE (Automated System for Knowledge Extraction) approach to legal knowledge extraction, based on a combination of context-aware embedding models and zero-shot learning techniques into a three-phase extraction cycle, which is executed a number of times (called generations) to progressively extract concepts representative of the different meanings of terminology used in legal documents chunks. A graph-based data structure called ASKE Conceptual Graph is initially populated through a data preparation step, and it is continuously enriched at each ASKE generation with results of document chunk classification, new extracted terminology, and newly derived concepts. A quantitative evaluation of ASKE knowledge extraction and document classification is provided by considering the EurLex dataset. Furthermore, we present the results of applying ASKE to a real case-study of Italian case law decisions with qualitative feedback from legal experts in the framework of an ongoing national research project.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Digital justice; Legal knowledge extraction; Legal knowledge graph; Natural Language Processing
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore INF/01 - Informatica
			
	Settori scientifico-disciplinari dell'articolo (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
			
	Titolo del progetto
	
	Titolo Progetto
	
									SEcurity and RIghts in the CyberSpace (SERICS)
								
	Acronimo
	
									SERICS
								
	Nome finanziatore
	
										MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
									
	N. Contratto
	
									codice identificativo PE00000014
								
	Data di pubblicazione
	
				apr-2024
			
	Data ahead of print o data di stampa
	
				24-ott-2023
			
	Rivista in ANCE
	
				COMPUTER LAW & SECURITY REVIEW
			
	DOI
	
				https://dx.doi.org/10.1016/j.clsr.2023.105903
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
2024 - CLSR.pdf accesso aperto Tipologia: Publisher's version/PDF Licenza: Creative commons Dimensione 1.49 MB Formato Adobe PDF Visualizza/Apri	1.49 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1018649

Citazioni

ND

16

11

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca