Functional Text Segmentation is the task of partitioning a textual document in segments that play a certain function. In the legal domain, this is important to support downstream tasks, but it faces also challenges of segment discontinuity, few-shot scenario, and domain specificity. We propose an approach that, revisiting the underlying graph structure of a Conditional Random Field and relying on a combination of neural embeddings and engineered features, is capable of addressing these challenges. Evaluation on a dataset of Italian case law decisions yields promising results.

Few-Shot Legal Text Segmentation via Rewiring Conditional Random Fields: A Preliminary Study / A. Ferrara, S. Picascia, D. Riva (LECTURE NOTES IN COMPUTER SCIENCE). - In: Advances in Conceptual Modeling / [a cura di] T.P. Sales, J. Araújo, J. Borbinha, G. Guizzardi. - Cham : Springer, 2023. - ISBN 9783031471117. - pp. 141-150 (( Intervento presentato al 42. convegno ER 2023 Workshops, CMLS, CMOMM4FAIR, EmpER, JUSMOD, OntoCom, QUAMES, and SmartFood tenutosi a Lisboa nel 2023 [10.1007/978-3-031-47112-4_13].

Few-Shot Legal Text Segmentation via Rewiring Conditional Random Fields: A Preliminary Study

A. Ferrara
Primo
;
S. Picascia
Secondo
;
D. Riva
Ultimo
2023

Abstract

Functional Text Segmentation is the task of partitioning a textual document in segments that play a certain function. In the legal domain, this is important to support downstream tasks, but it faces also challenges of segment discontinuity, few-shot scenario, and domain specificity. We propose an approach that, revisiting the underlying graph structure of a Conditional Random Field and relying on a combination of neural embeddings and engineered features, is capable of addressing these challenges. Evaluation on a dataset of Italian case law decisions yields promising results.
Conditional Random Fields; Legal Document Processing; Text Segmentation
Settore INF/01 - Informatica
   SEcurity and RIghts in the CyberSpace (SERICS)
   SERICS
   MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
   codice identificativo PE00000014
2023
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
FewShot-Legal-Text-Segmentation-via-Rewiring-Conditional-Random-Fields-A-Preliminary-StudyLecture-Notes-in-Computer-Science-including-subseries-Lecture-Notes-in-Artificial-Intelligence-and-Lecture-Notes-in-Bioinformatics.pdf

accesso riservato

Descrizione: Conference Paper
Tipologia: Publisher's version/PDF
Dimensione 1.2 MB
Formato Adobe PDF
1.2 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1019274
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact