IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

Corpora are nowadays the primary source of many dictionaries and the core element of various platforms and information systems. Lexicographers have therefore a variety of new possibilities which were unthinkable in the past with different types of corpora available for use in the data collection phase of the lexicographic process (Flinz 2021). Not only lexicographers, but also other types of users (academics, translators, teachers, students etc.) can profit from them, especially when corpora are public and can be accessed using corpus linguistic tools (Ballestracci/Buffagni/Flinz 2020; Flinz/Farina 2020). The LBC-corpora are monolingual specialized comparable corpora, already online (http://corpora.lessicobeniculturali.net/) and, as monitor corpora (Lemnitzer/Zinsmeister 2015: 140), they will be augmented over time. They can be analysed using the open source tool NoSketchEngine (Billero 2020). The LBC-Corpora are also the lexicographic primary source of the LBC multilingual dictionary, which is in preparation: the provisional entry lists of different languages (Spanish, German, and French) are now ready (Billero/Farina/Nicolás Martínez 2020) and together with a selection of KWICS, which have been carefully selected following a quantitative-qualitative procedure (for German see Buffagni/Flinz/Ballestracci in prep.), will soon be online (Flinz et al. in prep.). The purpose of this paper is to reflect on the LBC-corpora from a double perspective: from the user of the LBC-platform and from the lexicographic team. In the first case following an overview of the principal characteristics of the LBC-Platform, the focus will be on the accessible corpora showing the tools which can be used (§ 2). In the second case the LBC-corpora will be examined in their function as a data basis for the LBC-Dictionary (§ 3). The attention will be on the data preparation phase: after dis cussing the procedure for the realization of the LBC-provisional lemma candidate lists, the focus will be on the adopted procedure for finding equivalence relations and for the individuation of other types of relations between the entries (synonymy, belonging to the same semantic field etc.). In § 4 the focus will be on the LBC-provisional lemma candidate lists and their related KWICs. Conclusions and an outlook to the future can be found in the last section (§ 5).

The multifunctional LBC-Corpora: different aims depending from the user / C. Flinz. - In: LEXICOGRAPHICA. - ISSN 0175-6206. - 39:1(2023 Nov 22), pp. 191-208. [10.1515/lex-2023-0010]

The multifunctional LBC-Corpora: different aims depending from the user.

C. Flinz

2023

Abstract

Corpora are nowadays the primary source of many dictionaries and the core element of various platforms and information systems. Lexicographers have therefore a variety of new possibilities which were unthinkable in the past with different types of corpora available for use in the data collection phase of the lexicographic process (Flinz 2021). Not only lexicographers, but also other types of users (academics, translators, teachers, students etc.) can profit from them, especially when corpora are public and can be accessed using corpus linguistic tools (Ballestracci/Buffagni/Flinz 2020; Flinz/Farina 2020). The LBC-corpora are monolingual specialized comparable corpora, already online (http://corpora.lessicobeniculturali.net/) and, as monitor corpora (Lemnitzer/Zinsmeister 2015: 140), they will be augmented over time. They can be analysed using the open source tool NoSketchEngine (Billero 2020). The LBC-Corpora are also the lexicographic primary source of the LBC multilingual dictionary, which is in preparation: the provisional entry lists of different languages (Spanish, German, and French) are now ready (Billero/Farina/Nicolás Martínez 2020) and together with a selection of KWICS, which have been carefully selected following a quantitative-qualitative procedure (for German see Buffagni/Flinz/Ballestracci in prep.), will soon be online (Flinz et al. in prep.). The purpose of this paper is to reflect on the LBC-corpora from a double perspective: from the user of the LBC-platform and from the lexicographic team. In the first case following an overview of the principal characteristics of the LBC-Platform, the focus will be on the accessible corpora showing the tools which can be used (§ 2). In the second case the LBC-corpora will be examined in their function as a data basis for the LBC-Dictionary (§ 3). The attention will be on the data preparation phase: after dis cussing the procedure for the realization of the LBC-provisional lemma candidate lists, the focus will be on the adopted procedure for finding equivalence relations and for the individuation of other types of relations between the entries (synonymy, belonging to the same semantic field etc.). In § 4 the focus will be on the LBC-provisional lemma candidate lists and their related KWICs. Conclusions and an outlook to the future can be found in the last section (§ 5).

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				lexical information system; cultural heritage;  LSP-dictionary; corpora;
lemma candidate list; KWICs
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore L-LIN/14 - Lingua e Traduzione - Lingua Tedesca
			
	Data di pubblicazione
	
				22-nov-2023
			
	Rivista in ANCE
	
				LEXICOGRAPHICA
			
	DOI
	
				https://dx.doi.org/10.1515/lex-2023-0010
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
Flinz_Lexicographica-2023-0010.pdf Open Access dal 23/11/2024 Tipologia: Publisher's version/PDF Dimensione 819.73 kB Formato Adobe PDF Visualizza/Apri	819.73 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1018475

Citazioni

ND

0

0

ND

social impact