Corpora found their way into lexicography a long time ago and are also used as the primary sources of many contemporary dictionaries. Their use in the lexicographical process has opened up a variety of new possibilities (Lemnitzer/ Zinsmeister 2015: 170) that were previously unthinkable with traditional collections of documents. Corpora are also the lexicographic primary source of Tourlex, a newly conceived bilingual wiki-based resource, designed to support future employees of the tourism industry, with a special focus on collocations (Flinz 2019). In particular specific partial corpora are used for different lexicographic work steps (Wolf 2010: 23): the creation of a small specialized comparison corpus (Lemnitzer/Zinsmeister 2015: 138) based on the text type „General Terms and Conditions of Travel“ and large corpora, such as the reference corpus DeReKo and German Web 2013. In addition, documents and contexts were also researched from the Internet. The purpose of this paper is to reflect on the corpora that have been used as a data basis for Tourlex in order to show that both small and large corpora can be used for different lexicographic purposes: The prerequisite for the use of different corpora, however, is that the respective objectives are defined in advance. After an overview of the use of corpora in the phase of data collection in the lexicographic process of dictionary projects (§ 2), the primary sources (cf. Wiegand 1998: 140) of Tourlex and the used approach are presented (§ 3). In the fourth section, the extraction of the lemma candidate list and the finding of equivalence relations both of individual lexemes and of collocations are described in detail. The model used is systematized and its application is presented exemplarily.

Korpora als primäre Quellen von Tourlex / C. Flinz (LEXICOGRAPHICA. SERIES MAIOR). - In: Korpora in der Lexikographie und Phraseologie / [a cura di] M. Piosok, J. Taborek, M. Woznicka. - [s.l] : de Gruyter, 2021. - ISBN 9783110716801. - pp. 57-83 [10.1515/9783110716955-004]

Korpora als primäre Quellen von Tourlex

C. Flinz
2021

Abstract

Corpora found their way into lexicography a long time ago and are also used as the primary sources of many contemporary dictionaries. Their use in the lexicographical process has opened up a variety of new possibilities (Lemnitzer/ Zinsmeister 2015: 170) that were previously unthinkable with traditional collections of documents. Corpora are also the lexicographic primary source of Tourlex, a newly conceived bilingual wiki-based resource, designed to support future employees of the tourism industry, with a special focus on collocations (Flinz 2019). In particular specific partial corpora are used for different lexicographic work steps (Wolf 2010: 23): the creation of a small specialized comparison corpus (Lemnitzer/Zinsmeister 2015: 138) based on the text type „General Terms and Conditions of Travel“ and large corpora, such as the reference corpus DeReKo and German Web 2013. In addition, documents and contexts were also researched from the Internet. The purpose of this paper is to reflect on the corpora that have been used as a data basis for Tourlex in order to show that both small and large corpora can be used for different lexicographic purposes: The prerequisite for the use of different corpora, however, is that the respective objectives are defined in advance. After an overview of the use of corpora in the phase of data collection in the lexicographic process of dictionary projects (§ 2), the primary sources (cf. Wiegand 1998: 140) of Tourlex and the used approach are presented (§ 3). In the fourth section, the extraction of the lemma candidate list and the finding of equivalence relations both of individual lexemes and of collocations are described in detail. The model used is systematized and its application is presented exemplarily.
LSP dictionaries; corpora; collocations; equivalents
Settore L-LIN/14 - Lingua e Traduzione - Lingua Tedesca
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
Flinz_De_Gruyter.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 599.3 kB
Formato Adobe PDF
599.3 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Caricamento pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/841739
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact