Read More
Date: 2024-02-01
688
Date: 14-1-2022
747
Date: 2023-09-20
862
|
corpus, plural corpora (n.)
A collection of LINGUISTIC DATA, either written texts or a TRANSCRIPTION of recorded speech, which can be used as a starting-point of linguistic description or as a means of verifying hypotheses about a LANGUAGE (corpus linguistics). Linguistic DESCRIPTIONS which are ‘corpusrestricted’ have been the subject of criticism, especially by GENERATIVE GRAMMARIANS, who point to the limitations of corpora (e.g. that they are samples of PERFORMANCE only, and that one still needs a means of PROJECTING beyond the corpus to the language as a whole). In fieldwork on a new language, or in HISTORICAL study, it may be very difficult to get beyond one’s corpus (i.e. it is a ‘closed’ as opposed to an ‘extendable’ corpus), but in languages where linguists have regular access to NATIVE-SPEAKERS (and may be native-speakers themselves) their approach will invariably be ‘corpus-based’, rather than corpus-restricted. Corpora provide the basis for one kind of COMPUTATIONAL LINGUISTICS. A computer corpus is a large body of machine-readable texts. Increasingly large corpora (especially of English) have been compiled since the 1980s, and are used both in the development of natural language processing software and in such applications as lexicography, speech recognition, and machine translation.
|
|
دراسة يابانية لتقليل مخاطر أمراض المواليد منخفضي الوزن
|
|
|
|
|
اكتشاف أكبر مرجان في العالم قبالة سواحل جزر سليمان
|
|
|
|
|
اتحاد كليات الطب الملكية البريطانية يشيد بالمستوى العلمي لطلبة جامعة العميد وبيئتها التعليمية
|
|
|