corpus

English

WOTD – 10 June 2016

Pronunciation

  • (Received Pronunciation) IPA(key): /ˈkɔːpəs/
  • (General American) IPA(key): /ˈkɔɹpəs/
  • (file)
  • Rhymes: -ɔː(ɹ)pəs
  • Hyphenation: cor‧pus

Etymology 1

Borrowed from Latin corpus (body). Doublet of corpse, corps and riff.

Noun

corpus (plural corpora or corpuses)

  1. (linguistics) A collection of writings, often on a specific topic, of a specific genre, from a specific demographic or a particular author, etc.
    Synonyms: collection, compilation, aggregation; see also Thesaurus:body
    • 2007, Mihail Mihailov; Hannu Tommola, “Compiling Parallel Text Corpora: Towards Automation of Routine Procedures”, in Wolfgang Teubert, editor, Text Corpora and Multilingual Lexicography (Benjamins Current Topics; 8), Amsterdam: John Benjamins Publishing Company, →ISBN, page 60:
      Text corpora are being used in most current lexicographic projects. Applied linguistic research is another field where text corpora are welcome as an inexhaustible source of empirical information, a polygon for testing various linguistic tools – spell-checkers, OCRs, machine translation systems, NLP systems, etc.
    • 2008, Anabel Borja, “Corpora for Translators in Spain. The CDJ-GITRAD Corpus and the GENITT Project.”, in Gunilla [M.] Anderman and Margaret Rogers, editors, Incorporating Corpora: The Linguist and the Translator, Clevedon, North Somerset: Multilingual Matters, →ISBN, page 248:
      Comparable corpora are made up of texts in different languages that may be related in various ways, but are not translations of each other. They may have nothing in common at all, or be on the same subject, of the same genre, or from the same chronological period, etc.
    • 2013, “Introduction”, in Gerry Knowles, Briony Williams, and L[ita] Taylor, editors, A Corpus of Formal British English Speech: The Lancaster/IBM Spoken English Corpus, Abingdon, Oxon.; New York, N.Y.: Routledge, →ISBN, page 1:
      The Lancaster/IBM Spoken English Corpus began in September 1984 as part of a research project into the automatic assignment of intonation [] The original design of the corpus was determined by the need to provide data for research into speech synthesis. As a result, unlike most other corpora currently being used in the computational linguistics field, the SEC exists in several forms. [] However, whatever the original motivation for compiling a corpus, it quickly becomes an object of interest in its own right. New users find it valuable for applications for which it was not designed.
    • 2014, Giuseppina Balossi, “Corpus Approaches to the Study of Language and Literature”, in A Corpus Linguistic Approach to Literary Language and Characterization: Virginia Woolf's The Waves (Linguistic Approaches to Literature; 18), Amsterdam: John Benjamins Publishing Company, →ISBN, page 41:
      A corpus approach is a useful methodology for observing, describing and interpreting the stylistic features of language in literary and non-literary texts.
  2. (uncommon) A body, a collection.
    Synonyms: collection; see also Thesaurus:body
    • 1998, Dimitǎr Draganov, “New Coin Types of Hadrianopolis”, in Ulrike Peter, editor, Stephanos Nomismatikos: Edith Schönert-Geiss zum 65. Geburtstag (Griechisches Münzwerk), Berlin: Akademie Verlag, →ISBN Invalid ISBN, page 221:
      About a hundred years ago in Germany, the publishing of corpuses of the ancient Greek coinages was started. [] The significance of those, and some other corpuses is exclusive, because they allowed an enormous amount of numismatic material kept in museum and private collections all over the world, to be studied and systematized.
    • 2014, Margaret Darling; Barbara Precious, “Introduction”, in A Corpus of Roman Pottery from Lincoln (Lincoln Archaeological Studies; 6), Oxford: Oxbow Books, →ISBN, page 1:
      An assessment in 1991 proposed publication of the results of this work in three stages: [] secondly, a corpus of the Roman pottery to present the type series and to discuss the fabrics and forms recovered, []
Derived terms
Translations
The translations below need to be checked and inserted above into the appropriate translation tables, removing any numbers. Numbers do not necessarily match those in definitions. See instructions at Wiktionary:Entry layout#Translations.

Etymology 2

From German Corpus (10-point type), from its use in editions of the Corpus Juris.

Noun

corpus (uncountable)

  1. (printing, dated) Synonym of long primer
    • 1833, George Crabb, “Printing”, in Universal Technological Dictionary, or Familiar Explanation of the Terms Used in All Arts and Sciences, Containing Definitions Drawn from the Original Writers, and Illustrated by Plates, Epigrams, Cuts, &c., volume II, enlarged edition, London: Printed for Baldwin and Cradock, Paternoster-Row, and for the new proprietor, J. Dowding, 82, Newgate-Street, OCLC 65260870:
      Brevier had its name from being first used in the printing of the breviary; and the German Corpus, in English Long Primer, probably from its use in printing their Corpus Juris.
    • 1843, “Type-founding”, in The Penny Cyclopædia of the Society for the Diffusion of Useful Knowledge, volume XXV (Titles of Honour – Ungula), London: Charles Knight and Co., 22, Ludgate Street, OCLC 2041456, page 455:
      Long Primer. This neat type, which is much used for printing works in duodecimo, is called Petit Romain in France, and Corpus in Germany; the latter name being probably derived from its use in printing the 'Corpus Juris:' 89 m's of Long Primer go to a foot.

Anagrams


Catalan

Etymology

From Latin corpus. Doublet of cos.

Pronunciation

Noun

corpus m (plural corpus)

  1. corpus (a collection of writings)

Further reading


Dutch

Etymology

From Latin corpus.

Pronunciation

  • (file)

Noun

corpus n (plural corpussen, diminutive corpusje n)

  1. a collection of writings, a text corpus

Usage notes

The word retained the original Latin neuter gender. It is one of the few Dutch words ending on -us that is not masculine.


French

Etymology

Unadapted borrowing from Latin corpus (body). Doublet of corps.

Pronunciation

  • IPA(key): /kɔʁ.pys/

Noun

corpus m (plural corpus)

  1. (linguistics) a corpus, a body of texts

Further reading


Latin

Etymology

From Proto-Italic *korpos, from Proto-Indo-European *krep-.

Pronunciation

  • (Classical) IPA(key): /ˈkor.pus/, [ˈkɔr.pʊs]

Noun

corpus n (genitive corporis); third declension

  1. (anatomy) body, substance, material
    • Seneca Minor, Epistulae Morales ad Lucilium, Epistula XCII
      Nemo liber est qui corpori servit.
      No one is free who is a slave to the body.
  2. the flesh of an animal's body
  3. a corpse
  4. the trunk or shaft of something
  5. a frame, body, system, structure, community, corporation
  6. (figuratively) the wood under the bark of a tree
  7. (Medieval) a corpus (collection of writings by a single author or addressing a certain topic)

Inflection

Third declension neuter.

Case Singular Plural
Nominative corpus corpora
Genitive corporis corporum
Dative corporī corporibus
Accusative corpus corpora
Ablative corpore corporibus
Vocative corpus corpora

Derived terms

Descendants

Further reading

  • corpus in Charlton T. Lewis and Charles Short (1879) A Latin Dictionary, Oxford: Clarendon Press
  • corpus in Charlton T. Lewis (1891) An Elementary Latin Dictionary, New York: Harper & Brothers
  • corpus in Charles du Fresne du Cange’s Glossarium Mediæ et Infimæ Latinitatis (augmented edition, 1883–1887)
  • corpus in Gaffiot, Félix (1934) Dictionnaire Illustré Latin-Français, Hachette
  • Carl Meissner; Henry William Auden (1894) Latin Phrase-Book, London: Macmillan and Co.
    • to spread over the whole body: per totum corpus diffundi
    • bodily strength: vires corporis or merely vires
    • a good constitution: firma corporis constitutio or affectio
    • sensual pleasure: voluptates (corporis)
    • to refresh oneself, minister to one's bodily wants: corpus curare (cibo, vino, somno)
    • to devote oneself body and soul to the good of the state: totum et animo et corpore in salutem rei publicae se conferre
    • the free men are sold as slaves: libera corpora sub corona (hasta) veneunt (B. G. 3. 16. 4)
    • wounds (scars) on the breast: vulnera adverso corpore accepta
  • corpus in William Smith et al., editor (1890) A Dictionary of Greek and Roman Antiquities, London: William Wayte. G. E. Marindin
  • Sihler, Andrew L. (1995) New Comparative Grammar of Greek and Latin, Oxford, New York: Oxford University Press, →ISBN

Anagrams



Portuguese

Etymology

Borrowed from Latin corpus. Doublet of the inherited corpo.

Noun

corpus m (plural corpora or corpus)

  1. corpus (collection of writings)

Spanish

Etymology

Borrowed from Latin corpus, possibly through the intermediate of English corpus, according to the RAE[1]. Doublet of the inherited cuerpo.

Pronunciation

  • IPA(key): /ˈkorpus/

Noun

corpus m (plural corpus)

  1. corpus (a collection of writings)

References

This article is issued from Wiktionary. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.