D 2016

THE DEVELOPMENT OF CORPUS LINGUISTICS IN THE 20TH CENTURY

SLÁDKOVÁ, Věra

Basic information

Original name

THE DEVELOPMENT OF CORPUS LINGUISTICS IN THE 20TH CENTURY

Name in Czech

VÝVOJ KORPUSOVÉ LINGVISTIKY VE 20. STOLETÍ

Authors

SLÁDKOVÁ, Věra (203 Czech Republic, guarantor, belonging to the institution)

Edition

Hradec Králové, MMK 2016: Mezinárodní Masarykova konference pro doktorandy a mladé vědecké pracovníky, p. 679-686, 8 pp. 2016

Publisher

Magnanimitas

Other information

Language

English

Type of outcome

Proceedings paper

Field of Study

60200 6.2 Languages and Literature

Country of publisher

Czech Republic

Confidentiality degree

is not subject to a state or trade secret

Publication form

printed version "print"

RIV identification code

RIV/75081431:_____/16:00000892

Organization unit

Institute of Technology and Business in České Budějovice

ISBN

978-80-87952-17-7

Keywords (in Czech)

korpusová lingvistika; korpusy; anotace; označování slovních druhů

Keywords (in English)

corpus linguistics; corpora; annotation; part-of-speech tagging

Tags

Changed: 4/1/2017 09:13, Hana Dlouhá

Abstract

In the original

Corpus linguistics has become the major methodology for studying language, and the use of corpora as research tools has prompted the creation of software applications, which have been developing since the 1960s. This paper explains what corpora are, gives a brief overview of their types, and traces corpus annotation from a diachronic perspective. It also looks for the roots of this scientific field in the very beginnings of linguistics and describes the work of the pioneers in the humanities, who were not discouraged by Chomsky and other critics of empirical methods and who laid the foundations for the Brown Corpus of American English, which later served as a reference tool for creating the LOB Corpus of British English and as a model for other corpora. The most significant corpus projects that affected linguistic research, dictionary building, the creation of reference books, and computational linguistics are presented from a historical perspective up to the 1990s, with a view to paying tribute to a few scientists of Czech origin who also considerably influenced the field and to illustrating how exciting corpus linguistics was at the very start.