Key Concepts
Text Corpus: In linguistics, a corpus (plural corpora), or text corpus, is a large, structured set of texts, now usually stored and processed electronically. Corpora are used for statistical analysis, for checking occurrences, and for validating linguistic rules within a specific domain of language use.
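As a rough illustration of checking occurrences in a corpus, the sketch below counts word frequencies over a tiny in-memory text collection. The sample sentences and the helper names `tokenize` and `word_frequencies` are assumptions made for illustration only; a real corpus would be far larger and would normally be loaded from files.

```python
import re
from collections import Counter

# Toy corpus: a hypothetical stand-in for a large, structured text collection.
CORPUS = [
    "The cat sat on the mat.",
    "The dog chased the cat.",
    "A corpus is a collection of texts.",
]

def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())

def word_frequencies(texts):
    """Count how often each word form occurs across all texts."""
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return counts

freq = word_frequencies(CORPUS)
total = sum(freq.values())

# Print the three most frequent word forms with raw and relative frequency.
for word, count in freq.most_common(3):
    print(f"{word}\t{count}\t{count / total:.3f}")
```

Relative frequency (count divided by total tokens) is shown alongside the raw count because it allows comparison between corpora of different sizes.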
Brown Corpus: The Brown Corpus of Standard American English (or simply the Brown Corpus) was compiled by Henry Kučera and W. Nelson Francis at Brown University, Providence, RI, as a general corpus (text collection) for corpus linguistics.
Bank of English: The Bank of English is the name of the COBUILD corpus, a collection of English texts. The texts are mainly British, but American and Australian data are also included.
Part-of-Speech Tagging: Part-of-speech tagging (POS tagging or POST), also called grammatical tagging, is the process of marking up each word in a text as corresponding to a particular part of speech, based on both its definition and its context, i.e. its relationship with adjacent and related words in a phrase, sentence, or paragraph.
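The interplay of definition and context can be sketched with a toy tagger. The lexicon, the tag names, and the two context rules below are all hypothetical and chosen only to show the idea; practical taggers are typically trained on annotated corpora rather than hand-written like this.

```python
# Toy lexicon: each word maps to the tags its definition allows (hypothetical tag set).
LEXICON = {
    "the": ["DET"],
    "a": ["DET"],
    "to": ["TO"],
    "i": ["PRON"],
    "want": ["VERB"],
    "read": ["VERB"],
    "book": ["NOUN", "VERB"],  # ambiguous: needs context to resolve
    "flight": ["NOUN"],
}

def tag(tokens):
    """Tag each token from its lexicon entry, using the previous tag to resolve ambiguity."""
    tagged = []
    prev_tag = None
    for token in tokens:
        # Unknown words default to NOUN (an assumption of this sketch).
        candidates = LEXICON.get(token.lower(), ["NOUN"])
        if len(candidates) == 1:
            choice = candidates[0]
        elif prev_tag == "DET" and "NOUN" in candidates:
            choice = "NOUN"  # after a determiner, prefer the noun reading
        elif prev_tag in ("TO", "PRON") and "VERB" in candidates:
            choice = "VERB"  # after "to" or a pronoun, prefer the verb reading
        else:
            choice = candidates[0]  # fall back to the first listed tag
        tagged.append((token, choice))
        prev_tag = choice
    return tagged

print(tag("I read the book".split()))
print(tag("I want to book a flight".split()))
```

In this sketch the word "book" is tagged as a noun after "the" and as a verb after "to", which is the kind of context-dependent decision the definition above describes.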
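Key References
He, Anping (2004). 《語料庫語言學與英語教學》 [Corpus Linguistics and English Teaching]. Beijing: Foreign Language Teaching and Research Press.
Yang, Huizhong (Ed.) (2002). 《語料庫語言學導論》 [An Introduction to Corpus Linguistics]. Shanghai: Shanghai Foreign Language Education Press.
Gavioli, L. (2005). Exploring corpora for ESP learning. Amsterdam: John Benjamins.
Editorial Committee, School of Foreign Languages and Cultures, South China Normal University (Eds.) (2005). 《語料庫語言學的研究與應用》 [Research and Applications of Corpus Linguistics]. Changchun: Northeast Normal University Press.
Kennedy, G. (2000). An introduction to corpus linguistics [語料庫語言學入門]. Beijing: Foreign Language Teaching and Research Press.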
Deignan, A. (2005). Metaphor and corpus linguistics. Amsterdam: John Benjamins. 
Dash, N. S. (2005). Corpus linguistics and language technology: With reference to Indian languages. New Delhi: Mittal Publications.
Connor, U. & Upton, T. A. (Eds.) (2004). Applied corpus linguistics: A multidimensional perspective. New York: Rodopi.
Halliday, M.A.K. et al. (2004). Lexicography and corpus linguistics: An introduction. New York: Continuum. 
Frontiers of the Field
Mark Davies, Brigham Young University: http://davies-linguistics./personal/
Susan Hunston, University of Birmingham: http://www.english./who/hunston.htm
Gary Kennedy, Ohio State University: http://www.math./~kennedy/
Wolfgang Teubert, University of Birmingham: http://www.english./who/teubert.htm
Corpus Linguistics 2007, the fourth Corpus Linguistics conference, University of Birmingham: http://www.corpus./conference2007/
International Journal of Corpus Linguistics: http://www./cgi-bin/t_seriesview.cgi?series=IJCL
The Inter-Varietal Applied Corpus Studies (IVACS): http://www.mic./ivacs/about.htm
British National Corpus: http://www.comp./computing/research/ucrel/bnc.html