Here
Freely available corpora
Middle English The Linguistics Department at the University of Pennsylvania offers the Penn-Helsinki Parsed Corpus of Middle English, a database of 510,000 words of syntactically parsed Middle English text for use by historical linguists Spanish Three Spanish corpora are freely available in Internet for research purposes: Spoken Peninsular Spanish (1 Mi words) Written Argentinian Spanish (2 Mi words) Written Chilean Corpus (2 Mi words) These corpora have a basic tagging in a SGML and TEI related form, easy to convert to the latest versions.
Check them at http://www.lllf.uam.es/
Institutions
Norwegian Computing Centre for the Humanities (NCCH) with the International Computer Archive of Modern English (ICAME) ELSNET
Projects
TELRI
Distribution Institutions
Linguistic Data Consortium (LDC) ELRA
Others
The British National Corpus (BNC) Cobuild Direct (BOE) Encyclopedia Britannica (beta)
Speech
ShATR - A Corpus for Auditory Scene Analysis