Resources in our repository cover different languages, types of material, and categories of material.
Because these resources are stored and described using standards, they can all be listed from a single access point, as demonstrated here.
[From the website]. The Wordtheque is a powerful interface with a massive database (currently 707.737.941 words) containing multilingual novels, technical literature and translated texts. Hits are highlighted in context windows that can be expanded up or down. To go to the source web pages (novels, etc.)
[From the authors' website]. The CHILDES system provides tools for studying conversational interactions. These tools include a database of transcripts, programs for computer analysis of transcripts, methods for linguistic coding, and systems for linking transcripts to digitized audio and video.
We propose to: gather corpora of copyright-free French texts;
structure the texts by adding tags that delimit the different parts of each work;
maintain a page of links to online articles about these works;
moderate question forums about the works, their authors and their contexts;
moderate a classified-ads service for exchanging and selling various editions of the works;
introduce Progsession for group work.
We hope to complete in 2007 a first phase covering 3000 works, more than 2000 of which can already be downloaded.
Words in English by Suzanne Kemmer (Rice University)
[from the website] This website is a resource for those who want to learn more about this fascinating language [i.e., English] – its history as a language, the origins of its words, and its current modern characteristics.
The Great Vowel Shift
[from the website] This site is designed for my students--undergraduates with limited linguistic knowledge who are being introduced to the Great Vowel Shift. There are topics I do not discuss in this site because they are too basic, too complicated, or too controversial for this audience.
The History of English Phonemes
[from the website] This Website is designed to help students of the English language trace the development of the phonemes of English from the Old English period into Present-Day English. The information contained in the site is available in any good textbook on the history of the language, but printed texts normally present the information in a linear fashion corresponding to the chronological development of English. The value of the Website is the hypertextual treatment of the information, which is meant to keep students from having to spend a great deal of time leafing through textbooks.
[from the website] This site is devoted to the origins of words and phrases, or as a linguist would put it, to etymology. Etymology is the study of word origins. (It is not the study of insects; that is entomology.) Where words come from is a fascinating subject, full of folklore and historical lessons. Often, popular tales of a word's origin arise. Sometimes these are true; more often they are not. While it often seems disappointing when a neat little tale turns out to be untrue, almost invariably the true origin is just as interesting.
A Collection of Sentence Oddities and Trivia
Covers topics such as pangrams, palindromes, and plurals.
Vivian Cook's website
Various informative pages on Writing Systems and Second Language Acquisition. Includes a linguistics glossary and an extensive bibliography on Second Language Acquisition.
Textes en français historique
[Copied from the website] "This is not a history of singing. Nor is it a history of French. It is a history of sung French. From the outset, singing has been a form of discourse in its own right, obeying its own rules. The history of these rules, which define the field of declamation, is retraced here. Touching on several disciplines, this study is addressed first and foremost to singers who perform early music and who struggle to find answers to their questions in specialized treatises. Perhaps it will also interest a few linguists who are not indifferent to singing and music and, who knows, other curious minds."
Old French On The Web
A Website Devoted to the Language and Literature of Old French
[From the website] On this webpage you will find an annotated reference system to find everything related to Corpus Linguistics that is available on the Internet: Corpora, Concordances, Corpus Linguistics research efforts and events, software for tagging, annotation etc.
Devoted to Corpora (Bookmarks for Corpus-based Linguists)
[From the website] These annotated links (c. 1,000 of them) are meant mainly for linguists and language teachers who work with corpora, not computational linguists/NLP (natural language processing) people, so although the language-engineering-type links here are fairly extensive, they are not exhaustive (for such info, you'll have to look elsewhere). Stuff here also represent my personal interests and biases (which will be obvious in some of my descriptive notes) and consequently there may be gaps, errors and omissions which you are welcome to tell me about. The English language bias on these pages will, I hope, be forgiven.
ELDA (Language Resources Distribution)
[From the website] Our catalogue of language resources currently gathers around 700 spoken and written language resources. It can be accessed from the ELRA web site and from the ELDA web site. The identification and the collection of existing language resources is part of our regular activity. The new resources we have collected, once the catalogue has been updated, are announced on some mailing lists, as well as in the ELRA members' news and in the quarterly ELRA newsletter.
EURALEX (European Association for Lexicography)
[From the website] EURALEX is the European Association for Lexicography: an international association which was founded in 1983, with the aims of furthering all aspects of the broad field of lexicography, and of promoting the exchange of ideas and information. It is committed to the development of lexicography in all European languages (as well as other non-European languages). EURALEX's interests include dictionaries of all kinds (monolingual, bilingual, and multilingual, general and specialist, in book and in machine-readable form); metalexicography, the theory of lexicography, and the history of lexicography; the praxis of dictionary-making; dictionary use; terminology and terminography; corpus lexicography; computational lexicography and dictionaries for natural language processing; and lexicology in general.
Linguistic Data Resources on the Internet
A topically organized list of language data resources on the Internet.
Archives for Language and Machine Learning
[From the website] SIGLEX, a Special Interest Group on the Lexicon of the Association for Computational Linguistics, provides an umbrella for research interests on lexical issues ranging from lexicography and the use of online dictionaries to computational lexical semantics. SIGLEX is also the umbrella organization for SENSEVAL, evaluation exercises for Word Sense Disambiguation.
History of the English Language
A list of links about the English language and its historical changes.
Une Histoire de la langue française @ Globe-Gate
A collection of nearly 100 links related to French, its dialects, and its historical changes.