Breton/Breton
From the LDC Language Resource Wiki
(Difference between revisions)
(→Miscellaneous) |
(→Writing) |
||
Line 31: | Line 31: | ||
====Writing==== | ====Writing==== | ||
- | * | + | * [http://www.omniglot.com/writing/breton.htm Omniglot]. |
==Linguistic resources== | ==Linguistic resources== |
Revision as of 22:33, 16 April 2010
Under construction Home > Breton
Contents |
General
Language summary
2010-04-7
- ISO 639-3 code: bre
- Population:
- 500,000 in France (1989 International Committee for the Defense of the Breton Language).
- 1,200,000 know Breton who do not regularly use it.
- Population total all countries: 500,045.
- 188,568 people (Ya d'ar brezhoneg, 2010)
- Also spoken in: -
- Alternate names: -
- Dialects: -
- Classification: Indo-European, Celtic, Insular, Brythonic
Linguistic notes
Writing
Linguistic resources
Overview
Grammar
Lexicon
- (GFDL) Breton Wiktionary :: Rummad:Brezhoneg
- (GFDL) French Wiktionary :: Catégorie::breton
- (GFDL) English Wiktionary :: Category:Breton
Topical word lists
Names
- Babynology: List of Breton baby names in Roman transliteration
Monographs
Linguistic portals and bibliographies
Data Sources
Monolingual Text
- EMILLE ONLY FOR: Bengali, Panjabi, Tamil, and Urdu EMILLE corpus. Approximately NUMBERS HERE words. Free license for non-profit research use. Documentation
News
- Agence Bretagne Presse
- Bremaik (weekly news articles)
Blogs
Parallel Text
- EMILLE corpus. ONLY FOR: Bengali, Panjabi, Tamil, and Urdu 200,000 words of text in English (information leaflets from the UK Government and various local authorities) with Breton translation. Free license for non-profit research use.
- MultiKulti Langs listed: Albanian, Arabic, Bengali, Chinese, English, Farsi, French, Gujarati, Somali, Spanish, Portuguese, Turkish, Urdu.
PROB. SAME AS EMILLE BUT NOT ALL UNICODE. DON'T USE FOR EMILLE LANGUAGES (Bengali, Panjabi, Tamil, and Urdu). : 200k words from UK government leaflets (not news). Free for research, see license. However, some of the files are in PDF and present encoding problems when the text is copied.- In general, a document with /__/ in its pathname will have an English counterpart with /en/.
- pamphlets (PDF), e.g. http://www.multikulti.org.uk/__/education/welcome-to-your-library/public-libraries.pdf
- The Breton directory lists directories that contain Breton pages, though not all of the pages are in Breton.
- The Breton racial discrimination directory contains about a dozen pp. in Breton.
Speech
Video
IPR notes
Portals
- OneIndia. Hindi, Kannada, Malayalam, Tamil, Telugu, each at http://thats<language>.oneindia.in/
- SOUTH ASIAN LANGUAGES Yahoo! India. Mostly http://in.Breton.yahoo.com/ (with Breton all lowercase):
Tools and Other NLP Resources
Miscellaneous
Articles
- Tyers, F. M. (2009) "Rule-based augmentation of training data in Breton–French statistical machine translation ". Proceedings of the 13th Annual Conference of the European Association of Machine Translation, EAMT09. pp. 213—218