Breton/Breton
From the LDC Language Resource Wiki
(Difference between revisions)
(→Lexicon) |
(→Monolingual Text) |
||
Line 67: | Line 67: | ||
===Monolingual Text=== | ===Monolingual Text=== | ||
- | |||
====News==== | ====News==== | ||
* [http://www.agencebretagnepresse.com/index.php?langue=bzh Agence Bretagne Presse] | * [http://www.agencebretagnepresse.com/index.php?langue=bzh Agence Bretagne Presse] | ||
- | * [http://bremaik.free.fr/ Bremaik] (weekly news articles) | + | * {{GPL}} [http://bremaik.free.fr/ Bremaik] (weekly news articles) |
====Blogs==== | ====Blogs==== |
Revision as of 10:36, 17 April 2010
Under construction Home > Breton
Contents |
General
Language summary
2010-04-7
- ISO 639-3 code: bre
- Population:
- 500,000 in France (1989 International Committee for the Defense of the Breton Language).
- 1,200,000 know Breton who do not regularly use it.
- Population total all countries: 500,045.
- 188,568 people (Ya d'ar brezhoneg, 2010)
- Also spoken in: -
- Alternate names: -
- Dialects: Leoneg (Leonais), Tregerieg (Tregorrois), Gwenedeg (Vannetais), Kerneveg (Cornouaillais).
- Classification: Indo-European, Celtic, Insular, Brythonic
Linguistic notes
Writing
Linguistic resources
Overview
Grammar
- Ian Press (1986) A Grammar of Modern Breton (Mouton Grammar Library) ISBN: 978-3-110105-79-7
- Roparz Hemon (translated by Michael Everson) (2007) Breton Grammar (Cathair na Mart: Evertype) ISBN: 978-1-904808-11-4
Lexicon
- Morphological
- Bilingual
- (GPL) Le Geriadur Tomaz (Breton--French) (~33,000 entries)
- Multilingual
- (GFDL) Breton Wiktionary :: Rummad:Brezhoneg
- (GFDL) French Wiktionary :: Catégorie:breton
- (GFDL) English Wiktionary :: Category:Breton
Topical word lists
Names
Monographs
Linguistic portals and bibliographies
Data Sources
Monolingual Text
News
- Agence Bretagne Presse
- (GPL) Bremaik (weekly news articles)
Blogs
Parallel Text
- EMILLE corpus. ONLY FOR: Bengali, Panjabi, Tamil, and Urdu 200,000 words of text in English (information leaflets from the UK Government and various local authorities) with Breton translation. Free license for non-profit research use.
- MultiKulti Langs listed: Albanian, Arabic, Bengali, Chinese, English, Farsi, French, Gujarati, Somali, Spanish, Portuguese, Turkish, Urdu.
PROB. SAME AS EMILLE BUT NOT ALL UNICODE. DON'T USE FOR EMILLE LANGUAGES (Bengali, Panjabi, Tamil, and Urdu). : 200k words from UK government leaflets (not news). Free for research, see license. However, some of the files are in PDF and present encoding problems when the text is copied.- In general, a document with /__/ in its pathname will have an English counterpart with /en/.
- pamphlets (PDF), e.g. http://www.multikulti.org.uk/__/education/welcome-to-your-library/public-libraries.pdf
- The Breton directory lists directories that contain Breton pages, though not all of the pages are in Breton.
- The Breton racial discrimination directory contains about a dozen pp. in Breton.
Speech
Video
IPR notes
Portals
Tools and Other NLP Resources
Miscellaneous
Articles
- Tyers, F. M. (2009) "Rule-based augmentation of training data in Breton–French statistical machine translation ". Proceedings of the 13th Annual Conference of the European Association of Machine Translation, EAMT09. pp. 213—218