Breton/Breton

From the LDC Language Resource Wiki

(Difference between revisions)
Jump to: navigation, search
(Portals)
(Linguistic resources)
Line 38: Line 38:
===Grammar===
===Grammar===
 +
 +
* Ian Press (1986) ''A Grammar of Modern Breton'' (Mouton Grammar Library) ISBN: 978-3-110105-79-7
 +
* Roparz Hemon (translated by Michael Everson) (2007) ''Breton Grammar''  (Cathair na Mart: Evertype) ISBN: 978-1-904808-11-4
===Lexicon===
===Lexicon===

Revision as of 10:23, 17 April 2010

Under construction Home > Breton

BREZHONEG


BRETON



Contents

General

Language summary

2010-04-7

  • ISO 639-3 code: bre
  • Population:
    • 500,000 in France (1989 International Committee for the Defense of the Breton Language).
    • 1,200,000 know Breton who do not regularly use it.
    • Population total all countries: 500,045.
    • 188,568 people (Ya d'ar brezhoneg, 2010)
  • Also spoken in: -
  • Alternate names: -
  • Dialects: Leoneg (Leonais), Tregerieg (Tregorrois), Gwenedeg (Vannetais), Kerneveg (Cornouaillais).
  • Classification: Indo-European, Celtic, Insular, Brythonic

Linguistic notes

Writing

Linguistic resources

Overview

Grammar

  • Ian Press (1986) A Grammar of Modern Breton (Mouton Grammar Library) ISBN: 978-3-110105-79-7
  • Roparz Hemon (translated by Michael Everson) (2007) Breton Grammar (Cathair na Mart: Evertype) ISBN: 978-1-904808-11-4

Lexicon

Topical word lists

Names

Monographs

Linguistic portals and bibliographies

Data Sources

Monolingual Text

  • Image:redRx.gif EMILLE ONLY FOR: Bengali, Panjabi, Tamil, and Urdu EMILLE corpus. Approximately NUMBERS HERE words. Free license for non-profit research use. Documentation

News

Blogs

Parallel Text

  • Image:redRx.gif EMILLE corpus. ONLY FOR: Bengali, Panjabi, Tamil, and Urdu 200,000 words of text in English (information leaflets from the UK Government and various local authorities) with Breton translation. Free license for non-profit research use.
  • Image:redRx.gif MultiKulti Langs listed: Albanian, Arabic, Bengali, Chinese, English, Farsi, French, Gujarati, Somali, Spanish, Portuguese, Turkish, Urdu.
    PROB. SAME AS EMILLE BUT NOT ALL UNICODE. DON'T USE FOR EMILLE LANGUAGES (Bengali, Panjabi, Tamil, and Urdu).
     : 200k words from UK government leaflets (not news). Free for research, see license. However, some of the files are in PDF and present encoding problems when the text is copied.

Speech

Video

IPR notes

Portals

Tools and Other NLP Resources

Miscellaneous

Articles

  • Tyers, F. M. (2009) "Rule-based augmentation of training data in Breton–French statistical machine translation ". Proceedings of the 13th Annual Conference of the European Association of Machine Translation, EAMT09. pp. 213—218
Personal tools