Breton/Breton

From the LDC Language Resource Wiki

(Difference between revisions)
(Lexicon)
m
 
(38 intermediate revisions not shown)
Line 1: Line 1:
-
{{:Under construction}}
+
{{Under construction}}
[[Main_Page|Home]] > [[Breton]]
<center><font size=7>BREZHONEG</font>
Line 6: Line 6:
<font size=7>BRETON</font></center>
-
[[Category:Lwiki:Use|Breton]]
 
Line 13: Line 12:
==General==
 +
<small>[[User:Ftyers|Ftyers]] 15:39, 22 April 2010 (UTC)</small>
===Language summary===
-
<nowiki>2010</nowiki>-<nowiki>04</nowiki>-<nowiki>7</nowiki>
+
 
* ISO 639-3 code: bre
*Population:
Line 22: Line 22:
** 1,200,000 people know Breton but do not use it regularly.
** Population total all countries: 500,045.
-
** 188,568 people ([http://ouiaubreton.com/ Ya d'ar brezhoneg], 2010)
 
*Also spoken in: -
*Alternate names: -
-
*Dialects:  -
+
*Dialects:  Leoneg (Leonais), Tregerieg (Tregorrois), Gwenedeg (Vannetais), Kerneveg (Cornouaillais).
*Classification: [http://www.ethnologue.com/show_lang_family.asp?code=bre Indo-European, Celtic, Insular, Brythonic]
Line 31: Line 30:
====Writing====
-
* [[Image:redRx.gif]] [http://www.omniglot.com/search.htm Omniglot]. Search '''www.omniglot.com''', not Web.
+
* [http://www.omniglot.com/writing/breton.htm Omniglot: Breton]
==Linguistic resources==
Line 38: Line 37:
===Grammar===
 +
 +
* Ian Press (1986) ''A Grammar of Modern Breton'' (Mouton Grammar Library) ISBN 978-3-110105-79-7
 +
* Roparz Hemon (translated by Michael Everson) (2007) ''Breton Grammar''  (Cathair na Mart: Evertype) ISBN 978-1-904808-11-4
===Lexicon===
 +
* [http://br.wiktionary.org/ Wiktionary]. Monolingual. 12,151 entries. {{CC-BY-SA}},{{GFDL}} {{si|[[User:Mamandel|Mamandel]] 19:35, 3 May 2010 (UTC)}}
 +
====Morphological====
-
* {{GFDL}} [http://br.wiktionary.org/wiki/Rummad:Brezhoneg Breton Wiktionary :: Rummad:Brezhoneg]
+
====Bilingual====
-
* {{GFDL}} [http://fr.wiktionary.org/wiki/Catégorie:breton French Wiktionary :: Catégorie::breton ]
+
-
* {{GFDL}} [http://en.wiktionary.org/wiki/Category:Breton_language English Wiktionary :: Category:Breton]
+
 +
* [http://meskach.free.fr/arbo/dico/tomaz.html Le Geriadur Tomaz (Breton–French)] (~33,000 entries) {{GPL}}
 +
 +
====Multilingual====
 +
 +
* [http://br.wiktionary.org/wiki/Rummad:Brezhoneg Breton Wiktionary :: Rummad:Brezhoneg] {{GFDL}}
 +
* [http://fr.wiktionary.org/wiki/Catégorie:breton French Wiktionary :: Catégorie:breton] {{GFDL}}
 +
* [http://en.wiktionary.org/wiki/Category:Breton_language English Wiktionary :: Category:Breton] {{GFDL}}
====Topical word lists====
=====Names=====
-
*  [[Image:redRx.gif]] [http://www.babynology.com/Breton_babynames.html Babynology]: List of Breton baby names in Roman transliteration
 
===Monographs===
===Linguistic portals and bibliographies===
-
 
-
*  [[Image:redRx.gif]] LINGUIST List resource pages
 
-
** (GET LINGUIST PAGE FOR Breton
 
-
*  SIL Bibliography
 
-
** [http://www.ethnologue.com/show_language.asp?code=bre Breton]
 
-
 
-
 
==Data Sources==
===Monolingual Text===
-
*[[Image:redRx.gif]]  <font color=red mam>'''''EMILLE ONLY FOR: Bengali, Panjabi, Tamil, and Urdu'''''</font mam> [http://www.ling.lancs.ac.uk/corplang/emille/ EMILLE] corpus. Approximately [[Lwiki:EMILLE corpus|'''NUMBERS HERE''']] words. Free license for non-profit research use. [http://www.emille.lancs.ac.uk/manual.pdf Documentation]
+
 
 +
* [http://br.wikipedia.org/ Wikipedia]. Monolingual. 33,174 entries. {{CC-BY-SA}},{{GFDL}} {{si|[[User:Mamandel|Mamandel]] 19:39, 3 May 2010 (UTC)}}
====News====
* [http://www.agencebretagnepresse.com/index.php?langue=bzh Agence Bretagne Presse]
-
* [http://bremaik.free.fr/ Bremaik] (weekly news articles)
+
* [http://bremaik.free.fr/ Bremaik] (weekly news articles) {{GPL}}
====Blogs====
===Parallel Text===
-
*[[Image:redRx.gif]]  [http://www.ling.lancs.ac.uk/corplang/emille/ EMILLE] corpus. <font color=red mam>'''ONLY FOR: Bengali, Panjabi, Tamil, and Urdu'''</font mam> 200,000 words of text in English (information leaflets from the UK Government and various local authorities) with Breton translation. Free license for non-profit research use.
+
 
-
*[[Image:redRx.gif]] <font color=red mam> [http://www.multikulti.org.uk/__/ MultiKulti]  '''Langs listed: Albanian, Arabic, Bengali, Chinese, English, Farsi, French, Gujarati, Somali, Spanish, Portuguese, Turkish, Urdu.<br>PROB. SAME AS EMILLE BUT NOT ALL UNICODE. DON'T USE FOR EMILLE LANGUAGES (Bengali, Panjabi, Tamil, and Urdu).''' : 200k words from UK government leaflets (not news). Free for research, see license. However, some of the files are in PDF and present encoding problems when the text is copied.
+
* [http://elx.dlsi.ua.es/~fran/brfr_OAB_corpus Ofis ar Brezhoneg Aligned Corpus of Breton–French] (30,993 aligned sentences, in [[NLP Resources#TMX|TMX]] and plain-text formats) {{GPL}}
-
** In general, a document with '''/__/''' in its pathname will have an English counterpart with '''/en/'''.
+
-
**pamphlets (PDF), e.g. http://www.multikulti.org.uk/__/education/welcome-to-your-library/public-libraries.pdf
+
-
** The [http://www.multikulti.org.uk/__/index.html Breton directory] lists directories that contain Breton pages, though not all of the pages are in Breton.
+
-
** The [http://www.multikulti.org.uk/__/racism-discrimination/index.html Breton racial discrimination directory] contains about a dozen pp. in Breton.</font mam>
+
===Speech===
Line 91: Line 89:
==Portals==
-
*[[Image:redRx.gif]]  [http://www.oneindia.in/ OneIndia]. Hindi, Kannada, Malayalam, Tamil, Telugu, each at '''<nowiki>http://thats<language>.oneindia.in/</nowiki>'''
 
-
*[[Image:redRx.gif]]  <font color=red>SOUTH ASIAN LANGUAGES</font> [http://in.yahoo.com/ Yahoo! India]. Mostly http://in.Breton.yahoo.com/  (with Breton all lowercase):
 
-
** [http://in.jagran.yahoo.com Hindi ("jagran")]
 
-
** [http://in.tamil.yahoo.com Tamil]
 
-
** [http://in.gujarati.yahoo.com Gujarati]
 
-
** [http://in.kannada.yahoo.com Kannada]
 
-
** [http://in.malayalam.yahoo.com Malayalam]
 
-
** [http://in.telugu.yahoo.com Telugu]
 
-
** [http://in.punjabi.yahoo.com Punjabi]
 
==Tools and Other NLP Resources==
 +
 +
===Morphological analysis===
 +
 +
===Morphological disambiguation===
 +
 +
===Machine translation===
 +
 +
* [http://sourceforge.net/projects/apertium/files/ Apertium :: apertium-br-fr] {{GPL}}
 +
 +
===Articles===
 +
 +
* Tyers, F. M. (2010) "[http://www.mt-archive.info/EAMT-2010-Tyers.pdf Rule-based Breton to French machine translation]". ''Proceedings of the 14th Annual Conference of the European Association for Machine Translation, EAMT10''. pp. 174&ndash;181.
 +
* Tyers, F. M. (2009) "[http://www.mt-archive.info/EAMT-2009-Tyers-2.pdf Rule-based augmentation of training data in Breton–French statistical machine translation]". ''Proceedings of the 13th Annual Conference of the European Association for Machine Translation, EAMT09''. pp. 213&ndash;218.
==Miscellaneous==
 +
 +
 +
 +
 +
[[Category:Breton|Breton]]

Latest revision as of 19:00, 3 May 2011

THIS PAGE IS UNDER CONSTRUCTION



BREZHONEG


BRETON





General

Ftyers 15:39, 22 April 2010 (UTC)

Language summary

  • ISO 639-3 code: bre
  • Population:
    • 500,000 in France (1989 International Committee for the Defense of the Breton Language).
    • 1,200,000 people know Breton but do not use it regularly.
    • Population total all countries: 500,045.
  • Also spoken in: -
  • Alternate names: -
  • Dialects: Leoneg (Leonais), Tregerieg (Tregorrois), Gwenedeg (Vannetais), Kerneveg (Cornouaillais).
  • Classification: Indo-European, Celtic, Insular, Brythonic

Linguistic notes

Writing

Linguistic resources

Overview

Grammar

Lexicon

Morphological

Bilingual

Multilingual

Topical word lists

Names

Monographs

Linguistic portals and bibliographies

Data Sources

Monolingual Text

News

Blogs

Parallel Text

Speech

Video

IPR notes

Portals

Tools and Other NLP Resources

Morphological analysis

Morphological disambiguation

Machine translation

Articles

Miscellaneous
