Breton/Breton

From the LDC Language Resource Wiki

(Difference between revisions)
Jump to: navigation, search
(Lexicon)
(Encoding and Fonts)
Line 62: Line 62:
** [http://www.ethnologue.com/show_language.asp?code=bre Breton]
** [http://www.ethnologue.com/show_language.asp?code=bre Breton]
-
==Encoding and Fonts==
 
-
<font color=red mam>Before the development and general use of Unicode, computer use of Breton and other South Asian languages required special fonts using only one byte. Many of these fonts were specific to one website or another and used idiosyncratic encodings. To some extent that is still the case; and so this page includes some such sites (see [[#News|News]]), and some resources for specific fonts and encoding converters.</font mam>
 
-
===Encodings===
 
-
====Unicode====
 
-
[[Image:redRx.gif]] The Unicode range for Breton is [http://www.unicode.org/charts/PDF/U____.pdf ____-____].
 
-
 
-
*[[Image:redRx.gif]]  [http://tlt.its.psu.edu/suggestions/international/bylanguage/Breton.html Penn State info page]; [http://tlt.psu.edu/suggestions/international/bylanguage/Bretonchart.html Penn State chart of Unicode Entity Codes for ____] (including OS X and Windows keyboard entry)
 
-
*[[Image:redRx.gif]]  <font color=red mam>[http://www.exnet.btinternet.co.uk/ Exnet], Andy White. "This site hosts documents relating to the encoding of Indic scripts. Most documents contain a bias towards the Bengali script (due to my own preferances)." (Last updated 10th March 2003)</font mam>
 
-
 
-
====ISCII====
 
-
See [[Lwiki:ISCII]]. <font color=red>FOR SOUTH ASIAN LANGUAGES ONLY</font>
 
-
 
-
===Fonts===
 
-
<font color=red mam>
 
-
'''Lists of Unicode fonts'''
 
-
*[[Image:redRx.gif]]  [http://www.alanwood.net/unicode Unicode and Multilingual Support in HTML, Fonts, Web Browsers and Other Applications]. In Alan Wood’s Unicode Resources.
 
-
** <nowiki>http://www.alanwood.net/unicode/fonts_windows#Breton.html</nowiki>
 
-
**  ?? Search for ''Breton''.??
 
-
*[[Image:redRx.gif]]  [http://www.wazu.jp/ Wazu Japan's Gallery of Unicode fonts], and ??[http://www.wazu.jp/gallery/Test_Breton.html test page]??
 
-
** [http://www.wazu.jp/#test_pages Unicode test pages]
 
-
 
-
 
-
*[[Image:redRx.gif]]  The South Asia Language Resource Center of the University of Chicago has links to
 
-
** [http://salrc.uchicago.edu/resources/fonts/available/___/#fonts Breton fonts], most of them available for free download
 
-
** [http://salrc.uchicago.edu/resources/fonts/available/___/#layouts Input Schemes and Keyboard Layouts]
 
-
** [http://salrc.uchicago.edu/resources/fonts/available/___/#mac information about Mac vs. PC vs. Linux rendering issues]
 
-
</font mam>
 
-
 
-
 
-
 
-
====Conversion====
 
-
*[[Image:redRx.gif]]  [http://www.lancs.ac.uk/staff/hardiea/unicodify.htm  Unicodify]: From Lancaster University, producers of the Emille corpus. For Windows; source code available.
 
-
**<font color=red mam>MOSTLY FOR SOUTH ASIAN LANGUAGES-- CHECK BEFORE USING</font> "a suite of programs for converting text in a variety of 8-bit encodings to Unicode (using the UTF-16 encoding).<br> Unicodify was particularly designed to handle HTML-based text using non-ISCII 8-bit fonts to render South Asian scripts. However, elements of the suite can map other types of non-ASCII 8-bit encodings, such as Latin-2, ISCII and PASCII."
 
-
 
-
====Transliteration====
 
==Data Sources==
==Data Sources==

Revision as of 22:11, 16 April 2010

Under construction Home > Breton

BREZHONEG


BRETON



Contents

General

Language summary

2010-04-7

  • ISO 639-3 code: bre
  • Population:
    • 500,000 in France (1989 International Committee for the Defense of the Breton Language).
    • 1,200,000 know Breton who do not regularly use it.
    • Population total all countries: 500,045.
    • 188,568 personnes (Ya d'ar brezhoneg)
  • Also spoken in: -
  • Alternate names: -
  • Dialects: -
  • Classification: Indo-European, Celtic, Insular, Brythonic

Linguistic notes

Writing

  • Image:redRx.gif Omniglot. Search www.omniglot.com, not Web.

Linguistic resources

Overview

Grammar

Lexicon

[Dictionaries, word and phrase lists, and translation tools]


Topical word lists

Names
  • Image:redRx.gif Babynology: List of Breton baby names in Roman transliteration

Monographs

Linguistic portals and bibliographies

  • Image:redRx.gif LINGUIST List resource pages
    • (GET LINGUIST PAGE FOR Breton
  • SIL Bibliography


Data Sources

Monolingual Text

  • Image:redRx.gif EMILLE ONLY FOR: Bengali, Panjabi, Tamil, and Urdu EMILLE corpus. Approximately NUMBERS HERE words. Free license for non-profit research use. Documentation

News

  • Newspaper portals:
  • newspaper...

Blogs

Parallel Text

  • Image:redRx.gif EMILLE corpus. ONLY FOR: Bengali, Panjabi, Tamil, and Urdu 200,000 words of text in English (information leaflets from the UK Government and various local authorities) with Breton translation. Free license for non-profit research use.
  • Image:redRx.gif MultiKulti Langs listed: Albanian, Arabic, Bengali, Chinese, English, Farsi, French, Gujarati, Somali, Spanish, Portuguese, Turkish, Urdu.
    PROB. SAME AS EMILLE BUT NOT ALL UNICODE. DON'T USE FOR EMILLE LANGUAGES (Bengali, Panjabi, Tamil, and Urdu).
     : 200k words from UK government leaflets (not news). Free for research, see license. However, some of the files are in PDF and present encoding problems when the text is copied.

Speech

Video

IPR notes

Portals

Tools and Other NLP Resources

Miscellaneous

Personal tools