Panjabi/Panjabi
From the LDC Language Resource Wiki
Revision as of 16:29, 3 May 2010
PANJABI
(Eastern Panjabi, Gurmukhi)
General
This document pertains primarily to Eastern Panjabi (Gurmukhi). There is some material on Western Panjabi as well.
Dialects
Eastern Panjabi
(Information from Ethnologue, 2009-05-13)
- ISO 639-3 code: pan
- Spoken in: India: Punjab, Majhi in Gurdaspur and Amritsar districts, Bhatyiana in South Firozpur District; Rajasthan, Bhatyiana in north Ganganagar District; Haryana; Delhi; Jammu and Kashmir. Also spoken in Bangladesh and diaspora.
- Population: 27,109,000 in India
- Alternate names: Punjabi, Gurmukhi, Gurumukhi
- Dialects: Panjabi Proper, Majhi, Doab, Bhatyiana (Bhatneri, Bhatti), Powadhi, Malwa, Bathi. Western Panjabi is distinct from Eastern Panjabi, although there is a chain of dialects to Western Hindi (Urdu).
- Classification: Indo-European, Indo-Iranian, Indo-Aryan, Central zone, Panjabi
- Script: Gur(u)mukhi and Devanagari
Western Panjabi
(Information from Ethnologue, 2009-05-13)
- ISO 639-3 code: pnb
- Spoken in: Mainly in the Punjab area of Pakistan.
- Population: 60,647,207 in Pakistan (2000 WCD).
- Alternate names: Western Punjabi, Lahnda, Lahanda, Lahndi
- Dialects: There is a continuum of varieties between Eastern and Western Panjabi, and with Western Hindi and Urdu. 'Lahnda' is a name given earlier for Western Panjabi; an attempt to cover the dialect continuum between Hindko, Pahari-Potwari, and Western Panjabi in the north and Sindhi in the south.
- Classification: Indo-European, Indo-Iranian, Indo-Aryan, Northwestern zone, Lahnda
- Script: Perso-Arabic
Linguistic notes
Writing
Eastern Panjabi is usually written with the Brahmi-derived Gurmukhi script, and sometimes, especially by Hindus, with Devanagari. Western Panjabi is usually written in Shahmukhi, a variant of the Arabic writing system very similar to the writing system of Urdu.
Linguistic resources
Grammars
Lexicons
- Punjabonline English <-> Punjabi Dictionary. On-line; size unknown. English or Gurmukhi input. Gurbani encoding.
- Singh, Maya. The Panjabi dictionary. English <-> Panjabi. Lahore, Munshi Gulab Singh & Sons, 1895. Via Digital Dictionaries of South Asia. English, Gurmukhi, or romanized input. Unicode.
- Srigranth.org English <-> Panjabi. On-line; medium size? Gurbani encoding; clickable "keyboard" for Gurmukhi input.
- (CC-BY-SA), (GFDL) Wiktionary (Eastern Panjabi). Gurmukhi script. Unicode. Monolingual. 123 entries. [Mamandel 16:29, 3 May 2010 (UTC)]
Names
These sites do not distinguish names by sex.
- 5abi: Gurbani encoding.
- Babynology: List of Panjabi baby names in Roman transliteration. (Each name appears twice, once for each sex, since the site's general format is to list male and female names separately.)
- Sikh Names. Transliteration, with meanings.
- Sushmajee: About 1000 names. Transliteration.
Linguistic portals and bibliographies
- Bashir, Elena. Resources for the Study of Panjabi. SALRC. [Last modified Dec., 2006; most recent date in content is 2000]
- Indian Language Data Centre (ILDC), part of the Technology Development for Indian Languages project of the Indian Department of Information Technology.
- SIL Bibliography
- South Asia Language Resource Center at the University of Chicago (SALRC)
Encoding and Fonts
Before the development and general use of Unicode, computer use of Panjabi and other South Asian languages required special fonts that encoded each character in a single byte. Many of these fonts were specific to one website or another and used idiosyncratic encodings. To some extent that is still the case, so this page includes some such sites (see News) and some resources for specific fonts and encoding converters.
Encodings
Unicode
The Unicode range for Gurmukhi is 0A00-0A7F.
- Panjabi.html Penn State info page; Penn State chart of Unicode Entity Codes for the Gurmukhi (Punjabi) Script (including OS X and Windows keyboard entry)
- Exnet, Andy White. "This site hosts documents relating to the encoding of Indic scripts. Most documents contain a bias towards the Bengali script (due to my own preferances)." (Last updated 10th March 2003)
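A minimal sketch of how the Gurmukhi range given above (0A00-0A7F) can be used to check whether a piece of text is actually Unicode Gurmukhi, rather than Latin-range text that only looks like Gurmukhi through a legacy font. The function names are illustrative assumptions, not part of any of the resources listed here.

```python
# Bounds of the Gurmukhi block, as stated above: U+0A00 to U+0A7F.
GURMUKHI_START = 0x0A00
GURMUKHI_END = 0x0A7F


def is_gurmukhi(ch: str) -> bool:
    """Return True if the single character lies in the Gurmukhi block."""
    return GURMUKHI_START <= ord(ch) <= GURMUKHI_END


def gurmukhi_ratio(text: str) -> float:
    """Fraction of non-whitespace characters that fall in the Gurmukhi block."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    return sum(is_gurmukhi(c) for c in chars) / len(chars)
```

A page whose visible text is Gurmukhi but whose ratio is near zero is probably using one of the 8-bit font encodings described below.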
ISCII
The Bureau of Indian Standards supports its own encoding standard. See ISCII.
Gurbani
An 8-bit encoding used by a number of sites. Most of the script is in the lower half. Fonts available from SikhNet below.
Satluj
An 8-bit encoding, with the script in the upper half; x20-x7f is used for punctuation, special characters, digits (both Western and Panjabi), and so on. Used by Daily Ajit, Awandhar. Font available from Sikh Students Federation below.
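The difference described above (Gurbani keeps most of the script in the lower half of the byte range, Satluj in the upper half) suggests a rough heuristic for guessing which family of 8-bit encoding a raw byte sample uses. The sketch below is only that: the function names, the labels, and the 0.5 threshold are assumptions made for illustration, and ordinary English text would also be reported as "gurbani-like".

```python
def upper_half_ratio(data: bytes) -> float:
    """Fraction of non-whitespace bytes in the upper half (0x80-0xFF)."""
    payload = [b for b in data if b not in b" \t\r\n"]
    if not payload:
        return 0.0
    return sum(b >= 0x80 for b in payload) / len(payload)


def guess_legacy_encoding(data: bytes, threshold: float = 0.5) -> str:
    """Crude guess: mostly upper-half bytes suggests a Satluj-style layout.

    The threshold is an arbitrary assumption, not a measured value.
    """
    if upper_half_ratio(data) >= threshold:
        return "satluj-like"
    return "gurbani-like"
```

Any real classifier would need sample text from the sites concerned; this only encodes the lower-half versus upper-half distinction.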
Fonts
- Alan Wood’s Unicode Resources: Unicode fonts for Windows computers; search for Gurmukhi.
- ILDC. PNOT-Amar Normal, a Unicode font.
- SALRC:
- Gurmukhi fonts, most of them available for free download
- Input Schemes and Keyboard Layouts
- information about Mac vs. PC vs. Linux rendering issues
- SikhNet. A wide variety of styles in the 8-bit Gurbani encoding, many credited to Kulbir S. Thind, M.D.
- Sikh Students Federation offers free download of the Satluj font.
- Wazu Japan's Gallery of Unicode Gurmukhi fonts, and test page
Conversion
- GUCA: Gurmukhi Unicode Conversion Application. GNU GPL. Requires Microsoft .NET Framework. Converts ASCII encoded, font-based Gurmukhi text based on Dr. Thind's fonts (e.g. AnmolLipi, GurbaniLipi fonts) into Unicode. Also includes a custom mapping engine to add encodings. -- Although the site for "Dr. Thind's fonts" now uses Unicode, many other sites still use these 8-bit encodings. See SikhNet, above.
- Unicodify: From Lancaster University, producers of the EMILLE corpus. For Windows; source code available.
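Converters of this kind are essentially table-driven: each byte of the font encoding is mapped to a Unicode code point, with some reordering where the font places a vowel sign before its consonant. The sketch below illustrates the idea only. The mapping table is a small HYPOTHETICAL sample, not the actual table of GUCA, Unicodify, or any particular font, and would have to be checked against the real font chart before use.

```python
# HYPOTHETICAL sample table (an assumption for illustration only):
# Latin-range font characters mapped to Gurmukhi code points.
TOY_MAP = {
    "g": "\u0A17",  # GA
    "r": "\u0A30",  # RA
    "m": "\u0A2E",  # MA
    "s": "\u0A38",  # SA
    "K": "\u0A16",  # KHA
    "u": "\u0A41",  # vowel sign U
    "I": "\u0A40",  # vowel sign II
    "i": "\u0A3F",  # vowel sign I
}

# Assumed to be typed BEFORE the consonant in the font encoding,
# whereas Unicode stores the vowel sign AFTER the consonant.
PREPOSED = {"i"}


def font_to_unicode(text: str, table=TOY_MAP, preposed=PREPOSED) -> str:
    """Map font-encoded text to Unicode, moving pre-posed signs after the next character."""
    out = []
    pending = None  # a pre-posed vowel sign waiting for its consonant
    for ch in text:
        if ch in preposed:
            if pending is not None:
                out.append(pending)  # flush an unattached earlier sign
            pending = table[ch]
            continue
        out.append(table.get(ch, ch))  # unmapped characters pass through
        if pending is not None:
            out.append(pending)
            pending = None
    if pending is not None:
        out.append(pending)
    return "".join(out)
```

With this sample table, `font_to_unicode("gurmuKI")` yields the seven code points of ਗੁਰਮੁਖੀ. A production converter needs a full table per font plus handling for conjuncts and nasalization marks, which this sketch omits.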
Transliteration
- Indian Language Converter. Type in Roman characters according to the Gurmukhi character chart on the page and get Gurmukhi text and HTML. On-web or download with GNU GPL. E.g.:
- Roman input: guramukhee
- Gurmukhi output: ਗੁਰਮੁਖੀ
- HTML output: ਗੁਰਮੁਖੀ<br/>
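The HTML output in the example above is presumably a string of numeric character references that a browser renders as Gurmukhi; that is an assumption about the converter, not something confirmed here. A minimal sketch of producing such references from Unicode text (the function name is illustrative):

```python
import html


def to_numeric_entities(text: str) -> str:
    """Encode every character as a decimal HTML numeric character reference."""
    return "".join(f"&#{ord(ch)};" for ch in text)


# The word from the example above, spelled out by code point:
# GA, sign U, RA, MA, sign U, KHA, sign II.
word = "\u0A17\u0A41\u0A30\u0A2E\u0A41\u0A16\u0A40"
encoded = to_numeric_entities(word)

# html.unescape reverses the encoding, so the round trip restores the word.
decoded = html.unescape(encoded)
```

Here `encoded` is `&#2583;&#2625;&#2608;&#2606;&#2625;&#2582;&#2624;`, which is pure ASCII and therefore survives pages that are not served as Unicode.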
Data Sources
Monolingual Text
- EMILLE corpus. Free license for non-profit research use. Approximately 15,600,000 words. Documentation
- (CC-BY-SA), (GFDL) Wikipedia (Eastern Panjabi). Gurmukhi script. Unicode. 1,636 entries. [Mamandel 16:29, 3 May 2010 (UTC)]
- (CC-BY-SA), (GFDL) Wikipedia (Western Panjabi). Perso-Arabic script. Unicode. 4,215 articles. [Mamandel 16:29, 3 May 2010 (UTC)]
News
- 5abi. Partly Unicode, partly Gurbani.
- Ajit Weekly. Unicode.
- Daily Ajit, Awandhar. Website uses Satluj encoding; online version of newspaper has only images of text.
- Quami Ekta. Unicode.
- Sanjh Savera. Panjabi newspaper from Canada. Gurbani encoding.
- Newspaper portals:
- India Press: Punjabi Newspapers
Other
- Academy of the Punjab in North America: English, Gurmukhi, and Shahmukhi, but many of the texts are imaged.
- Eh Din. Sikh website, has archives.
- Rationalist Society of India. Gurbani encoding.
Parallel Text
Civic information and advice
- Choose and Book: Introduction. "A [UK] national electronic referral service which gives patients a choice of place, date and time for their first outpatient appointment in a hospital or clinic." PDF, Gurbani encoding.
- Domestic Violence Information. Leaflets from various agencies in many languages. Unicode.
- Eastbourne Borough Council. Links to a lot of gov and org sites in the UK with pages in many languages.
- EMILLE corpus. 200,000 words of text in English (information leaflets from the UK Government and various local authorities) with Eastern Panjabi translation. Free license for non-profit research use.
- Health and Safety Executive (UK). About a dozen government leaflets. PDF, Gurbani.
- Law Society of England and Wales. Thirteen guides to common legal problems, parallel in English and about 16 other languages. (Two more guides promise "other languages available soon".) [2009-06-23]
- Victim Support (UK). Seven leaflets. PDF, Gurbani encoding.
IT
- GUCA. Panjabi computing resource website in parallel English and Panjabi.
- Red Hat Enterprise Linux has a lot of documentation in parallel text, including
- Reference Guide
- Release Notes (3.7.0 and up are bilingual)
- RHEL-4 Manuals
Religious: Christian
- Bible. Amazon.com links for print editions.
- Religious Passages from Cloverdale Bibleway Church: Nine sermons of William Marrion Branham. Apparently a non-Gurbani Latin encoding. PDF 350-645 kB (mean 431), est. 150k words. Paginated printing and binding, 2-up and 2-sided [pp. 0+1,2+23; 22+3,4+21; 20+5,6+19...]. Panjabi, English
Religious: Sikh
- Guru Granth Sahib. Sikh holy texts, word lists, concordances, interlinear translations. Panjabi (Gurmukhi, Shahmukhi, Devanagari) and English. Some files Unicode, but some Gurbani encoding.
- Punjabi Online: Mool Mantar. Religious text. Interlinear, with Panjabi in Gurbani encoding, grouped as (Panjabi1, transliteration1, Panjabi2, English). Panjabi2 may be commentary on Panjabi1, with only the commentary translated.
- Sikhnet: GuruGranthSahib. Religious text. Gurbani encoding.
- Sridasam. Religious text. 2326 pages. Unicode. Parallel text verse by verse: Panjabi and Hindi apparently complete, English translation only through p. 1466. [2009-07-23]
Video
- Alpha ETC Punjabi: Programmes, Schedule [2009-07-23]
- APNA Channel "is a satellite channel broadcasting from Thailand, and is envisaged as a news channel telecasting in Punjabi language, internationally footage to be in 127 countries." (Website appears to be mostly in English. News is all English text feeds. Video programming is apparently Panjabi with Shahmukhi titles.) [2009-07-23]
- Awaaz-E-Watan ("voice of the homeland"). 1 hour per week. Cable channels in Fresno, CA and vicinity. [2009-07-23]
- Channel Punjabi Television (Canada). "music, concerts, television shows and ringtones". [2009-07-23]
- DD-Punjabi Channel, Broadcasting Corporation of India. Schedule.
- Jus Punjabi: "1st American Punjabi Channel"
- MH1: music. "also contemplating to start a Hindi music and entertainment channel. ... also plans to introduce a medical channel [that] would primarily cater to Punjab. (19/01/2009)" [2009-07-23]
- Punjab Today: "the leading 24-hour Punjabi news channel... It offers varied programming content such as Bollywood gossip, politics, and heritage and culture." [2009-07-23]
- RAVi TV: US, "programmed by and for Punjabi natives residing in the United States". "Live streaming is temporarily disabled due to signal piracy by unauthorized websites. We will resume Ravi feed in few days after implementing new security system for the website." [2009-07-06; same message 2009-07-23]
Portals
- Academy of the Punjab in North America: Literature, forums, other. English, Gurmukhi, and Shahmukhi, but many of the texts are imaged.
- NRI Zone (Non-Resident Indian). Does not seem to have been updated since Jan 6 2008.
- Punjabi Network: Forums, blogs, videos? Panjabi, romanized Panjabi, and English. Signup required for some features.
- Yahoo! India. Unicode.