Pashto/Pashto
From the LDC Language Resource Wiki
m (→Writing) |
(→Literature: deleted * http://www.pashtopoetry.net/ Pashto Poetry: site squatted) |
||
(21 intermediate revisions not shown) | |||
Line 13: | Line 13: | ||
''Pashto'' belongs to the Iranian subfamily of Indo-European. Its vocabulary includes many loans, chiefly from its Persian and Indo-Aryan neighbors, and from Arabic, especially via Islam. | ''Pashto'' belongs to the Iranian subfamily of Indo-European. Its vocabulary includes many loans, chiefly from its Persian and Indo-Aryan neighbors, and from Arabic, especially via Islam. | ||
- | === | + | ===Dialects=== |
ISO 639-3 treats Pashto as a | ISO 639-3 treats Pashto as a | ||
Line 73: | Line 73: | ||
Like other scripts of Semitic origin, in most contexts Pashto indicates consonants only, except in special-purpose texts such as educational materials. As a result, the script is phonologically underspecified, and it is not in general possible to infer pronunciation from spelling. | Like other scripts of Semitic origin, in most contexts Pashto indicates consonants only, except in special-purpose texts such as educational materials. As a result, the script is phonologically underspecified, and it is not in general possible to infer pronunciation from spelling. | ||
+ | |||
+ | See [[#Identifying_Pashto|below]] for a list of Unicode Perso-Arabic characters that are probably unique to Pashto. | ||
* [http://www.omniglot.com/writing/pashto.htm Omniglot] | * [http://www.omniglot.com/writing/pashto.htm Omniglot] | ||
* [http://www.afghanan.net/pashto/pashto%20alifba.pdf Pashto Alphabets ''<nowiki>[i.e., letters]</nowiki>'' in Detail]. Perso-Arabic letters in all four positional forms, with Unicode name and code point, Pashto name with Roman transliteration, and languages using (Arabic, Pashto, Farsi). | * [http://www.afghanan.net/pashto/pashto%20alifba.pdf Pashto Alphabets ''<nowiki>[i.e., letters]</nowiki>'' in Detail]. Perso-Arabic letters in all four positional forms, with Unicode name and code point, Pashto name with Roman transliteration, and languages using (Arabic, Pashto, Farsi). | ||
*:Note: Not complete with respect to Unicode 5.1: lists only code points named "ARABIC LETTER ...", and not all of those. Has at least one typo ("U+0623 Arabic Letter Zain" [should be U+06<u>32</u>]). | *:Note: Not complete with respect to Unicode 5.1: lists only code points named "ARABIC LETTER ...", and not all of those. Has at least one typo ("U+0623 Arabic Letter Zain" [should be U+06<u>32</u>]). | ||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
- | |||
==Linguistic resources== | ==Linguistic resources== | ||
===Overview=== | ===Overview=== | ||
- | * MacKenzie, D.N. 1987. Pashto. [Transcription only.] In ''The World's Major Languages'', ed. Bernard Comrie, 1990, Oxford University Press; Chapter 26, pages 547-565. | + | * MacKenzie, D.N. 1987. Pashto. [Transcription only.] In ''The World's Major Languages'', ed. Bernard Comrie, 1990, Oxford University Press; Chapter 26, pages 547-565. ISBN 978-0195065114. |
* [http://www.lmp.ucla.edu/Profile.aspx?LangID=64&menu=004 UCLA Language Materials Project]. <small>''[Accessed 2009-08-6]''</small> | * [http://www.lmp.ucla.edu/Profile.aspx?LangID=64&menu=004 UCLA Language Materials Project]. <small>''[Accessed 2009-08-6]''</small> | ||
Line 129: | Line 105: | ||
* Roos-Keppel, George Olof, and Qazi Abdul Ghani Khan. 1901. ''A manual of Pushtu''. London: Sampson Low, Marston. | * Roos-Keppel, George Olof, and Qazi Abdul Ghani Khan. 1901. ''A manual of Pushtu''. London: Sampson Low, Marston. | ||
* Shafeev, D.A. 1964. ''A Short Grammatical Outline of Pashto''. Translated and edited by Herbert H. Paper. ''[Afghanistan. Transcription only.]'' Bloomington: Indiana University; The Hague: Mouton. | * Shafeev, D.A. 1964. ''A Short Grammatical Outline of Pashto''. Translated and edited by Herbert H. Paper. ''[Afghanistan. Transcription only.]'' Bloomington: Indiana University; The Hague: Mouton. | ||
- | * Tegey, Habibullah, and Robson, Barbara. 1996. ''A Reference Grammar of Pashto.'' [Afghanistan] Washington, DC: Center for Applied Linguistics. [http://www.eric.ed.gov/ERICWebPortal/Home.portal?_nfpb=true&ERICExtSearch_SearchValue_0=ED399825&searchtype=basic&ERICExtSearch_SearchType_0=kw&_pageLabel=RecordDetails&objectId=0900019b800ac2b6&accno=ED399825&_nfls=false ERIC #ED399825]. | + | * [http://www.eric.ed.gov/ERICWebPortal/contentdelivery/servlet/ERICServlet?accno=ED399825 Tegey, Habibullah, and Robson, Barbara. 1996.] ''A Reference Grammar of Pashto.'' [Afghanistan] Washington, DC: Center for Applied Linguistics. [http://www.eric.ed.gov/ERICWebPortal/Home.portal?_nfpb=true&ERICExtSearch_SearchValue_0=ED399825&searchtype=basic&ERICExtSearch_SearchType_0=kw&_pageLabel=RecordDetails&objectId=0900019b800ac2b6&accno=ED399825&_nfls=false ERIC #ED399825]. Developed with funding from Grant No. P017A50047-95 from the International Research and Studies Program of the US Department Of Education. |
===Lexicon=== | ===Lexicon=== | ||
* Morgenstierne, Georg. 2003. ''A new etymological vocabulary of Pashto''; compiled and edited by J. Elfenbein, D.N. MacKenzie and Nicholas Sims-Williams. Transliteration only. Wiesbaden: Reichert. | * Morgenstierne, Georg. 2003. ''A new etymological vocabulary of Pashto''; compiled and edited by J. Elfenbein, D.N. MacKenzie and Nicholas Sims-Williams. Transliteration only. Wiesbaden: Reichert. | ||
- | + | * [http://www.qamosona.com/download.php Qamosona English-Pashto Dictionary]. Ver. 1.0. 2005. ''[Afghanistan]''. {{hq|Based on the English to Pashto Dictionary, by Pashto Academy, Kabul, Afghanistan.}} About 22,000 words. Free download. Requirements: Windows XP, 2000, or Windows ME Arabic edition. | |
- | * [http://www.qamosona.com/download.php Qamosona English-Pashto Dictionary]. Ver. 1.0. 2005. ''[Afghanistan]''. | + | * [http://www.yorku.ca/twainweb/troberts/pashto/pashlex1.html Penzl, Herbert]. ''[Afghanistan]'' Online Version 1.0, released November 1998. {{hq|This dictionary contains all of the words from the glossary of Herbert Penzl's ''A grammar of Pashto: A descriptive study of the dialect of Kandahar, Afghanistan'' (Washington, DC: American Council of Learned Societies, 1955), pp. 154-165, which is available from [http://www.schoenhofs.com/ Schoenhof's Foreign Books]." Transliteration only. "An on-line key to the orthography is not yet available. In the meantime, please download the [http://www.yorku.ca/twainweb/troberts/pashto/lexicon.html Access database version] of this file, or consult Penzl (1955).}} |
- | * [http://www.yorku.ca/twainweb/troberts/pashto/pashlex1.html Penzl, Herbert]. ''[Afghanistan]'' Online Version 1.0, released November 1998. | + | |
* [http://www.eric.ed.gov/ERICWebPortal/contentdelivery/servlet/ERICServlet?accno=ED364083 Tegey, Habibullah, and Robson, Barbara]. 1993. Pashto-English Glossary for the CAL Pashto Materials. ''[Afghanistan]'' Washington, DC: Center for Applied Linguistics. Contract P017A90055. [http://www.eric.ed.gov/ERICWebPortal/Home.portal?_nfpb=true&ERICExtSearch_SearchValue_0=ED364083&searchtype=basic&ERICExtSearch_SearchType_0=kw&_pageLabel=RecordDetails&objectId=0900019b8009c82d&accno=ED364083&_nfls=false ERIC #ED364083]. Pashto script and transliteration. PDF, imaged text. <small>''[Note on pagination: Original page 87 (PDF p.96) is followed by original pp.97, 88-96, 98, and the rest in sequence.]''</small> | * [http://www.eric.ed.gov/ERICWebPortal/contentdelivery/servlet/ERICServlet?accno=ED364083 Tegey, Habibullah, and Robson, Barbara]. 1993. Pashto-English Glossary for the CAL Pashto Materials. ''[Afghanistan]'' Washington, DC: Center for Applied Linguistics. Contract P017A90055. [http://www.eric.ed.gov/ERICWebPortal/Home.portal?_nfpb=true&ERICExtSearch_SearchValue_0=ED364083&searchtype=basic&ERICExtSearch_SearchType_0=kw&_pageLabel=RecordDetails&objectId=0900019b8009c82d&accno=ED364083&_nfls=false ERIC #ED364083]. Pashto script and transliteration. PDF, imaged text. <small>''[Note on pagination: Original page 87 (PDF p.96) is followed by original pp.97, 88-96, 98, and the rest in sequence.]''</small> | ||
+ | * [http://ps.wiktionary.org Wiktionary]. Unicode. Monolingual. 613 entries {{CC-BY-SA}},{{GFDL}} {{si|[[User:Mamandel|Mamandel]] 16:31, 3 May 2010 (UTC)}} | ||
====Topical word lists==== | ====Topical word lists==== | ||
Line 164: | Line 140: | ||
===News=== | ===News=== | ||
- | |||
* [http://pa.azadiradio.org/ Azadi Radio]. Pashto service of Radio Free Europe / Radio Liberty | * [http://pa.azadiradio.org/ Azadi Radio]. Pashto service of Radio Free Europe / Radio Liberty | ||
* [http://www.bbc.co.uk/pashto/index.shtml BBC]. | * [http://www.bbc.co.uk/pashto/index.shtml BBC]. | ||
Line 180: | Line 155: | ||
* [http://www.pakhton.8k.com/ D'Zra Dardoona]. "Afflictions of the Heart: Poetic collections of Aminullah Zmaryalai". (imaged book pages) | * [http://www.pakhton.8k.com/ D'Zra Dardoona]. "Afflictions of the Heart: Poetic collections of Aminullah Zmaryalai". (imaged book pages) | ||
* [http://www.khyber.org/pashtolanguage.shtml Khyber.org]. Proverbs, short stories (some in English), jokes, poetry. | * [http://www.khyber.org/pashtolanguage.shtml Khyber.org]. Proverbs, short stories (some in English), jokes, poetry. | ||
- | |||
* [http://www.afghanan.net/pashto/landay/ Landay]. Traditional couplets, with English translation. (imaged text) (The "table of contents" lists the categories in Pashto, transcription, and English translation, but the links to the text pages are in the corresponding frame on the right, labeled only in Pashto.) | * [http://www.afghanan.net/pashto/landay/ Landay]. Traditional couplets, with English translation. (imaged text) (The "table of contents" lists the categories in Pashto, transcription, and English translation, but the links to the text pages are in the corresponding frame on the right, labeled only in Pashto.) | ||
- | |||
===Miscellaneous=== | ===Miscellaneous=== | ||
Line 188: | Line 161: | ||
* [http://www.afghanpoet.4t.com/ Masood Peganai] - Salzburg (Austria) based Afghan site. Mostly imaged pages. | * [http://www.afghanpoet.4t.com/ Masood Peganai] - Salzburg (Austria) based Afghan site. Mostly imaged pages. | ||
* [http://www.matalona.tk/ PashtonMatalona]. | * [http://www.matalona.tk/ PashtonMatalona]. | ||
- | + | * [http://ps.wikipedia.org Wikipedia]. Unicode. 1,617 articles {{CC-BY-SA}},{{GFDL}} {{si|[[User:Mamandel|Mamandel]] 16:31, 3 May 2010 (UTC)}} | |
===Speech=== | ===Speech=== | ||
Line 196: | Line 169: | ||
==Portals== | ==Portals== | ||
- | * [http://www.afghan-network.net/Ethnic-Groups/pashtu-history.html Afghan Network]. <br>'''Malware warnings''': See ''Afghan News Channel'' above under [[#News|News]]. | + | * [http://www.afghan-network.net/Ethnic-Groups/pashtu-history.html Afghan Network]. <!-- <br>'''Malware warnings''': See ''Afghan News Channel'' above under [[#News|News]]. -- Malware warning removed. ~~~~ Google Safe Browsing says: "This site is not currently listed as suspicious. Over the past 90 days, afghan-network.net did not appear to function as an intermediary for the infection of any sites... or host[ed] malicious software."--> {{si|Malware warning removed. [[User:Mamandel|Mamandel]] 2010-07-19}} |
* [http://afghanan.net Afghanan.net]: Mostly in English. | * [http://afghanan.net Afghanan.net]: Mostly in English. | ||
* [http://afg-info.com Afghanistan Information Centre]: Many links, organized as General, Government & Political Groups, Organizations, Reconstruction and Development, Tourism, and so on. Some are labeled with the languages they are in, Pashto among them. | * [http://afg-info.com Afghanistan Information Centre]: Many links, organized as General, Government & Political Groups, Organizations, Reconstruction and Development, Tourism, and so on. Some are labeled with the languages they are in, Pashto among them. | ||
Line 203: | Line 176: | ||
* [http://www.loc.gov/rr/international/amed/afghanistan/afghanistan.html Portals to the World: Afghanistan] (Library of Congress): <small>''[Accessed 2009-08-10 : Last update January 9, 2006]''</small> | * [http://www.loc.gov/rr/international/amed/afghanistan/afghanistan.html Portals to the World: Afghanistan] (Library of Congress): <small>''[Accessed 2009-08-10 : Last update January 9, 2006]''</small> | ||
* [http://www.lmp.ucla.edu/Profile.aspx?menu=004&links=64 UCLA Language Materials Project] Pashto: Language & Culture | * [http://www.lmp.ucla.edu/Profile.aspx?menu=004&links=64 UCLA Language Materials Project] Pashto: Language & Culture | ||
+ | |||
+ | ==Tools and Other NLP Resources== | ||
+ | ===Identifying Pashto=== | ||
+ | The following Perso-Arabic characters are probably unique to Pashto: | ||
+ | |||
+ | {| cellspacing="4" cellpadding="2" border="1" | ||
+ | !align="left" width="10"| Code | ||
+ | !align="center" | Glyph | ||
+ | !align="left" | Unicode Name | ||
+ | |- | ||
+ | | U+0659 || align="center" | ٙ || ARABIC ZWARAKAY | ||
+ | |- | ||
+ | | U+067C || align="center" | ټ || ARABIC LETTER TEH WITH RING | ||
+ | |- | ||
+ | | U+0685 || align="center" | څ || ARABIC LETTER HAH WITH THREE DOTS ABOVE | ||
+ | |- | ||
+ | | U+0689 || align="center" | ډ || ARABIC LETTER DAL WITH RING | ||
+ | |- | ||
+ | | U+0693 || align="center" | ړ || ARABIC LETTER REH WITH RING | ||
+ | |- | ||
+ | | U+0696 || align="center" | ږ || ARABIC LETTER REH WITH DOT BELOW AND DOT ABOVE | ||
+ | |- | ||
+ | | U+069A || align="center" | ښ || ARABIC LETTER SEEN WITH DOT BELOW AND DOT ABOVE | ||
+ | |- | ||
+ | | U+06AB || align="center" | ګ || ARABIC LETTER KAF WITH RING | ||
+ | |- | ||
+ | | U+06BC || align="center" | ڼ || ARABIC LETTER NOON WITH RING | ||
+ | |} | ||
+ | |||
[[Category:Pashto|Pashto]] | [[Category:Pashto|Pashto]] |
Latest revision as of 17:21, 28 June 2011
پښتو
Contents |
General
Pashto belongs to the Iranian subfamily of Indo-European. Its vocabulary includes many loans, chiefly from its Persian and Indo-Aryan neighbors, and from Arabic, especially via Islam.
Dialects
ISO 639-3 treats Pashto as a macrolanguage with three varieties (Central, Northern, and Southern) (below). Ethnologue treats Waneci as a fourth, and others (e.g., MacKenzie and UCLA) analyze the dialectology still differently.
Language summary
(Information based on Ethnologue and ISO 639, 2009-08-06)
- ISO 639-3 code: pus (macrolanguage)
- Population: 20,304,734
- Alternate names: Pushto
- Dialects: Central Pashto [pst], Northern Pashto [pbu], Southern Pashto [pbt]
- Classification: Indo-European, Indo-Iranian, Iranian, Eastern, Southeastern
Central Pashto
Information based on Ethnologue, 2009-08-06
- ISO 639-3 code: pst
- Spoken in: Southern Pakistan (Wazirstan, Bannu, Karak, southern ethnic group territories and adjacent areas)
- Population: 7,920,000.
- Alternate names: Mahsudi
- Dialects: Waciri (Waziri), Bannuchi (Bannochi, Bannu).
- Script: Arabic.
Northern Pashto
Information based on Ethnologue, 2009-08-06
- ISO 639-3 code: pbu
- Spoken in: Pakistan (Afghanistan border, most of NWFP, Yusufzai, and Peshawar), Afghanistan (Central Ghilzai area), United Arab Emirates
- Population: 9,720,700
- Alternate names:
- Pakistan: Pakhto, Pashtu, Pushto, Yusufzai Pashto
- Afghanistan: Afghan, Pakhtoo, Pakhtu, Paktu. Called ‘Pakhtoon’ in the north, ‘Pashtoon’ in the south.
- United Arab Emirates: Pakhtoo, Pashtu, Passtoo, Pushto, Pusto
- Dialects:
- Pakistan: Ningraharian Pashto, Northeastern Pashto.
- Afghanistan: Northwestern Pakhto, Ghilzai, Durani.
- Script: Arabic.
Southern Pashto
Information based on Ethnologue, 2009-08-06
- ISO 639-3 code: pbt
- Spoken in: Pakistan (Balochistan, Quetta area), Afghanistan (Kandahar area), Iran (Khorasan on Afghanistan border east of Qa’en), Tajikistan, United Arab Emirates
- Population: 2,680,100.
- Alternate names:
- Pakistan: Pashtu, Pushto, Pushtu, Quetta-Kandahar Pashto
- Iran: Afghani, Paktu, Pashtu
- UAE: Afghan, Pakhtoo, Pakhtu, Paktu
- Dialects:
- Pakistan: Southeastern Pashto, Quetta Pashto
- Afghanistan: Southwestern Pashto, Kandahar Pashto (Qandahar Pashto)
- Script: Arabic.
Linguistic notes
Writing
Pashto is written with a Perso-Arabic script, adapted from Persian script, which in turn was adapted from Arabic. There is a classical standard, but there have been divergences, in different directions, in Pakistan and Afghanistan. Pakistan has instituted a number of orthographic innovations since officializing Pashto, while Pakistani writing shows occasional influence from Urdu, as well as sometimes representing the "hard" dialect forms phonetically instead of phonologically. In addition, the educational level and varying dialect background of writers inevitably introduces further variation in texts.
Like other scripts of Semitic origin, in most contexts Pashto indicates consonants only, except in special-purpose texts such as educational materials. As a result, the script is phonologically underspecified, and it is not in general possible to infer pronunciation from spelling.
See below for a list of Unicode Perso-Arabic characters that are probably unique to Pashto.
- Omniglot
- Pashto Alphabets [i.e., letters] in Detail. Perso-Arabic letters in all four positional forms, with Unicode name and code point, Pashto name with Roman transliteration, and languages using (Arabic, Pashto, Farsi).
- Note: Not complete with respect to Unicode 5.1: lists only code points named "ARABIC LETTER ...", and not all of those. Has at least one typo ("U+0623 Arabic Letter Zain" [should be U+0632]).
Linguistic resources
Overview
- MacKenzie, D.N. 1987. Pashto. [Transcription only.] In The World's Major Languages, ed. Bernard Comrie, 1990, Oxford University Press; Chapter 26, pages 547-565. ISBN 978-0195065114.
- UCLA Language Materials Project. [Accessed 2009-08-6]
Linguistic portals and bibliographies
- Languages On The Web -- Portal for Pashto
- LINGUIST List resource pages
- Pashto Academy, University of Peshawar
- Portals to the World: Language and Literature. [Afghanistan] (Library of Congress): [Accessed 2009-08-10 : Last update January 11, 2006]
- SIL Bibliography
- UCLA Language Materials Project.
Grammar
- Chavarría-Aguilar, O.L. 1962. Pashto Basic Course. University of Michigan. [Afghanistan. Transcription only.] ERIC #ED014717. Prepared under Contract No. SAE-8888 between The University Of Michigan and the United States Office of Education.
- Lorenz, Manfred. 1979, 1982. Lehrbuch des Pashto (Afghanisch). [Afghanistan] VEB Verlag Enzyklopädie Leipzig.
- Penzl, Herbert. 1955. A grammar of Pashto; a descriptive study of the dialect of Kandahar, Afghanistan. [Afghanistan. Transcription only.] Washington, American Council of Learned Societies.
- Roos-Keppel, George Olof, and Qazi Abdul Ghani Khan. 1901. A manual of Pushtu. London: Sampson Low, Marston.
- Shafeev, D.A. 1964. A Short Grammatical Outline of Pashto. Translated and edited by Herbert H. Paper. [Afghanistan. Transcription only.] Bloomington: Indiana University; The Hague: Mouton.
- Tegey, Habibullah, and Robson, Barbara. 1996. A Reference Grammar of Pashto. [Afghanistan] Washington, DC: Center for Applied Linguistics. ERIC #ED399825. Developed with funding from Grant No. P017A50047-95 from the International Research and Studies Program of the US Department Of Education.
Lexicon
- Morgenstierne, Georg. 2003. A new etymological vocabulary of Pashto; compiled and edited by J. Elfenbein, D.N. MacKenzie and Nicholas Sims-Williams. Transliteration only. Wiesbaden: Reichert.
- Qamosona English-Pashto Dictionary. Ver. 1.0. 2005. [Afghanistan]. “Based on the English to Pashto Dictionary, by Pashto Academy, Kabul, Afghanistan.” About 22,000 words. Free download. Requirements: Windows XP, 2000, or Windows ME Arabic edition.
- Penzl, Herbert. [Afghanistan] Online Version 1.0, released November 1998. “This dictionary contains all of the words from the glossary of Herbert Penzl's A grammar of Pashto: A descriptive study of the dialect of Kandahar, Afghanistan (Washington, DC: American Council of Learned Societies, 1955), pp. 154-165, which is available from Schoenhof's Foreign Books." Transliteration only. "An on-line key to the orthography is not yet available. In the meantime, please download the Access database version of this file, or consult Penzl (1955).”
- Tegey, Habibullah, and Robson, Barbara. 1993. Pashto-English Glossary for the CAL Pashto Materials. [Afghanistan] Washington, DC: Center for Applied Linguistics. Contract P017A90055. ERIC #ED364083. Pashto script and transliteration. PDF, imaged text. [Note on pagination: Original page 87 (PDF p.96) is followed by original pp.97, 88-96, 98, and the rest in sequence.]
- Wiktionary. Unicode. Monolingual. 613 entries (CC-BY-SA),(GFDL) [Mamandel 16:31, 3 May 2010 (UTC)]
Topical word lists
- Babynology: List of Pashto names in Roman transliteration
Monographs
- Ijaz, Madiha. Phonemic Inventory of Pashto. 2003. [Pakistan: Yusufzai dialect, in and around Peshawar. Transcription only.] Annual Student Report 2002-2003, Center for Research in Urdu Language Processing. The PDF apparently does not include fonts; many of the transcription characters are missing.
Educational software
- Kodakan. Educational software in Pashto and Dari.
Encoding and Fonts
The Unicode range for Arabic script is 0600-06FF. See also Writing.
Input
- Pashto Phonetic Keyboard. For typing Pashto in Windows XP, 2000, or ME (Arabic Edition). Free download.
Data Sources
Most of the text available is evidently monolingual. Parallel text is noted for some entries.
Magazines
- FARDA. (Description from Library of Congress) Published bimonthly by the Afghans’ Pen Club in Stockholm. A critical, social, and cultural magazine committed to democratic ideals. Links to articles, contributors, activities, and more. Articles in Swedish, Pashto, and Dari; general information in English.
News
- Azadi Radio. Pashto service of Radio Free Europe / Radio Liberty
- BBC.
- Benawa. Noted as "very close to Yusufzai Pashto" (June 2006).
- CRI (China Radio International).
- Deutsche Welle.
- Killid news portal.
- RTA: National Radio and Television of Afghanistan.
- Sabawoon Online. [Afghanistan]
- Voice of America.
- Wahdat. Islamic Unity Party of Afghanistan. Dated by Persian calendar. [Our 2006 downloads contain an expected proportion of Pashto-specific characters, but fresh downloads as of 2009-08-11 have no such, even for archived articles from the same epoch. This may be an artifact of text conversion.]
Literature
- Dastanona. Bimonthly magazine of Pashto fiction submitted by Afghan authors worldwide. Published in Kabul. Also has index pages and content in English, German, French, Russian, and Dari; some is parallel.
- D'Zra Dardoona. "Afflictions of the Heart: Poetic collections of Aminullah Zmaryalai". (imaged book pages)
- Khyber.org. Proverbs, short stories (some in English), jokes, poetry.
- Landay. Traditional couplets, with English translation. (imaged text) (The "table of contents" lists the categories in Pashto, transcription, and English translation, but the links to the text pages are in the corresponding frame on the right, labeled only in Pashto.)
Miscellaneous
- Afghan Mental Health: The Journal of Mental Health: Vol. 1, Issue 1, 2001. A magazine publication of Kabul University/Psychotrauma Centre. (imaged pages)
- Masood Peganai - Salzburg (Austria) based Afghan site. Mostly imaged pages.
- PashtonMatalona.
- Wikipedia. Unicode. 1,617 articles (CC-BY-SA),(GFDL) [Mamandel 16:31, 3 May 2010 (UTC)]
Speech
See also News.
- Killid Group: radio stations in Kabul and Herat.
- Radio Station World. Many listings.
Portals
- Afghan Network. [Malware warning removed. Mamandel 2010-07-19]
- Afghanan.net: Mostly in English.
- Afghanistan Information Centre: Many links, organized as General, Government & Political Groups, Organizations, Reconstruction and Development, Tourism, and so on. Some are labeled with the languages they are in, Pashto among them.
- Chai Khaana: online Pashtun community
- Hewad Afghanistan; Pashto home page. News, articles, literature. [Accessed 2009-08-10]
- Portals to the World: Afghanistan (Library of Congress): [Accessed 2009-08-10 : Last update January 9, 2006]
- UCLA Language Materials Project Pashto: Language & Culture
Tools and Other NLP Resources
Identifying Pashto
The following Perso-Arabic characters are probably unique to Pashto:
Code | Glyph | Unicode Name |
---|---|---|
U+0659 | ٙ | ARABIC ZWARAKAY |
U+067C | ټ | ARABIC LETTER TEH WITH RING |
U+0685 | څ | ARABIC LETTER HAH WITH THREE DOTS ABOVE |
U+0689 | ډ | ARABIC LETTER DAL WITH RING |
U+0693 | ړ | ARABIC LETTER REH WITH RING |
U+0696 | ږ | ARABIC LETTER REH WITH DOT BELOW AND DOT ABOVE |
U+069A | ښ | ARABIC LETTER SEEN WITH DOT BELOW AND DOT ABOVE |
U+06AB | ګ | ARABIC LETTER KAF WITH RING |
U+06BC | ڼ | ARABIC LETTER NOON WITH RING |