General Meta-resources
From the LDC Language Resource Wiki
(Difference between revisions)
m (→Catalogs) |
m (→Resource Organizations) |
||
(17 intermediate revisions not shown) | |||
Line 2: | Line 2: | ||
{{si|[[User:Mamandel|Mamandel]] 17:06, 4 May 2011 (UTC)}} | {{si|[[User:Mamandel|Mamandel]] 17:06, 4 May 2011 (UTC)}} | ||
- | This page is for meta-resources that are applicable to many languages.<br> | + | This page is for meta-resources that are applicable to many languages and are not specifically for computational natural language processing.<br> |
Language-independent [[NLP Resources]] have their own page. | Language-independent [[NLP Resources]] have their own page. | ||
__TOC__ | __TOC__ | ||
- | == | + | == Resource Organizations == |
- | + | ||
- | + | ||
* [http://www.elra.info/ ELRA]: European Languages Resources Association | * [http://www.elra.info/ ELRA]: European Languages Resources Association | ||
- | ** [http://catalogue.elra.info/ ELRA Catalogue | + | ** [http://catalogue.elra.info/ ELRA Catalogue]: Language Resources available through ELRA |
- | ** [http://universal.elra.info/ ELRA Universal Catalogue] | + | ** [http://universal.elra.info/ ELRA Universal Catalogue]: other corpora, lexica, and terminological resources worldwide |
+ | |||
+ | * [http://www.mpi.nl/IMDI/ IMDI] (ISLE Meta Data Initiative): {{hq|The web-based Browsable Corpus at the Max Planck Institute for Psycholinguistics allows you to browse through IMDI corpora and search for language resources.}} (See also [[#Metadata standards and infrastructure|Metadata standards and infrastructure]], below.) {{si|[[User:Mamandel|Mamandel]] 14:19, 22 May 2011 (UTC)} | ||
+ | |||
+ | * [http://ldh.livingsources.org Language Description Heritage Open Access Library]. {{hq|The goal of the Language Description Heritage (LDH) Open Access Digital Library is to provide easy access to descriptive material about the world’s languages. This collection is being compiled at the Max Planck Society in Germany as an open access digital repository of existing scientific contribution describing the world-wide linguistic diversity, focussing on traditionally difficult to obtain works.}} | ||
* [http://www.ldc.upenn.edu/ LDC]: Linguistic Data Consortium (University of Pennsylvania). {{hq|supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards}} | * [http://www.ldc.upenn.edu/ LDC]: Linguistic Data Consortium (University of Pennsylvania). {{hq|supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards}} | ||
** [http://www.ldc.upenn.edu/Catalog/ The LDC Corpus Catalog]. | ** [http://www.ldc.upenn.edu/Catalog/ The LDC Corpus Catalog]. | ||
- | + | {{:OLAC}} | |
- | + | * [http://tlt.its.psu.edu/suggestions/international/bylanguage/index.html Penn State University: Computing with Accents, Symbols, and Foreign Scripts]. {{hq|Each page includes information on how to view foreign language Web pages, tips for typing and development and links to other web sites.}} {{si|18:28, 9 May 2011 (UTC)}} | |
- | *[http:// | + | |
+ | * [http://rosettaproject.org/ The Rosetta Project]: {{hq|a global collaboration of language specialists and native speakers working to build a publicly accessible digital library of human languages.}} | ||
== Metadata standards and infrastructure == | == Metadata standards and infrastructure == | ||
+ | |||
* [http://uakari.ling.washington.edu/e-linguistics/ e-linguistics]. {{hq|a cyber-infrastructure for linguistics ... meant to promote a paradigm shift within the field of linguistics where data are: interoperable -- shared -- open}} | * [http://uakari.ling.washington.edu/e-linguistics/ e-linguistics]. {{hq|a cyber-infrastructure for linguistics ... meant to promote a paradigm shift within the field of linguistics where data are: interoperable -- shared -- open}} | ||
- | |||
- | |||
- | * [http:// | + | * [http://emeld.org/index.cfm E-MELD]: Electronic Metastructure for Endangered Languages Data. {{hq|a 5-year project with a dual objective: 1) To aid in the preservation of endangered languages data and documentation. 2) To aid in the development of the infrastructure necessary for effective collaboration among electronic archives.}} A [http://linguistlist.org/ LINGUIST List] project. |
- | + | ||
- | + | ||
- | * [http:// | + | * [http://linguistics-ontology.org/ GOLD Community] ("General Ontology of Linguistic Description"). {{hq|The purpose of the GOLD Community is to bring together scholars interested in best-practice encoding of linguistic data.}} |
+ | {{:IMDI}} | ||
+ | |||
+ | {{:OLAC}} | ||
+ | * [http://www.openroad.net.au/languages/index.html Open Road]: {{hq|The Open Road will explore Unicode language support issues in minority and emerging community languages within Australia}}. Maintained by [http://www.vicnet.net.au/ Vicnet], a division of the [http://www.slv.vic.gov.au/ State Library of Victoria]. {{si|Accessed 2011-05-9}} | ||
[[Category:Non-language-specific]] | [[Category:Non-language-specific]] |
Latest revision as of 14:27, 22 May 2011
UNDER CONSTRUCTION
[Mamandel 17:06, 4 May 2011 (UTC)]
This page is for meta-resources that are applicable to many languages and are not specifically for computational natural language processing.
Language-independent NLP Resources have their own page.
Contents |
Resource Organizations
- ELRA: European Languages Resources Association
- ELRA Catalogue: Language Resources available through ELRA
- ELRA Universal Catalogue: other corpora, lexica, and terminological resources worldwide
- IMDI (ISLE Meta Data Initiative): “The web-based Browsable Corpus at the Max Planck Institute for Psycholinguistics allows you to browse through IMDI corpora and search for language resources.” (See also Metadata standards and infrastructure, below.) {{si|Mamandel 14:19, 22 May 2011 (UTC)}
- Language Description Heritage Open Access Library. “The goal of the Language Description Heritage (LDH) Open Access Digital Library is to provide easy access to descriptive material about the world’s languages. This collection is being compiled at the Max Planck Society in Germany as an open access digital repository of existing scientific contribution describing the world-wide linguistic diversity, focussing on traditionally difficult to obtain works.”
- LDC: Linguistic Data Consortium (University of Pennsylvania). “supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards”
- OLAC: Open Language Archives Community: “an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources.” [Mamandel 14:01, 22 May 2011 (UTC)]
- Penn State University: Computing with Accents, Symbols, and Foreign Scripts. “Each page includes information on how to view foreign language Web pages, tips for typing and development and links to other web sites.” [18:28, 9 May 2011 (UTC)]
- The Rosetta Project: “a global collaboration of language specialists and native speakers working to build a publicly accessible digital library of human languages.”
Metadata standards and infrastructure
- e-linguistics. “a cyber-infrastructure for linguistics ... meant to promote a paradigm shift within the field of linguistics where data are: interoperable -- shared -- open”
- E-MELD: Electronic Metastructure for Endangered Languages Data. “a 5-year project with a dual objective: 1) To aid in the preservation of endangered languages data and documentation. 2) To aid in the development of the infrastructure necessary for effective collaboration among electronic archives.” A LINGUIST List project.
- GOLD Community ("General Ontology of Linguistic Description"). “The purpose of the GOLD Community is to bring together scholars interested in best-practice encoding of linguistic data.”
- IMDI (ISLE Meta Data Initiative): “a proposed metadata standard to describe multi-media and multi-modal language resources. The standard provides interoperability for browsable and searchable corpus structures and resource descriptions with help of specific tools.... The web-based Browsable Corpus at the Max Planck Institute for Psycholinguistics allows you to browse through IMDI corpora and search for language resources.” [Mamandel 14:19, 22 May 2011 (UTC)]
- OLAC: Open Language Archives Community: “an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources.” [Mamandel 14:01, 22 May 2011 (UTC)]
- Open Road: “The Open Road will explore Unicode language support issues in minority and emerging community languages within Australia”. Maintained by Vicnet, a division of the State Library of Victoria. [Accessed 2011-05-9]