General Meta-resources

From the LDC Language Resource Wiki

(Difference between revisions)
Jump to: navigation, search
m
m (Resource Organizations)
 
(18 intermediate revisions not shown)
Line 2: Line 2:
{{si|[[User:Mamandel|Mamandel]] 17:06, 4 May 2011 (UTC)}}
{{si|[[User:Mamandel|Mamandel]] 17:06, 4 May 2011 (UTC)}}
-
This page is for meta-resources that are applicable to many languages.<br>
+
This page is for meta-resources that are applicable to many languages and are not specifically for computational natural language processing.<br>
Language-independent [[NLP Resources]] have their own page.
Language-independent [[NLP Resources]] have their own page.
__TOC__  
__TOC__  
-
== Catalogs ==
+
== Resource Organizations ==
-
* [http://www.language-archives.org/ OLAC: Open Language Archives Community]: {{Heavy quotes|an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources.}}
+
-
 
+
* [http://www.elra.info/ ELRA]: European Languages Resources Association  
* [http://www.elra.info/ ELRA]: European Languages Resources Association  
-
** [http://catalogue.elra.info/ ELRA Catalogue of Language Resources] available through ELRA
+
** [http://catalogue.elra.info/ ELRA Catalogue]: Language Resources available through ELRA
-
** [http://universal.elra.info/ ELRA Universal Catalogue] of other corpora, lexica, and terminological resources worldwide
+
** [http://universal.elra.info/ ELRA Universal Catalogue]: other corpora, lexica, and terminological resources worldwide
 +
 
 +
* [http://www.mpi.nl/IMDI/ IMDI] (ISLE Meta Data Initiative): {{hq|The web-based Browsable Corpus at the Max Planck Institute for Psycholinguistics allows you to browse through IMDI corpora and search for language resources.}} (See also [[#Metadata standards and infrastructure|Metadata standards and infrastructure]], below.) {{si|[[User:Mamandel|Mamandel]] 14:19, 22 May 2011 (UTC)}
 +
 
 +
* [http://ldh.livingsources.org Language Description Heritage Open Access Library]. {{hq|The goal of the Language Description Heritage (LDH) Open Access Digital Library is to provide easy access to descriptive material about the world’s languages. This collection is being compiled at the Max Planck Society in Germany as an open access digital repository of existing scientific contribution describing the world-wide linguistic diversity, focussing on traditionally difficult to obtain works.}} 
* [http://www.ldc.upenn.edu/ LDC]: Linguistic Data Consortium (University of Pennsylvania). {{hq|supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards}}  
* [http://www.ldc.upenn.edu/ LDC]: Linguistic Data Consortium (University of Pennsylvania). {{hq|supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards}}  
** [http://www.ldc.upenn.edu/Catalog/ The LDC Corpus Catalog].
** [http://www.ldc.upenn.edu/Catalog/ The LDC Corpus Catalog].
-
* [http://rosettaproject.org/ The Rosetta Project]: {{hq|a global collaboration of language specialists and native speakers working to build a publicly accessible digital library of human languages.}}
+
{{:OLAC}}
-
==Description==
+
* [http://tlt.its.psu.edu/suggestions/international/bylanguage/index.html Penn State University: Computing with Accents, Symbols, and Foreign Scripts]. {{hq|Each page includes information on how to  view foreign language Web pages, tips for typing and development and  links to other web sites.}} {{si|18:28, 9 May 2011 (UTC)}}
-
*[http://ldh.livingsources.org Language Description Heritage Open Access Library]. {{hq|The goal of the Language Description Heritage (LDH) Open Access Digital Library is to provide easy access to descriptive material about the world’s languages. This collection is being compiled at the Max Planck Society in Germany as an open access digital repository of existing scientific contribution describing the world-wide linguistic diversity, focussing on traditionally difficult to obtain works.}}  
+
 
 +
* [http://rosettaproject.org/ The Rosetta Project]: {{hq|a global collaboration of language specialists and native speakers working to build a publicly accessible digital library of human languages.}}
== Metadata standards and infrastructure ==
== Metadata standards and infrastructure ==
 +
* [http://uakari.ling.washington.edu/e-linguistics/ e-linguistics]. {{hq|a cyber-infrastructure for linguistics ... meant to promote a paradigm shift within the field of linguistics where data are: interoperable -- shared -- open}}  
* [http://uakari.ling.washington.edu/e-linguistics/ e-linguistics]. {{hq|a cyber-infrastructure for linguistics ... meant to promote a paradigm shift within the field of linguistics where data are: interoperable -- shared -- open}}  
-
* [http://emeld.org/index.cfm E-MELD]: Electronic Metastructure for Endangered Languages Data. {{hq|a 5-year project with a dual objective: 1) To aid in the preservation of endangered languages data and documentation. 2) To aid in the development of the infrastructure necessary for effective collaboration among electronic archives.}} A LINGUIST List project.
 
-
** {{hq|If linguistic archives are to offer the widest possible access to the data and provide it in a maximally useful form, consensus must be reached about certain aspects of archive infrastructure. The primary goal of E-MELD is to promote this consensus.}}
 
-
* [http://linguistics-ontology.org/ GOLD Community] ("General Ontology of Linguistic Description"):
+
* [http://emeld.org/index.cfm E-MELD]: Electronic Metastructure for Endangered Languages Data. {{hq|a 5-year project with a dual objective: 1) To aid in the preservation of endangered languages data and documentation. 2) To aid in the development of the infrastructure necessary for effective collaboration among electronic archives.}} A [http://linguistlist.org/ LINGUIST List] project.
-
** {{hq|The purpose of the GOLD Community is to bring together scholars interested in best-practice encoding of linguistic data. We promote best practice as suggested by E-MELD, encourage data interoperability through the use of the GOLD Standard, facilitate search across disparate data sets and provide a platform for sharing existing data and tools from related research projects. [...] This standard encompasses linguistic concepts, definitions of these concepts and relationships between them in a freely available ontology.}}
+
 
-
** [http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0720670 NSF grant BCS-0720670], Implementing the GOLD Community of Practice: Laying the Foundations for a Linguistics Cyberinfrastructure
+
* [http://linguistics-ontology.org/ GOLD Community] ("General Ontology of Linguistic Description"){{hq|The purpose of the GOLD Community is to bring together scholars interested in best-practice encoding of linguistic data.}}
-
* [http://www.language-archives.org/ OLAC: Open Language Archives Community]. See [[#Catalogs]].
+
{{:IMDI}}
 +
{{:OLAC}}
 +
* [http://www.openroad.net.au/languages/index.html Open Road]: {{hq|The Open Road will explore Unicode language support issues in minority and emerging community languages within Australia}}. Maintained by [http://www.vicnet.net.au/ Vicnet], a division of the [http://www.slv.vic.gov.au/ State Library of Victoria]. {{si|Accessed 2011-05-9}}
[[Category:Non-language-specific]]
[[Category:Non-language-specific]]

Latest revision as of 14:27, 22 May 2011

THIS PAGE IS

UNDER CONSTRUCTION


[Mamandel 17:06, 4 May 2011 (UTC)]

This page is for meta-resources that are applicable to many languages and are not specifically for computational natural language processing.
Language-independent NLP Resources have their own page.

Contents


Resource Organizations

  • IMDI (ISLE Meta Data Initiative): The web-based Browsable Corpus at the Max Planck Institute for Psycholinguistics allows you to browse through IMDI corpora and search for language resources. (See also Metadata standards and infrastructure, below.) {{si|Mamandel 14:19, 22 May 2011 (UTC)}
  • Language Description Heritage Open Access Library. The goal of the Language Description Heritage (LDH) Open Access Digital Library is to provide easy access to descriptive material about the world’s languages. This collection is being compiled at the Max Planck Society in Germany as an open access digital repository of existing scientific contribution describing the world-wide linguistic diversity, focussing on traditionally difficult to obtain works.
  • LDC: Linguistic Data Consortium (University of Pennsylvania). supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards
  • OLAC: Open Language Archives Community: an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources. [Mamandel 14:01, 22 May 2011 (UTC)]
  • The Rosetta Project: a global collaboration of language specialists and native speakers working to build a publicly accessible digital library of human languages.

Metadata standards and infrastructure

  • e-linguistics. a cyber-infrastructure for linguistics ... meant to promote a paradigm shift within the field of linguistics where data are: interoperable -- shared -- open
  • E-MELD: Electronic Metastructure for Endangered Languages Data. a 5-year project with a dual objective: 1) To aid in the preservation of endangered languages data and documentation. 2) To aid in the development of the infrastructure necessary for effective collaboration among electronic archives. A LINGUIST List project.
  • GOLD Community ("General Ontology of Linguistic Description"). The purpose of the GOLD Community is to bring together scholars interested in best-practice encoding of linguistic data.
  • IMDI (ISLE Meta Data Initiative): a proposed metadata standard to describe multi-media and multi-modal language resources. The standard provides interoperability for browsable and searchable corpus structures and resource descriptions with help of specific tools.... The web-based Browsable Corpus at the Max Planck Institute for Psycholinguistics allows you to browse through IMDI corpora and search for language resources. [Mamandel 14:19, 22 May 2011 (UTC)]
  • OLAC: Open Language Archives Community: an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources. [Mamandel 14:01, 22 May 2011 (UTC)]
  • Open Road: The Open Road will explore Unicode language support issues in minority and emerging community languages within Australia. Maintained by Vicnet, a division of the State Library of Victoria. [Accessed 2011-05-9]
Personal tools