General Meta-resources
From the LDC Language Resource Wiki
UNDER CONSTRUCTION
[Mamandel 17:06, 4 May 2011 (UTC)]
This page is for meta-resources that are applicable to many languages.
Language-independent NLP Resources have their own page.
Contents |
Catalogs
- OLAC: Open Language Archives Community: “an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources.” [Mamandel 14:01, 22 May 2011 (UTC)]
- ELRA: European Languages Resources Association
- ELRA Catalogue: Language Resources available through ELRA
- ELRA Universal Catalogue: other corpora, lexica, and terminological resources worldwide
- LDC: Linguistic Data Consortium (University of Pennsylvania). “supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards”
- The Rosetta Project: “a global collaboration of language specialists and native speakers working to build a publicly accessible digital library of human languages.”
Description
- Language Description Heritage Open Access Library. “The goal of the Language Description Heritage (LDH) Open Access Digital Library is to provide easy access to descriptive material about the world’s languages. This collection is being compiled at the Max Planck Society in Germany as an open access digital repository of existing scientific contribution describing the world-wide linguistic diversity, focussing on traditionally difficult to obtain works.”
Metadata standards and infrastructure
- e-linguistics. “a cyber-infrastructure for linguistics ... meant to promote a paradigm shift within the field of linguistics where data are: interoperable -- shared -- open”
- E-MELD: Electronic Metastructure for Endangered Languages Data. “a 5-year project with a dual objective: 1) To aid in the preservation of endangered languages data and documentation. 2) To aid in the development of the infrastructure necessary for effective collaboration among electronic archives.” A LINGUIST List project.
- “If linguistic archives are to offer the widest possible access to the data and provide it in a maximally useful form, consensus must be reached about certain aspects of archive infrastructure. The primary goal of E-MELD is to promote this consensus.”
- GOLD Community ("General Ontology of Linguistic Description"):
- “The purpose of the GOLD Community is to bring together scholars interested in best-practice encoding of linguistic data. We promote best practice as suggested by E-MELD, encourage data interoperability through the use of the GOLD Standard, facilitate search across disparate data sets and provide a platform for sharing existing data and tools from related research projects. [...] This standard encompasses linguistic concepts, definitions of these concepts and relationships between them in a freely available ontology.”
- NSF grant BCS-0720670, Implementing the GOLD Community of Practice: Laying the Foundations for a Linguistics Cyberinfrastructure
- OLAC: Open Language Archives Community: “an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources.” [Mamandel 14:01, 22 May 2011 (UTC)]