General Meta-resources

From the LDC Language Resource Wiki

THIS PAGE IS UNDER CONSTRUCTION

[Mamandel 17:06, 4 May 2011 (UTC)]

This page is for meta-resources that are applicable to many languages and are not specifically for computational natural language processing.
Language-independent NLP Resources have their own page.

Resource Organizations

ELRA: European Language Resources Association
- ELRA Catalogue: Language Resources available through ELRA
- ELRA Universal Catalogue: other corpora, lexica, and terminological resources worldwide

IMDI (ISLE Meta Data Initiative): “The web-based Browsable Corpus at the Max Planck Institute for Psycholinguistics allows you to browse through IMDI corpora and search for language resources.” (See also Metadata standards and infrastructure, below.) [Mamandel 14:19, 22 May 2011 (UTC)]

Language Description Heritage Open Access Library. “The goal of the Language Description Heritage (LDH) Open Access Digital Library is to provide easy access to descriptive material about the world’s languages. This collection is being compiled at the Max Planck Society in Germany as an open access digital repository of existing scientific contribution describing the world-wide linguistic diversity, focussing on traditionally difficult to obtain works.”

LDC: Linguistic Data Consortium (University of Pennsylvania). “supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards”
- The LDC Corpus Catalog.

OLAC: Open Language Archives Community: “an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources.” [Mamandel 14:01, 22 May 2011 (UTC)]

Penn State University: Computing with Accents, Symbols, and Foreign Scripts. “Each page includes information on how to view foreign language Web pages, tips for typing and development and links to other web sites.” [18:28, 9 May 2011 (UTC)]

The Rosetta Project: “a global collaboration of language specialists and native speakers working to build a publicly accessible digital library of human languages.”

Metadata standards and infrastructure

e-linguistics. “a cyber-infrastructure for linguistics ... meant to promote a paradigm shift within the field of linguistics where data are: interoperable -- shared -- open”

E-MELD: Electronic Metastructure for Endangered Languages Data. “a 5-year project with a dual objective: 1) To aid in the preservation of endangered languages data and documentation. 2) To aid in the development of the infrastructure necessary for effective collaboration among electronic archives.” A LINGUIST List project.

GOLD Community ("General Ontology for Linguistic Description"). “The purpose of the GOLD Community is to bring together scholars interested in best-practice encoding of linguistic data.”

IMDI (ISLE Meta Data Initiative): “a proposed metadata standard to describe multi-media and multi-modal language resources. The standard provides interoperability for browsable and searchable corpus structures and resource descriptions with help of specific tools.... The web-based Browsable Corpus at the Max Planck Institute for Psycholinguistics allows you to browse through IMDI corpora and search for language resources.” [Mamandel 14:19, 22 May 2011 (UTC)]

OLAC: Open Language Archives Community: “an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources.” [Mamandel 14:01, 22 May 2011 (UTC)]

Open Road: “The Open Road will explore Unicode language support issues in minority and emerging community languages within Australia.” Maintained by Vicnet, a division of the State Library of Victoria. [Accessed 2011-05-09]
