Material Detail

Linked Data in Linguistics for NLP and Web Annotation

Linked Data in Linguistics for NLP and Web Annotation

This video was recorded at W3C Workshop: The Multilingual Web − Linked Open Data and MultilingualWeb-LT Requirements, Dublin 2012. This presentation introduces three major data pools that have recently been made freely available as Linked Data by a collaborative community process: (1) the DBpedia Internationalization committee is concerned with the extraction of RDF from the language-specific Wikipedia editions; (2) the creation of a configurable extractor based on DBpedia and able to extract information from all languages of Wiktionary with manageable effort; (3) the Working Group for Open Lingustic Data, an Open Knowledge Foundation group with the goal of converting Open Linguistics data sets to RDF and interlinking them. The presentation highlights and stresses the role of Open Licences and RDF for the sustenance of such pools. It also provides a short update on the recent progress of NIF (Natural Language Processing Interchange Format) by the LOD2-EU project. NIF 2.0 will have many new features, including interoperability with the above-mentioned data pools as well as major RDF vocabularies such as OLiA, Lemon, and NERD. Furthermore, NIF can be used as an exchange language for Web annotation tools such as AnnotateIt as it uses robust Linked Data aware identifiers for Website annotation. The transcript of the Q&A session "Linking Resources" is available here.

Quality

  • User Rating
  • Comments
  • Learning Exercises
  • Bookmark Collections
  • Course ePortfolios
  • Accessibility Info

More about this material

Browse...

Disciplines with similar materials as Linked Data in Linguistics for NLP and Web Annotation

Comments

Log in to participate in the discussions or sign up if you are not already a MERLOT member.