Material Detail

Detecting Incorrect Numerical Data in DBpedia

Detecting Incorrect Numerical Data in DBpedia

This video was recorded at 11th Extended Semantic Web Conference (ESWC), Crete 2014. DBpedia is a central hub of Linked Open Data (LOD). Being based on crowd-sourced contents and heuristic extraction methods, it is not free of errors. In this paper, we study the application of unsupervised numerical outlier detection methods to DBpedia, using Interquantile Range (IQR), Kernel Density Estimation (KDE), and various dispersion estimators, combined with dierent semantic grouping methods. Our approach reaches 87% precision, and has lead to the identication of 11 systematic errors in the DBpedia extraction framework

Quality

  • User Rating
  • Comments
  • Learning Exercises
  • Bookmark Collections
  • Course ePortfolios
  • Accessibility Info

More about this material

Comments

Log in to participate in the discussions or sign up if you are not already a MERLOT member.