DBpedia is a collaborative project that aims to extract structured information from Wikipedia and make it available to other Web applications. DBpedia also allows this data to be combined with information from other Web applications.
Knowledge bases are playing an increasingly important role in enhancing the intelligence of the Web. Data and metadata must serve many different communities, and for that reason both must constantly evolve. This requires a model for representing and managing structured information. Inconsistency, ambiguity, uncertainty, data provenance and implicit knowledge must be handled in a uniform way. The DBpedia project focuses on building a large-scale, multilingual knowledge base by converting Wikipedia content into structured knowledge. Like many other Web applications, Wikipedia has the problem that its search capabilities are limited to full-text search, which restricts access to this valuable knowledge base.
Wikipedia articles consist mostly of free text, but also include structured information. Such information includes infobox templates, categorization information, images, geo-coordinates, links to external web pages, disambiguation pages, redirects between pages, and links across different language editions of Wikipedia. This structured information is extracted by the DBpedia extraction framework and placed in a uniform dataset that can be queried.
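To make the idea of turning infobox content into a queryable dataset more concrete, the following is a minimal sketch in Python. It is not the DBpedia extraction framework itself: the function names infobox_to_triples and match_pattern are hypothetical, the sample wikitext is a hand-written illustration, and the parsing assumes simple one-line "| key = value" fields, which real infoboxes often violate.

```python
import re

# Namespaces used for illustration; the mapping rules of the real framework
# are more involved than this simple string concatenation.
RESOURCE = "http://dbpedia.org/resource/"
PROPERTY = "http://dbpedia.org/property/"


def infobox_to_triples(title, wikitext):
    """Turn '| key = value' infobox lines into (subject, predicate, object) triples."""
    subject = RESOURCE + title.replace(" ", "_")
    triples = []
    for line in wikitext.splitlines():
        # Assumption: each field sits on one line and has the form "| key = value".
        match = re.match(r"^\s*\|\s*([^=|]+?)\s*=\s*(.+?)\s*$", line)
        if match:
            key, value = match.groups()
            triples.append((subject, PROPERTY + key.replace(" ", "_"), value))
    return triples


def match_pattern(triples, s=None, p=None, o=None):
    """Return the triples matching a pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# Hand-written sample input, not copied from an actual Wikipedia article.
SAMPLE = """{{Infobox settlement
| name = Berlin
| country = Germany
}}"""

triples = infobox_to_triples("Berlin", SAMPLE)
country = match_pattern(triples, p=PROPERTY + "country")
```

Storing every fact in the same subject, predicate, object shape is what makes a single query mechanism work across all kinds of extracted information. The wildcard lookup here is a stand-in for a real query language such as SPARQL.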
Extraction Framework: The DBpedia community uses a flexible and extensible framework to extract different kinds of structured information from Wikipedia. The DBpedia extraction is structured into four phases:
DBpedia data sets are interlinked with various other data sources. For this purpose, Linked Data, a method for publishing structured data, is used. Linked Data can be accessed with Semantic Web browsers, which enable users to navigate between different data sources by following RDF links. These RDF links can also be followed by search engines or robots that crawl the Semantic Web.
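Following RDF links from one data source to another can be sketched as a graph traversal. The example below is only an illustration under stated assumptions: the triples are a toy in-memory list, the example.org URIs are hypothetical stand-ins for external datasets, and follow_links is a made-up helper. A real browser or crawler would dereference each URI over HTTP instead.

```python
from collections import deque

SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"

# Toy triples standing in for several datasets; the example.org URIs are hypothetical.
TRIPLES = [
    ("http://dbpedia.org/resource/Berlin", SAME_AS,
     "http://example.org/geo/berlin"),
    ("http://example.org/geo/berlin", SAME_AS,
     "http://example.org/stats/city/berlin"),
    ("http://dbpedia.org/resource/Berlin", "http://dbpedia.org/ontology/country",
     "http://dbpedia.org/resource/Germany"),
]


def follow_links(start, triples, predicate=SAME_AS):
    """Breadth-first walk over links with the given predicate, starting at `start`."""
    seen = {start}
    queue = deque([start])
    order = []
    while queue:
        node = queue.popleft()
        order.append(node)
        for s, p, o in triples:
            # Only follow links of the requested type, and visit each URI once.
            if s == node and p == predicate and o not in seen:
                seen.add(o)
                queue.append(o)
    return order


reachable = follow_links("http://dbpedia.org/resource/Berlin", TRIPLES)
```

Here reachable lists the starting resource followed by every URI connected to it through owl:sameAs links, in the order a breadth-first crawler would visit them. The country triple is ignored because it uses a different predicate.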
DBpedia provides localized versions in 125 languages. All these versions together describe 38.3 million things. DBpedia is connected with other Linked Datasets by around 50 million RDF links. Altogether, the DBpedia 2014 release consists of 3 billion pieces of information (RDF triples).