A joint project by inndata Datentechnik GmbH (Link: http://www.inndata.at/) and the E-Business & Web Science Research Group, Universität der Bundeswehr München (Link: http://www.unibw.de/ebusiness/).
With this project, we expose a major dataset of the European building and construction materials market for the Semantic Web (Link: http://www.w3.org/standards/semanticweb/) on the basis of the GoodRelations Web Vocabulary for E-Commerce (Link: http://purl.org/goodrelations/). This enables fine-grained search for products, suppliers, and warehouses for any building-related sourcing need.
BauDataWeb is one of the largest and richest public datasets for a well-defined vertical sector that is available on the Semantic Web. It covers a major share of the European market.
Key distinctions from other datasets are:
1. The market for building materials shows a very high item specificity, which makes it very interesting for new types of search.
2. Transportation costs for building materials are usually very significant, which makes the distance from the warehouse to the point of consumption a critical dimension of search.
3. A large part of the items includes a rich, machine-readable description of product features using the FreeClassOWL ontology (Link: http://www.freeclass.eu/freeclass_v1/).
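The role of warehouse distance in point 2 can be illustrated with a minimal sketch that ranks warehouses by great-circle distance to a construction site. The warehouse names and coordinates below are hypothetical and not taken from the dataset. The haversine formula is one common approximation and is not necessarily what the project's demo application uses.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two WGS84 points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def rank_by_distance(site, warehouses):
    """Return (distance_km, name) pairs sorted from nearest to farthest.

    site: (lat, lon); warehouses: iterable of (name, lat, lon).
    """
    return sorted(
        (haversine_km(site[0], site[1], lat, lon), name)
        for name, lat, lon in warehouses
    )


# Hypothetical example data (not from the dataset).
site = (47.27, 11.39)  # a construction site near Innsbruck
warehouses = [
    ("Vienna depot", 48.21, 16.37),
    ("Munich depot", 48.14, 11.58),
]
ranking = rank_by_distance(site, warehouses)
for distance, name in ranking:
    print(f"{name}: {distance:.0f} km")
```

In practice, road distance or transport cost would be a better ranking key than straight-line distance. The sketch only shows why geo data for warehouse locations matters for this kind of search.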
A particularly interesting aspect of this dataset is that it can be combined with other related datasets on the Web of Linked Data, for example:
- DBpedia information about population or transportation infrastructure,
- governmental information, or
- real estate offers.
The project consists of five major components.
a) Dataset: The full data is available in RDF. For fetching the dataset, please use the sitemap at http://semantic.eurobau.com/sitemap.xml.
The data consists of ca. xx million individual RDF/XML files plus a few large data dump files in Turtle syntax that simplify the crawling of all data at once.
b) FreeClassOWL: A GoodRelations-compliant ontology for describing construction and building materials and services
OWL in RDF/XML: http://www.freeclass.eu/freeclass_v1.owl
c) The Eurobau Utility Ontology, which defines a few extensions to GoodRelations for the particular vertical domain
OWL in RDF/XML: http://semantic.eurobau.com/eurobau-utility.owl
d) A demo application that demonstrates queries combining product features and warehouse distance
e) Public SPARQL endpoints that host the data
- Over 60 million triples of real business data with a high domain density
- Fully GoodRelations-compliant
- Fully W3C-compliant
- Geo data for warehouse locations
- FreeClassOWL product classes and properties for a majority of the products
81 Manufacturers / Brands
183 Warehouse locations
56,360 Product types (including variants)
95 % of the product models include rich FreeClassOWL descriptions
You can load the data into any SPARQL endpoint of your choice.
For fetching the dataset, please use the semantic sitemap at http://semantic.eurobau.com/sitemap.xml.
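As a rough illustration of working with the sitemap, the following sketch extracts the listed URLs from a sitemap document that has already been downloaded. It assumes the file follows the standard sitemaps.org schema. The sample content and its example.org URLs are hypothetical, and any Semantic Sitemap extension elements (such as data dump locations) are not handled here.

```python
import xml.etree.ElementTree as ET

# Namespace of the standard sitemap protocol (sitemaps.org).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def extract_locations(sitemap_xml):
    """Return all <loc> URLs found in a sitemap XML string."""
    root = ET.fromstring(sitemap_xml.strip())
    return [
        loc.text.strip()
        for loc in root.iterfind(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]


# Hypothetical sample; the real sitemap content may differ.
SAMPLE = """
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://example.org/data/item-1.rdf</loc></url>
  <url><loc>http://example.org/data/item-2.rdf</loc></url>
</urlset>
"""

urls = extract_locations(SAMPLE)
print(urls)
```

The resulting URL list can then be handed to whatever loader the target SPARQL store provides.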
Currently, the data is available for SPARQL queries via the OpenLink Software Virtuoso repositories at
Example of a SPARQL query:
# Search for plan clay blocks (FreeClass code 12201010) with a thickness of 38 centimetres
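A query of this kind might be assembled as in the following sketch. The GoodRelations terms (`gr:hasValueFloat`, `gr:hasUnitOfMeasurement`) are part of that vocabulary, and `CMT` is the UN/CEFACT common code for centimetre. The `fc:` namespace URI is an assumption, and the class and property identifiers are placeholders: the actual FreeClassOWL identifiers for code 12201010 and for the thickness property should be looked up in the ontology.

```python
import re

PREFIXES = (
    "PREFIX gr: <http://purl.org/goodrelations/v1#>\n"
    # Assumed namespace for FreeClassOWL; verify against the ontology file.
    "PREFIX fc: <http://www.freeclass.eu/freeclass_v1#>\n"
)

_LOCAL_NAME = re.compile(r"[A-Za-z_][A-Za-z0-9_\-]*")


def build_feature_query(class_id, property_id, value, unit_code, limit=20):
    """Build a SPARQL query for product models of one class whose
    quantitative feature equals `value` in the given unit."""
    # Reject anything that is not a plain local name to avoid query injection.
    for ident in (class_id, property_id):
        if not _LOCAL_NAME.fullmatch(ident):
            raise ValueError(f"unsafe identifier: {ident!r}")
    if not re.fullmatch(r"[A-Z0-9]{2,3}", unit_code):
        raise ValueError(f"unexpected unit code: {unit_code!r}")
    return PREFIXES + f"""SELECT ?model ?value WHERE {{
  ?model a fc:{class_id} ;
         fc:{property_id} ?qv .
  ?qv gr:hasValueFloat ?value ;
      gr:hasUnitOfMeasurement ?unit .
  FILTER (?value = {float(value)} && str(?unit) = "{unit_code}")
}}
LIMIT {int(limit)}
"""


# Placeholder identifiers, NOT the real FreeClassOWL names.
query = build_feature_query(
    "C_HYPOTHETICAL_12201010", "P_HYPOTHETICAL_THICKNESS", 38, "CMT"
)
print(query)
```

The generated string can be sent to any SPARQL endpoint that hosts the data. Comparing the unit via `str(?unit)` sidesteps differences in how stores type the unit literal.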