BauDataWeb: The European Building and Construction Materials

A joint project by inndata Datentechnik GmbH and the E-Business & Web Science Research Group, Universität der Bundeswehr München.
With this project, we expose a major dataset of the European building and construction materials market for the Semantic Web on the basis of the GoodRelations Web Vocabulary for E-Commerce. This allows fine-grained search for products, suppliers, and warehouses for any building-related sourcing needs.
BauDataWeb is one of the largest and richest public datasets for a well-defined vertical sector that is available on the Semantic Web. It covers a major share of the European market. Key distinctions from other datasets are:
1. The market for building materials shows very high item specificity, which makes it very interesting for new types of search.
2. Transportation costs for building materials are usually significant, which makes the distance from the warehouse to the point of consumption a critical dimension of search.
3. A large share of the items include a rich, machine-readable description of product features using the FreeClassOWL ontology.
A particularly interesting aspect of this dataset is that it can be combined with other related datasets on the Web of Linked Data, e.g.
- DBpedia information about population or transportation infrastructure,
- governmental information, or
- real estate offers.
The project consists of five major components.
a) Dataset: The full data is available in RDF. For fetching the dataset, please use the sitemap at http://semantic.euro…map.xml. The data consists of ca. xx million individual RDF/XML files plus a few large data dump files in Turtle syntax that simplify the crawling of all data at once.
b) FreeClassOWL: A GoodRelations-compliant ontology for describing construction and building materials and services
OWL in RDF/XML: …class_v1.owl
c) The Eurobau Utility Ontology, which defines a few extensions to GoodRelations for the particular vertical domain
OWL in RDF/XML: http://semantic.euro…bau-utility.owl
d) A demo application that demonstrates queries combining product features and warehouse distance
e) Public SPARQL endpoints that host the data, e.g. http://linkeddata.uribur…
- Over 60 million triples of real business data with a high domain density
- Fully GoodRelations-compliant
- Fully W3C-compliant
- Geo data for warehouse locations
- FreeClassOWL product classes and properties for a majority of the products
- 81 Manufacturers / Brands
- 19 Resellers
- 183 Warehouse locations
- 56,360 Product types (including variants)
- 1,783,798 Offerings
- 95% of the product models include rich FreeClassOWL descriptions
You can load the data into any SPARQL endpoint of your choice. For fetching the dataset, please use the semantic sitemap at http://semantic.euro…map.xml.
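As a rough illustration of the fetching step, the following Python sketch collects the URLs listed in a sitemap document. It assumes that the sitemap uses the standard sitemaps.org schema with `<loc>` elements; the sample XML and its example.org URLs are made up for illustration, and any semantic sitemap extension elements (for instance pointers to the dump files) are not handled.

```python
import xml.etree.ElementTree as ET
from typing import List

# Namespace of the standard sitemaps.org schema (assumed to apply here).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def extract_locations(sitemap_xml: str) -> List[str]:
    """Return the URLs found in the <loc> elements of a sitemap document."""
    root = ET.fromstring(sitemap_xml)
    return [
        loc.text.strip()
        for loc in root.iterfind(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]


# Hypothetical sample; the content of the real sitemap is not reproduced here.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://example.org/data/item-1.rdf</loc></url>
  <url><loc>http://example.org/data/item-2.rdf</loc></url>
</urlset>"""

if __name__ == "__main__":
    for url in extract_locations(SAMPLE):
        print(url)
```

Each returned URL could then be downloaded and loaded into the triple store of your choice.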
Currently, the data is available for SPARQL queries via the OpenLink Software Virtuoso repositories at … and http://linkeddata.uri…
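For readers who want to query such an endpoint programmatically, here is a minimal Python sketch based on the W3C SPARQL Protocol (HTTP GET with a `query` parameter) and the SPARQL JSON results format. The endpoint address `http://example.org/sparql` is a placeholder, and the helper names `build_request`, `parse_bindings`, and `run_query` are hypothetical; whether a given endpoint honours the `Accept` header exactly this way is an assumption.

```python
import json
import urllib.parse
import urllib.request
from typing import Dict, List

# Placeholder address; substitute the SPARQL endpoint you actually use.
ENDPOINT = "http://example.org/sparql"


def build_request(endpoint: str, query: str) -> urllib.request.Request:
    """Build a SPARQL Protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def parse_bindings(body: str) -> List[Dict[str, str]]:
    """Flatten a SPARQL JSON result document into one dict per result row."""
    data = json.loads(body)
    return [
        {var: cell["value"] for var, cell in row.items()}
        for row in data["results"]["bindings"]
    ]


def run_query(endpoint: str, query: str) -> List[Dict[str, str]]:
    """Send the query over HTTP and return the flattened result rows."""
    request = build_request(endpoint, query)
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_bindings(response.read().decode("utf-8"))
```

Calling `run_query(ENDPOINT, query_text)` returns a list of dictionaries keyed by the variable names of the SELECT clause, without the leading question mark.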
Example of a SPARQL query:
# Search for plan clay blocks (FreeClass code 12201010) with a thickness of 38 centimetres
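PREFIX gr: <http://purl.org/goodrelations/v1#>
PREFIX fc: <>
PREFIX vcard: <http://www.w3.org/2006/vcard/ns#>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>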

SELECT DISTINCT ?offer ?lososp ?lososp_label ?lososp_url ?long ?lat ?posm ?posm_label ?offerlink
WHERE {
  # Offers and the product models they refer to
  ?offer a gr:Offering .
  ?offer gr:includesObject [gr:typeOfGood ?ph] .
  ?ph gr:hasMakeAndModel ?posm .
  # Warehouse location with label, coordinates, and URL
  ?offer gr:availableAtOrFrom ?lososp .
  ?lososp rdfs:label ?lososp_label .
  ?lososp vcard:geo [vcard:longitude ?long; vcard:latitude ?lat] .
  ?lososp vcard:adr [vcard:url ?lososp_url] .
  ?offer foaf:page ?offerlink .
  # Restrict to product models classified under FreeClass code 12201010
  ?posm a ?fc_gen .
  ?posm a gr:ProductOrServiceModel .
  ?posm rdfs:label ?posm_label .
  ?fc_gen rdfs:subClassOf ?fc_tax .
  ?fc_tax fc:hierarchyCode "12201010" .
  # Keep only models whose fc:P_4 value equals 38
  ?posm fc:P_4 [gr:hasValueFloat ?ValueP_4] .
  FILTER(?ValueP_4 = 38)
}
ORDER BY ?lososp ?posm_label
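Since the query returns the warehouse coordinates as ?long and ?lat, the results can be ranked by distance to a construction site on the client side. The Python sketch below does this with the haversine great-circle formula, which is one simple choice and not necessarily what the demo application uses. The function names, the sample rows, and all coordinates are made up for illustration; rows are assumed to be dictionaries keyed by the query's variable names.

```python
import math
from typing import Dict, List, Tuple

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    # Clamp to 1.0 to guard against rounding errors before taking asin.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def rank_by_distance(
    rows: List[Dict[str, str]], site_lat: float, site_lon: float
) -> List[Tuple[float, Dict[str, str]]]:
    """Pair each result row with its distance to the site, nearest first."""
    ranked = [
        (haversine_km(site_lat, site_lon, float(row["lat"]), float(row["long"])), row)
        for row in rows
    ]
    ranked.sort(key=lambda pair: pair[0])
    return ranked


# Hypothetical result rows; labels and coordinates are illustrative only.
SAMPLE_ROWS = [
    {"lososp_label": "Warehouse Vienna", "lat": "48.21", "long": "16.37"},
    {"lososp_label": "Warehouse Innsbruck", "lat": "47.27", "long": "11.39"},
]

if __name__ == "__main__":
    # Assumed construction site near Munich.
    for distance, row in rank_by_distance(SAMPLE_ROWS, 48.14, 11.58):
        print(f"{row['lososp_label']}: {distance:.0f} km")
```

Sorting on the client keeps the SPARQL query portable; an endpoint with geospatial extensions could do the same filtering on the server, but that depends on the store in use.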
Dipl.-Ing. Andreas Radinger
E-Business and Web Science Research Group
Universität der Bundeswehr München
Werner-Heisenberg-Weg 39
D-85579 Neubiberg, Germany
Phone: +49 89 6004-4218

Univ.-Prof. Dr. Martin Hepp
Chair of General Management and E-Business
E-Business and Web Science Research Group
Universität der Bundeswehr München
Werner-Heisenberg-Weg 39
D-85579 Neubiberg, Germany
Phone: +49 89 6004-4217

Bmstr. Ing. Otto Handle
inndata Datentechnik GmbH
A-6020 Innsbruck, Austria
Phone: +43 512 362233

The data conversion and implementation were carried out by Andreas Radinger and Martin Hepp at the E-Business & Web Science Research Group at the Universität der Bundeswehr München, Germany.
The underlying relational database was designed by Otto Handle and is maintained and operated by inndata Datentechnik GmbH.
The work on BauDataWeb was partially funded by the Austrian FFG under the project grant "icontent.document" (grant no. xyz).
GoodRelations Vocabulary for E-Commerce:
Hepp, Martin: GoodRelations: An Ontology for Describing Products and Services Offers on the Web, Proceedings of the 16th International Conference on Knowledge Engineering and Knowledge Management (EKAW2008), Acitrezza, Italy, September 29 - October 3, 2008, Springer LNCS, Vol. 5268, pp. 332-347.
PDF at http://www.hepp…RelationsEKAW2008-crc-final.pdf