The Semantic Web

There exist numerous definitions of the Semantic Web, which mean more or less the same. For this paper, we have selected two. The first one is from Berners-Lee and the second one is taken from the W3C Semantic Web Activity Statement.

The Semantic Web is a web of data, in some ways like a global database

The Semantic Web is a vision: the idea of having data on the web defined and linked in a way, that it can be used by machines - not just for display purposes, but for automation, integration and reuse of data across various applications.”

The current web was designed to improve the exchange of information between different people. Unfortunately machines cannot easily access this information since there is too much overhead involved: the content is not cleanly separated from the instructions on how to present content. An HTML document mixes content and layout instructions. Another problem is that a word can have different meanings. Any search engine on the web at present suffers badly from this defect, and a lot of effort is spent trying to remedy it. Using languages that allow automatic interpretation of content can solve this problem at a fundamental level.

It is the goal of the Semantic Web to make information on the web machine-processable. A lot of things which currently need human aid can be automated in this way, e.g. making a synthesis of information that is found in different documents, filtering the results of a search engine, etc.

Tim Berners-Lee - director of the W3C -, James Hendler - responsible for agent-based computing research at DARPA - and Ora Lassila - chief scientist of Nokia and co-author of the RDF specification - describe a hypothetical situation – automating the process of making an appointment with a physician - that could be real when the Semantic Web is ready. To bring this situation to a favourable conclusion, the authors use software agents, which roam the Semantic Web looking for information. The Semantic Web will be the natural habitat of software agents.