|
|
| Home | CV | Publications |
Research Papers | Events | Useful Links |
|
|
|
The semantic
web
Information World Review - Dec 02
Introduction | Future of search | XML | RDF | Ontologies | Intelligent agents | The global brain | Video When the web starts
thinking for itself Meaningful machine manipulation
The semantic web is based on established technologies such as XML, RDF, Ontologies and Intelligent Agents.
XML defined
Another technology, the Resource Description Framework (RDF) provides meaning to structure of XML documents. Just as in human language where meaning is expressed in a sentence composed of subject, verb and object, RDF helps express meaning and relationships between different web pages and concepts through a programming structure of things, properties and values. For example, David Green (thing) is the author of (property) this and other IWR articles (value). Subject, object and verb (or thing, property and value) are encoded in the document through a Uniform Resource Identifier (URI) which ensures that the words on the document are linked to a unique definition that everyone can access on the web. This enables much greater data interchange between systems.
However, whilst RDF allows a publisher to inform a visiting computer which terms it has used to tag the content in a document, different publishers will use different terms/identifiers to express the same concept. Ontologies provide a deeper level of meaning by providing equivalence relations between terms (i.e. term A on my web page is expressing the same concept as term B on your web page). An ontology is a file that formally defines relations among terms e.g. a taxonomy and set of inference rules. By providing such 'dictionaries of meaning' (in philosophy ontology means 'nature of existence') ontologies can improve the accuracy of web searches by allowing a search program to seek out pages that refer to a specific concept rather than just a particular term. back to the top
Whilst XML, RDF and Ontologies provide the basic infrastructure of the semantic web, it is intelligent agents that will realise its power. An intelligent agent can be best described as adaptive computer coding capable of reasoning and that learns from our behaviours and preferences (thus delivering 'proactive personalisation'). There are many thousands of different agents (or bots as they are also known as), each performing specific, specialised tasks (search bots, chatter bots, shopping bots etc). An important aspect of agents is that they are sociable - they can interact and communicate with humans and other agents. In the semantic web, different agents work together to create an 'information value chain' in which the user's search request is 'packet processed' through sub assemblies of information passed between agents - each adding value to construct the user's answer. A user will issue a high-level information request. An intelligent agent will then analyse this request and delegate it to other appropriate agents and services that it has each identified through service directory ads on the web. These agents will distil large amounts of data distributed across the web and progressively reduce it to a small amount of high-value customised information - in other words the answer! When broken down into a series of explicit search statements, and appropriate content sources to search, a simple user information request is revealed to be a complex task. Automating such tasks will result in an ever-larger role for Artificial Intelligence technologies such as agents. One key concern is the autonomy vs accountability of intelligent agents. How much information about our behaviours and content preferences do they digress to other agents, databases and systems? There is a need to construct boundaries such as user-determined privacy settings to safely contain such interactions. Similarly agents will need to authenticate the veracity of content sources and other agents they meet through the use of digital signatures - this is of particular concern when much future crime will involve the more profitable theft of personal details rather than artefacts. Recognising this, the Joint Research Centre of the European Commission is building an experimental privacy protection agent using semantic web technology and W3 Consortium's P3P privacy protocol. The agent will automate the process of protecting a user's privacy partially by comparing privacy policies and user's privacy preferences. back to the top.
The global brain The Open Directory project's slogan is 'Humans do it better'. Tim Berners Lee's vision is that the semantic web will do it better ('it' being low-level information discovery and exchange), thus enabling humans to do better things. This symbiotic intelligence of people, plus computers, plus AI agents offering immediate access to humanity's collective knowledge does sound somewhat utopian. Equally it raises the spectre of a self-adaptive intelligence that quickly surpasses our ability to comprehend it. Would such a global brain act as a digital dictator to whom individuals have a secondary role to society's demands? Two papers published in the 9th September 99 issue of the scientific journal Nature, revealed the Internet appeared to be 'evolving' rather than following the expected model of random inanimate networks. Quoted in the June 2000 issue of New Scientist, Daniel Dennett, director of the Centre for Cognitive Studies at the University of Medford, Massachusetts, commented 'the global communication network is already capable of complex behaviour that defies the efforts of human experts to comprehend'. The semantic web may act as a 'collective memory' augmenting individual brain power and accelerating the pace of human learning and discovery, but we will need to careful about controlling its development and our dependence on it if wish to avoid a dystopian digital dictator scenario. You Tube video
Related articles: Information World Review is Europe's leading information industry publication. This article is reprinted in its entirety with permission from Learned Information Europe Ltd. All material copyright Learned Information Europe Ltd. |
| Back to the top |