Gooruze

First VisitRegister with GooruzeLog in to Gooruze
 
   
 

Article Rating

AverageAverageAverageAverageAverage 3.09 OK from 2 votes
(71 Visits)
 
 

Semantic Web: Garbage in is garbage out

by SearchForensics Pupil(October 2007) (rank 209th)
 
 

What is the Semantic Web?

The Semantic Web consists of a grouping of technologies and standards for data interchange that make the content of the Web easier to access and interrelate by machines and easier to process by both machines and humans. The vision is to fulfill more of the Web’s potential by allowing data to be shared effectively by wider communities.

According to W3C:

“The Semantic Web is about two things. It is about common formats for integration and combination of data drawn from diverse sources, where on the original Web mainly concentrated on the interchange of documents. It is also about language for recording how the data relates to real world objects. That allows a person, or a machine, to start off in one database, and then move through an unending set of databases which are connected not by wires but by being about the same thing.”

Benefits

As the vision of the Semantic Web is realized, the benefits are quite significant. Both commercial and non-commercial enterprises stand to gain from:

  • Interconnectivity of diverse data sources,
  • Gleaning more usable and actionable information from raw data,
  • The blending of machine categorization of information with human insight and expertise, and
  • Improved functionality of Internet and intranet search engines.

Challenges

One of the main challenges the Semantic Web faces is in the role that people play. This is the area of human contributions to help explain to machines the relationships between data. There is a tendency to put excessive trust in `computerized' data, and a propensity for individuals to accept blindly whatever comes from the computer.

You may be familiar with GIGO (Garbage In is Garbage Out). As input and contributions come from a multitude of users via tagging, behavioral data, and other forms, there will be at least three issues that will need attention:

* Incorrect tagging,

* Malicious tagging, and

* Spam tagging.

To the extent that results returned to an Internet user are influenced by the relevant input from other users, those results may become skewed due to any of (or a combination of any of) those factors. Steps will have to be taken to address these potential problems as much as possible, both preventative and corrective measures.

Related social implications

To the extent that search results returned to an Internet user are influenced by the relevant data from other individuals, do we lose factual accuracy or objectivity in the search results? This is one example of a potentially negative influence from the human element of the semantic web.

In one form of postmodernism or pragmatism, meaning is a product of whatever linguistic community you're in and there is nothing beyond that which you should seek because there is nothing beyond that to be had – no truth with a capital T. In the semantic web, are the contributors akin to the linguistic community and the accuracy of the results from your search akin to the postmodern notion of meaning; no facts with a capital F, no objectivity, no Truth in advertising?

"There is nothing either good or bad but thinking tagging makes it so" – William Shakespeare [Hamlet Act II, Sc. II] (modified).

Analysts often discuss the impact of the Internet and email on culture. There may arise similar discussions about the impact of the semantic web on culture as the information that people find, hold on to, and make use of may be viewed as accurate and true, all the while, it is only a product of the collective musings, however ill-informed, of the masses. Conversely, there will most likely be some very well-informed areas of the semantic web as well; this could create quite a disparity in the quality of and therefore benefit from the semantic web in that regard.

 
 

Any contributed content above is the subjective opinion of that member or external author, and not of Gooruze.com Pty Ltd. View our House Rules for more details.

 
 

Related Articles

No related articles available

Bookmarks

No bookmarks available

 
 

Related keywords: accuracy, internet, search, semantic, sources, tagging, web

 
  ARTICLE RATING
AverageAverageAverageAverageAverage 3.09 OK from 2 votes
 
 

Thankyou for your vote (you can change your vote at any time). Please leave some helpful comments about this article using the box below.

 
 

Help us rank this article

Vote: ExcellentExcellentExcellentExcellentExcellent
Vote: GoodGoodGoodGoodGood
Vote: AverageAverageAverageAverageAverage
Vote: PoorPoorPoorPoorPoor
Vote: Very PoorVery PoorVery PoorVery PoorVery Poor
 
 

Add a comment

 
 
Add a comment on this article.
 
 

Comments

 
   
 

Invite someone to Gooruze

Home | Read News | Post News | Read Articles | Write Articles | Q & A | Groups | Activity | Members | More

Privacy Policy | House Rules | About Us | Contact Us | House Blog | FAQ

© Copyright 2007 Gooruze ™ | Built by Market United