Forum for Science, Industry and Business
Sponsored by:     Siemens  n-tv 
Search our Site:

Topic (optional):

 

Home Reports Information Technology Content

Search engine makes social calls

next article
07.03.2002

 


New algorithm exploits community structure of the web.


The web has spontaneously organized itself into communities. A new search algorithm that pinpoints these could help surfers find what they want and avoid offensive content.

Page builders can link anywhere. But they don’t, Gary Flake, of the NEC Research Institute in Princeton, and his colleagues have found. Instead, pages congregate into social groups that focus most of their attention on each other.

Web directories compiled by hand, such as Yahoo!, recognize this to an extent. Flake’s team has automated the process. "We find extremely high-quality sites that Yahoo! and Google don’t know about," he says.

The new search ignores a page’s text, looking only at its links. It crawls from a starting page to others it links to, and so on out into the web, picking out islands of expertise in the sea of information.

A test of the algorithm starting at the home pages of biologist Francis Crick, astrophysicist Stephen Hawking, and computer scientist Ronald Rivest yielded groups of sites that are tightly focused on each researcher’s life, work and field. "The sites are remarkably topically related - the clusters’ properties are completely intuitive," says Flake.

"You can extract a lot of meaning from links," agrees Mike Thelwall, who studies search engines at the University of Wolverhampton, UK. The new approach is, he says, a clever way to find meaningful groups among the effectively infinite number of ways to subdivide linked pages.

The first application of community searching may be to fence off areas of the web such as pornography or hate-speech communities, says Flake. Current content filters are largely text-based; these are easy to dodge and require intensive human management.

Community service

Google pioneered the use of links to deduce pages’ relevance. Its PageRank technology counts a link from site A to site B as a vote for B from A. But it does not take account of all the other sites to which A has links, as NEC’s new technique does.

Flake does not expect to displace the market leader - "Google’s a great search engine," he says. Rather, he wants to add an extra dimension to searching.

Using link structures could lead to more efficient, customized searching, particularly for scientists, who are careful to link to each other’s pages. "For academics it’s going to be a big improvement," says Thelwall.

Flake’s team is now mapping out communities in the web as a whole, without using a starter page. This could detect hitherto unsuspected communities, says computer scientist and network researcher Jon Kleinberg of Cornell University in Ithaca, New York.

"It could bring together people with common interests that may not know of each other’s existence. You could also catch the early stages of new trends," says Kleinberg.

References

  1. Flake, G. W., Lawrence, S., Giles, C. L. & Coetzee, F. Self-organization and identification of communities. IEEE Computer, 35, 66 - 71, (2002).

JOHN WHITFIELD | Source: © Nature News Service

next article

More articles from Information Technology:

nachricht SMOS satellite instrument comes alive
23.11.2009 | European Space Agency

nachricht Search engines are source of learning
23.11.2009 | Penn State

All articles from Information Technology >>>

B2B Search

Product / Service
Company / Organisation

Latest News

UCSB physicists move 1 step closer to quantum computing

23.11.2009 | Physics and Astronomy

Fat around the middle increases the risk of dementia

23.11.2009 | Studies and Analyses

New discovery about the formation of new brain cells

23.11.2009 | Health and Medicine

VideoLinks

Event News

Multidisciplinary meeting on Urological Cancers aims to benefit cancer patients

20.11.2009 | Event News

'Golden Age' for clinical psychology in Northern Ireland

20.11.2009 | Event News

New Perspectives in Marine Anti-Fouling Research

11.11.2009 | Event News