Information Technology

02.06.2009

Better and faster search engines

The goal of all search engines is to attain the most relevant responses as quickly as possible. When search engines calculate their search results, they are steered by an algorithm that assigns higher or lower values to features of Web pages. The most common search engines on the Net, such as Google, generate a gigantic single ranking based on a search of all pages available on the Net.

The algorithm that Ola Ågren has developed ranks pages, instead, on the basis of each relevant starting page, and includes pages that are directly or indirectly linked to by the starting page. Then a normalised mean value of the relevance of the various pages is calculated. A page that has links to it from several different pages is therefore assigned a higher value than those that are found only once. In this way it is faster to find pages of interest. For ordinary standard algorithms it takes more than seven days to go through and rank Web pages in a certain database. Using his algorithm, Ola Ågren has managed to do this in 158 seconds.

What's more, his algorithm has proven to yield the most relevant responses. He studied the relevance of hits in the top ten lists for three different algorithms: the one he developed and two variants of PageRank, the algorithm used by Google. He examined a total of 100 different expressions for all Nordic languages and English, including the expression master of engineering science (civlingeniör in Swedish). The top ten lists always had some form of overlapping between the different algorithms, but they were never completely identical. Users were then asked to judge the relevance of the various hits, without knowing which search engines had generated the alternative responses.

“The users in the study found that the search engine I developed is better than the others in more than 60 percent of cases,” says Ola Ågren.

Besides search engines, the dissertation is also about methods for finding structures in huge masses of information, such as keywords and methods for extracting free text, such as parts of the documentation from the source code.

Dissertation title: Finding, Extracting and Exploiting Structure in Text and Hypertext.

For further information, please contact: Ola Ågren, Department of Computing Science; Cell phone: +46 (0)730-283852; E-mail: ola.agren@cs.umu.se

Pressofficer Karin Wikman, karin.wikman@adm.umu.se or +46-70 6100805

Media Contact

Karin Wikman idw

More Information:

http://www.vr.se http://umu.diva-portal.org/smash/record.jsf?searchId=1&pid=diva2:214648

All latest news from the category: Information Technology

Here you can find a summary of innovations in the fields of information and data processing and up-to-date developments on IT equipment and hardware.

This area covers topics such as IT services, IT architectures, IT management and telecommunications.

Trotting robots reveal emergence of animal gait transitions

30.04.2024 / Information Technology

Combining robotics and ChatGPT

30.04.2024 / Information Technology

Airborne single-photon lidar system achieves high-resolution 3D imaging

26.04.2024 / Information Technology

AI tool creates ‘synthetic’ images of cells

23.04.2024 / Information Technology

Back to home

Comments (0) Cancel reply

Newest articles

Physics and Astronomy

Webb captures top of iconic horsehead nebula in unprecedented detail

NASA’s James Webb Space Telescope has captured the sharpest infrared images to date of a zoomed-in portion of one of the most distinctive objects in our skies, the Horsehead Nebula….

02.05.2024

Materials Sciences

Cost-effective, high-capacity, and cyclable lithium-ion battery cathodes

Charge-recharge cycling of lithium-superrich iron oxide, a cost-effective and high-capacity cathode for new-generation lithium-ion batteries, can be greatly improved by doping with readily available mineral elements. The energy capacity and…

02.05.2024

Life Sciences and Chemistry

Novel genetic plant regeneration approach

…without the application of phytohormones. Researchers develop a novel plant regeneration approach by modulating the expression of genes that control plant cell differentiation. For ages now, plants have been the…

02.05.2024

News and reports

Latest News

Webb captures top of iconic horsehead nebula in unprecedented detail

Cost-effective, high-capacity, and cyclable lithium-ion battery cathodes

Novel genetic plant regeneration approach

Roadmap to close the carbon cycle

Better and faster search engines

Media Contact

More Information:

Trotting robots reveal emergence of animal gait transitions

Combining robotics and ChatGPT

Airborne single-photon lidar system achieves high-resolution 3D imaging

AI tool creates ‘synthetic’ images of cells

Comments (0) Cancel reply

Newest articles

Webb captures top of iconic horsehead nebula in unprecedented detail

Cost-effective, high-capacity, and cyclable lithium-ion battery cathodes

Novel genetic plant regeneration approach

Partners & Sponsors