The goal of all search engines is to attain the most relevant responses as quickly as possible. When search engines calculate their search results, they are steered by an algorithm that assigns higher or lower values to features of Web pages. The most common search engines on the Net, such as Google, generate a gigantic single ranking based on a search of all pages available on the Net.
The algorithm that Ola Ågren has developed ranks pages, instead, on the basis of each relevant starting page, and includes pages that are directly or indirectly linked to by the starting page. Then a normalised mean value of the relevance of the various pages is calculated. A page that has links to it from several different pages is therefore assigned a higher value than those that are found only once. In this way it is faster to find pages of interest. For ordinary standard algorithms it takes more than seven days to go through and rank Web pages in a certain database. Using his algorithm, Ola Ågren has managed to do this in 158 seconds.
What's more, his algorithm has proven to yield the most relevant responses. He studied the relevance of hits in the top ten lists for three different algorithms: the one he developed and two variants of PageRank, the algorithm used by Google. He examined a total of 100 different expressions for all Nordic languages and English, including the expression master of engineering science (civlingeniör in Swedish). The top ten lists always had some form of overlapping between the different algorithms, but they were never completely identical. Users were then asked to judge the relevance of the various hits, without knowing which search engines had generated the alternative responses.
"The users in the study found that the search engine I developed is better than the others in more than 60 percent of cases," says Ola Ågren.
Besides search engines, the dissertation is also about methods for finding structures in huge masses of information, such as keywords and methods for extracting free text, such as parts of the documentation from the source code.
Dissertation title: Finding, Extracting and Exploiting Structure in Text and Hypertext.
For further information, please contact: Ola Ågren, Department of Computing Science; Cell phone: +46 (0)730-283852; E-mail: firstname.lastname@example.org
Pressofficer Karin Wikman, email@example.com or +46-70 6100805
Karin Wikman | idw
Cutting edge research for the industries of tomorrow – DFKI and NICT expand cooperation
21.03.2017 | Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, DFKI
Molecular motor-powered biocomputers
20.03.2017 | Technische Universität Dresden
Astronomers from Bonn and Tautenburg in Thuringia (Germany) used the 100-m radio telescope at Effelsberg to observe several galaxy clusters. At the edges of these large accumulations of dark matter, stellar systems (galaxies), hot gas, and charged particles, they found magnetic fields that are exceptionally ordered over distances of many million light years. This makes them the most extended magnetic fields in the universe known so far.
The results will be published on March 22 in the journal „Astronomy & Astrophysics“.
Galaxy clusters are the largest gravitationally bound structures in the universe. With a typical extent of about 10 million light years, i.e. 100 times the...
Researchers at the Goethe University Frankfurt, together with partners from the University of Tübingen in Germany and Queen Mary University as well as Francis Crick Institute from London (UK) have developed a novel technology to decipher the secret ubiquitin code.
Ubiquitin is a small protein that can be linked to other cellular proteins, thereby controlling and modulating their functions. The attachment occurs in many...
In the eternal search for next generation high-efficiency solar cells and LEDs, scientists at Los Alamos National Laboratory and their partners are creating...
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are less stable. Now researchers at the Technical University of Munich (TUM) have, for the first time ever, produced a composite material combining silicon nanosheets and a polymer that is both UV-resistant and easy to process. This brings the scientists a significant step closer to industrial applications like flexible displays and photosensors.
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are...
Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to simulate these confined natural conditions in artificial vesicles for the first time. As reported in the academic journal Small, the results are offering better insight into the development of nanoreactors and artificial organelles.
Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to...
20.03.2017 | Event News
14.03.2017 | Event News
07.03.2017 | Event News
24.03.2017 | Materials Sciences
24.03.2017 | Physics and Astronomy
24.03.2017 | Physics and Astronomy