Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

Effective search terms yield the right information

18.10.2010
It does not matter how good a search engine is if the person doing a search does not ask for the desired information in the right way. So far, a great deal of the research on information retrieval has aimed to develop search algorithms and powerful search engines. Yet, a new doctoral thesis on natural language processing from the University of Gothenburg shows that it is also important to look at the terms people type into the search box.

’Users usually know what kind of information they are looking for, but they don’t know what question to ask. The problem these days is not for the search engine to locate the right documents but to make the most relevant texts end up towards the top of the list,’ says the author of the thesis Karin Friberg Heppin.

Friberg Heppin used a database of medical texts written in Swedish to explore what makes a search term effective or ineffective. What are the features of good search terms and what characterises bad ones?

Today patients often find their own information on the internet, both before and after seeing a doctor. However, not all documents are easily understood by a lay person. Doctors surf for information too, but won’t find much new in popular science texts.

’The language differs between texts written for doctors and texts written for patients. People can use these differences to find the types of documents they want, with respect to both subject and target group,’ says Friberg Heppin.

Her point is that if a doctor does a search for, say, the word flu, he or she will not find many texts of interest. Yet, a search for the word influenza will yield more texts that suit the needs of doctors.

Another difficulty arises when the used search term is only available in a text as a compound word, or vice versa. For example, if a Swedish user types in the word diabetes (=diabetes), the search engine will not catch a text that only includes the compound word diabetesbehandling (=diabetes treatment).

‘This type of problem is more common in Swedish than in English since compound words are rare in English compared to in Swedish. The fact that almost all information retrieval research has focused on English, a language with entirely different inherent problems, suggests that more Swedish research in the area is essential,’ says Friberg Heppin, who points to the importance of the field of linguistics in this context.

’Information retrieval is a multidisciplinary subject where the focus has traditionally been on information and computer science. It’s time for linguists to start contributing to improved search effectiveness,’ says Friberg Heppin.

The thesis have been successfully defended.

For further information, please contact: Karin Friberg Heppin
Tel.: +46 (0)31 786 45 49

E-mail: karin.friberg@svenska.gu.se

Helena Aaberg | idw
Further information:
http://hdl.handle.net/2077/22709

More articles from Communications Media:

nachricht Between filter bubbles, uneven visibility and transnationality
06.12.2017 | Schweizerischer Nationalfonds SNF

nachricht New Technologies for A/V Analysis and Search
13.04.2017 | Fraunhofer-Institut für Digitale Medientechnologie IDMT

All articles from Communications Media >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Error-free into the Quantum Computer Age

A study carried out by an international team of researchers and published in the journal Physical Review X shows that ion-trap technologies available today are suitable for building large-scale quantum computers. The scientists introduce trapped-ion quantum error correction protocols that detect and correct processing errors.

In order to reach their full potential, today’s quantum computer prototypes have to meet specific criteria: First, they have to be made bigger, which means...

Im Focus: Search for planets with Carmenes successful

German and Spanish researchers plan, build and use modern spectrograph

Since 2016, German and Spanish researchers, among them scientists from the University of Göttingen, have been hunting for exoplanets with the “Carmenes”...

Im Focus: First-of-its-kind chemical oscillator offers new level of molecular control

DNA molecules that follow specific instructions could offer more precise molecular control of synthetic chemical systems, a discovery that opens the door for engineers to create molecular machines with new and complex behaviors.

Researchers have created chemical amplifiers and a chemical oscillator using a systematic method that has the potential to embed sophisticated circuit...

Im Focus: Long-lived storage of a photonic qubit for worldwide teleportation

MPQ scientists achieve long storage times for photonic quantum bits which break the lower bound for direct teleportation in a global quantum network.

Concerning the development of quantum memories for the realization of global quantum networks, scientists of the Quantum Dynamics Division led by Professor...

Im Focus: Electromagnetic water cloak eliminates drag and wake

Detailed calculations show water cloaks are feasible with today's technology

Researchers have developed a water cloaking concept based on electromagnetic forces that could eliminate an object's wake, greatly reducing its drag while...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

See, understand and experience the work of the future

11.12.2017 | Event News

Innovative strategies to tackle parasitic worms

08.12.2017 | Event News

AKL’18: The opportunities and challenges of digitalization in the laser industry

07.12.2017 | Event News

 
Latest News

Error-free into the Quantum Computer Age

18.12.2017 | Physics and Astronomy

Disarray in the brain

18.12.2017 | Studies and Analyses

2 million euros in funding for new MR-compatible electrophysiological brain implants

18.12.2017 | Medical Engineering

VideoLinks
B2B-VideoLinks
More VideoLinks >>>