Two Dutch researchers analyse striking behaviour of web surfers
What behaviour do website visitors exhibit? Do they buy a specific product mainly on Mondays? Do they always return at a certain time of day?
Recognising and exploiting such patterns is lucrative for companies. Edgar de Graaf discovered that interesting patterns often contain a time aspect. Jeroen De Knijf developed methods to detect relevant patterns more quickly.
In the jargon of the field this is called data mining: looking for interesting relationships within large quantities of data. Many data-mining programs produce a flood of potentially interesting patterns, so how can a user find what he or she is looking for? Furthermore, the data are not always organised for such searches, as is the case on the Internet or, for instance, in bioinformatics. These are usually semi-structured files: they often contain hyperlinks to other files, for example, and hold (partial) information in a range of formats, such as text, images and sound.
Edgar de Graaf and Jeroen De Knijf both worked within the NWO-funded MISTA project (Mining in Semi-Structured Data) on methods to find patterns more quickly and effectively within large quantities of semi-structured data. De Graaf discovered that some patterns are interesting because they occur in quick succession. Other patterns are striking because, for example, they occur weekly. According to De Graaf, this time aspect merits further investigation.
The patterns can best be presented visually so that the user can find the information sought at a glance. To this end, De Graaf described various ways of presenting different types of information.
De Knijf demonstrated that the number of patterns can be drastically reduced by allowing the user to indicate in advance the minimum requirements that a pattern must satisfy. This allows the data-mining program to find the interesting patterns much faster.
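The idea of stating minimum requirements in advance can be illustrated with a small sketch. It is not De Knijf's own method, which the article does not describe in detail; it uses a generic Apriori-style search in which the only requirement is a minimum number of occurrences. The function name `frequent_itemsets`, the parameters `min_support` and `max_size`, and the example visitor sessions are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(sessions, min_support, max_size=3):
    """Return page sets that occur together in at least min_support sessions.

    Hypothetical helper: candidates that cannot meet the minimum
    requirement are discarded before they are ever counted.
    """
    sessions = [frozenset(s) for s in sessions]

    # Level 1: count single pages and keep only the frequent ones.
    counts = Counter(frozenset([page]) for s in sessions for page in s)
    current = {p: n for p, n in counts.items() if n >= min_support}
    result = dict(current)

    size = 2
    while current and size <= max_size:
        # Build larger candidates only from pages that are still frequent,
        # and drop any candidate that has an infrequent subset.
        pages = sorted({page for itemset in current for page in itemset})
        candidates = [
            frozenset(c)
            for c in combinations(pages, size)
            if all(frozenset(sub) in current for sub in combinations(c, size - 1))
        ]
        counts = Counter()
        for s in sessions:
            for c in candidates:
                if c <= s:
                    counts[c] += 1
        current = {c: n for c, n in counts.items() if n >= min_support}
        result.update(current)
        size += 1
    return result


# Invented example data: pages visited in four sessions.
sessions = [
    {"home", "cart", "pay"},
    {"home", "cart"},
    {"home", "help"},
    {"cart", "pay"},
]
patterns = frequent_itemsets(sessions, min_support=2)
```

With a minimum of two occurrences, the page "help" is dropped in the first round, so no larger pattern containing it is ever examined. This is where the speed-up comes from: the stricter the requirement, the fewer candidates the program has to count.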
A second method De Knijf devised to reduce the number of results is the compression of the entire collection of documents (for example, Wikipedia pages) into a single document. By building accurate models that only make use of the compressed document, De Knijf was able to demonstrate that this summary does indeed contain the essential information from the entire collection.
The research was funded from the Open Competition 2003 of NWO Physical Sciences.
Kim van den Wijngaard | alfa