With predictive modeling techniques, it is possible to predict anything from clients’ shopping habits and illnesses to a golfer’s handicap. The only prerequisite is to have enough examples. In a doctoral thesis from the University of Borås in Sweden, Rikard König has adapted the technique of genetic programming so it can be used for such purposes.
The doctoral thesis, Enhancing Genetic Programming for Predictive Modeling, is about machine learning, more specifically predictive modeling, a field of computer science. Machine learning entails getting a computer to learn something, to become intelligent. Predictive modeling is a broad area of machine learning where a computer learns things on the basis of positive and negative examples, finds connections and explains why things turn out in a certain way.
Within predictive modeling, there is an array of techniques that are used to produce models that can predict practically anything, for instance, how people might be expected to respond to advertisements. Since these are general techniques, it is possible to predict just about anything as long as there are enough previous examples, i.e. sufficient information. The goal of predictive modeling is to find an accurate model and preferably one that explains something that was not previously known.
Genetic programming (GP) is a general optimization technique based on Darwin’s theories of evolution and natural selection. It was not originally designed for predictive modeling.
“In my thesis, I present several improvements that increase the accuracy and comprehensibility of models created with GP. There are many researchers who work with GP, but my solutions are unique,” says Rikard König, PhD student at the School of Business and IT at the University of Borås.
In order to produce a model with the help of GP, you start off with, say, a thousand randomly chosen models and let them compete with each other. You work out how many errors the models make on known examples and then base a natural selection on the results. The most accurate models have a greater chance of surviving and having “children” – you pair off two models. These “children” are then a combination of their parents and form a new generation which is hopefully stronger. A small number of models can also be subjected to mutation, just like in nature.
“The new generation is assessed in the same way, using the known examples. They compete, pair off and give rise to an even stronger new generation. The process is repeated until a sufficiently accurate model has been found. The fascinating thing is that evolution is such a powerful way of searching through all possible solutions,” says Rikard König.
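The evolutionary loop described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the implementation from the thesis: the model representation (small decision trees stored as nested tuples), the tournament selection, the practice of carrying the best model over unchanged, and all function names and parameter values are assumptions made for the example. The population is also much smaller than the thousand models mentioned above, to keep the run short.

```python
import random

def random_tree(rng, n_features, depth=3):
    """Grow a random model: a leaf is a class label (0 or 1),
    an inner node is (feature index, threshold, left, right)."""
    if depth == 0 or rng.random() < 0.3:
        return rng.randint(0, 1)
    return (rng.randrange(n_features), rng.random(),
            random_tree(rng, n_features, depth - 1),
            random_tree(rng, n_features, depth - 1))

def predict(tree, x):
    """Walk down the tree until a leaf (class label) is reached."""
    while isinstance(tree, tuple):
        feature, threshold, left, right = tree
        tree = left if x[feature] < threshold else right
    return tree

def errors(tree, examples):
    """Number of known examples the model gets wrong (lower is better)."""
    return sum(predict(tree, x) != y for x, y in examples)

def paths(tree, path=()):
    """Paths to every node, expressed as tuple indices (2 = left, 3 = right)."""
    found = [path]
    if isinstance(tree, tuple):
        found += paths(tree[2], path + (2,)) + paths(tree[3], path + (3,))
    return found

def get(tree, path):
    for index in path:
        tree = tree[index]
    return tree

def replace(tree, path, new):
    """Return a copy of tree with the node at path swapped for new."""
    if not path:
        return new
    parts = list(tree)
    parts[path[0]] = replace(tree[path[0]], path[1:], new)
    return tuple(parts)

def crossover(rng, mother, father):
    """The 'child' is the mother with one subtree taken from the father."""
    donor = get(father, rng.choice(paths(father)))
    return replace(mother, rng.choice(paths(mother)), donor)

def mutate(rng, tree, n_features):
    """Swap one randomly chosen subtree for a freshly grown one."""
    return replace(tree, rng.choice(paths(tree)), random_tree(rng, n_features, 2))

def select(rng, population, examples, k=3):
    """Tournament selection (assumed): the most accurate of k random models wins."""
    return min(rng.sample(population, k), key=lambda t: errors(t, examples))

def evolve(examples, n_features, pop_size=100, generations=20,
           mutation_rate=0.05, seed=0):
    rng = random.Random(seed)
    population = [random_tree(rng, n_features) for _ in range(pop_size)]
    best = min(population, key=lambda t: errors(t, examples))
    for _ in range(generations):
        if errors(best, examples) == 0:   # sufficiently accurate: stop
            break
        children = [best]                 # assumption: keep the best model
        while len(children) < pop_size:
            child = crossover(rng, select(rng, population, examples),
                              select(rng, population, examples))
            if rng.random() < mutation_rate:
                child = mutate(rng, child, n_features)
            children.append(child)
        population = children
        best = min(population, key=lambda t: errors(t, examples))
    return best

# Toy data (made up for the example): the label is 1 when the first feature exceeds 0.5.
data_rng = random.Random(42)
examples = []
for _ in range(60):
    x = (data_rng.random(), data_rng.random())
    examples.append((x, int(x[0] > 0.5)))

best = evolve(examples, n_features=2)
print("errors of best model:", errors(best, examples))
```

Because the best model of each generation is carried over unchanged in this sketch, the error count of the best model can never get worse from one generation to the next.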
GP has several properties that make it suitable for predictive modeling. One example is that the search is independent of the representation of the model. This means that the exact representation and way of measuring errors can be adapted to individual problems. This is not normally the case with traditional predictive techniques. At the same time, the technique is problematic when a highly complex model is needed since the search goes through all possible solutions and the number of solutions increases exponentially with the complexity of the models.
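To give a feeling for how quickly the number of candidate models grows, consider a rough count under simplifying assumptions that are not taken from the thesis: every model is a complete binary decision tree of depth $d$, each of its $2^d - 1$ inner nodes picks one of $F$ features and one of $T$ thresholds, and each of its $2^d$ leaves picks one of $C$ classes.

```latex
\[
  N(d) = (F \cdot T)^{\,2^{d}-1} \cdot C^{\,2^{d}}
\]
% Illustrative values: F = 10, T = 10, C = 2, d = 3
\[
  N(3) = 100^{7} \cdot 2^{8} = 2.56 \times 10^{16}
\]
```

Under these assumptions the count is exponential in the number of nodes, so each additional level of depth multiplies the search space enormously.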
“One of my improvements is a hybrid technique for creating an accurate and comprehensible model when the search space is extremely large, i.e. when a model with high complexity is required. The solution is to send relatively strong models created by a traditional predictive technique into a generation to guide the search in a promising direction.”
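The idea of injecting externally built models into a generation can be illustrated with a deliberately tiny Python sketch. Everything in it is an assumption made for the example and not the method from the thesis: a model is just a single threshold, the “traditional technique” is stood in for by an exhaustive search for the best threshold (a decision stump), and the injected model replaces the weakest member of the generation.

```python
import random

def error_count(threshold, examples):
    """Errors made by the toy model 'predict 1 when x > threshold'."""
    return sum((x > threshold) != bool(y) for x, y in examples)

def stump_threshold(examples):
    """Stand-in for a traditional technique: try every observed value
    as the threshold and keep the one with the fewest errors."""
    candidates = [x for x, _ in examples]
    return min(candidates, key=lambda t: error_count(t, examples))

def inject(population, strong_models, examples):
    """Replace the weakest members of a generation with strong models,
    keeping the size of the generation unchanged."""
    ranked = sorted(population, key=lambda m: error_count(m, examples))
    keep = max(len(ranked) - len(strong_models), 0)
    return ranked[:keep] + list(strong_models)[:len(ranked)]

# Toy data (made up): the label is 1 when x exceeds 0.62.
rng = random.Random(1)
examples = [(x, int(x > 0.62)) for x in (rng.random() for _ in range(200))]

generation = [rng.random() for _ in range(20)]   # randomly chosen models
strong = stump_threshold(examples)               # model from the stand-in learner
seeded = inject(generation, [strong], examples)

print("best before:", min(error_count(m, examples) for m in generation))
print("best after: ", min(error_count(m, examples) for m in seeded))
```

In a full GP run, the seeded generation would then go through selection, pairing and mutation as usual, so the injected model’s structure can spread to its “children”.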
As part of his research, Rikard König has also developed an application that implements his research results. The programme can be downloaded from www.grex.se.
Rikard König is working on several research projects where these solutions may be put to use. For instance, one project is in collaboration with Scania where data from tens of thousands of lorries have been saved and will be analysed in order to explain what effect the driver has on fuel consumption. Another example, which also shows how generic the technique is, is a new project where golf swings from 500 golfers will be analysed. Here, the aim is to find general explanations for what distinguishes good swings from bad swings. Another aim is to be able to automatically recommend exercises for individual golfers on the basis of each person’s particular needs.
Thesis title: Enhancing Genetic Programming for Predictive Modeling
Contact: Rikard König, firstname.lastname@example.org, +46-33 435 5945, +46-73922 96 56
Press officer: Anna Kjellsson, email@example.com, +46-734 61 20 01
Anna Kjellsson | idw - Informationsdienst Wissenschaft