Toward new solar cells with active learning

Visualization of the chemical space explored so far.
Credit: © Kunkel/FHI

How can I prepare myself for something I do not yet know?

Scientists from the Fritz Haber Institute in Berlin and from the Technical University of Munich have addressed this almost philosophical question in the context of machine learning. Learning is no more than drawing on prior experience. In order to deal with a new situation, one needs to have dealt with roughly similar situations before.

In machine learning, this correspondingly means that a learning algorithm needs to have been exposed to roughly similar data. But what can we do if there is a nearly infinite amount of possibilities so that it is simply impossible to generate data that covers all situations?

This problem comes up a lot when dealing with an endless number of possible candidate molecules. Organic semiconductors enable important future technologies such as portable solar cells or rollable displays. For such applications, improved organic molecules – which make up these materials – need to be discovered.

Tasks of this nature are increasingly using methods of machine learning, while training on data from computer simulations or experiments. The number of potentially possible small organic molecules is, however, estimated to be on the order of 1033. This overwhelming number of possibilities makes it practically impossible to generate enough data to reflect such a large material diversity. In addition, many of those molecules are not even suitable for organic semiconductors. One is essentially looking for the proverbial needle in a haystack.

In their work published recently in Nature Communications the team around Prof. Karsten Reuter, Director of the Theory Department at the Fritz-Haber-Institute, addressed this problem using so-called active learning. Instead of learning from existing data, the machine learning algorithm iteratively decides for itself which data it actually needs to learn about the problem.

The scientists first carry out simulations on a few smaller molecules, and obtain data related to the molecules’ electrical conductivity – a measure of their usefulness when looking at possible solar cell materials. Based on this data, the algorithm decides if small modifications to these molecules could already lead to useful properties or whether it is uncertain due to a lack of similar data.

In both cases, it automatically requests new simulations, improves itself through the newly generated data, considers new molecules, and goes on to repeat this procedure. In their work, the scientists show how new and promising molecules can efficiently be identified this way, while the algorithm continues its exploration into the vast molecular space, even now, at this very moment. Every week new molecules are being proposed that could usher in the next generation of solar cells and the algorithm just keeps getting better and better.

Media Contact

Agatha Frischmuth
Fritz Haber Institute of the Max Planck Society

All latest news from the category: Life Sciences and Chemistry

Articles and reports from the Life Sciences and chemistry area deal with applied and basic research into modern biology, chemistry and human medicine.

Valuable information can be found on a range of life sciences fields including bacteriology, biochemistry, bionics, bioinformatics, biophysics, biotechnology, genetics, geobotany, human biology, marine biology, microbiology, molecular biology, cellular biology, zoology, bioinorganic chemistry, microchemistry and environmental chemistry.

Back to home

Comments (0)

Write a comment

Newest articles

A universal framework for spatial biology

SpatialData is a freely accessible tool to unify and integrate data from different omics technologies accounting for spatial information, which can provide holistic insights into health and disease. Biological processes…

How complex biological processes arise

A $20 million grant from the U.S. National Science Foundation (NSF) will support the establishment and operation of the National Synthesis Center for Emergence in the Molecular and Cellular Sciences (NCEMS) at…

Airborne single-photon lidar system achieves high-resolution 3D imaging

Compact, low-power system opens doors for photon-efficient drone and satellite-based environmental monitoring and mapping. Researchers have developed a compact and lightweight single-photon airborne lidar system that can acquire high-resolution 3D…

Partners & Sponsors