'It has so far been impossible to produce a translation tool that covers entire languages,' says Aarne Ranta, professor at the Department of Computer Science and Engineering at the University of Gothenburg, Sweden.
Google Translator is a widely spread translation programme that gradually improves the quality of translations through machine learning - the system learns from its own mistakes via system feedback, but tries to do without explicit grammatical rules.
In contrast, MOLTO is being developed in the opposite direction, meaning it begins with precision and grammar, while wide coverage comes later. We wanted to work with a translation technique that is so accurate that people who produce texts can use our translations directly. We have now started to move from precision to increased coverage, meaning that we have started to add more languages to the tool and database.
Professor Ranta is the coordinator of the MOLTO (Multilingual On-Line Translation) project, which includes three universities and two companies. The project is to receive 25 million SEK (2.375 euro) in EU funding over three years. The grant falls in the Machine Translation category, and one requirement has been that the system be developed to include a majority of EU's official languages.
The technique used in MOLTO is based on type theory, just like the technique used by Professor Thierry Coquand when introducing mathematical formulas into computer software. In Coquand's project, type theory serves as a bridge between programming language and mathematics, while in MOLTO it is used to bridge natural languages. The advantage of type theory is that each 'type' expresses content in a language-independent manner. This feature is used in speech technology to transfer meaning from one human language to another.
It is time-consuming to implement the system. First, all words needed for the field of application must be inserted in the language database. Each word is then provided with a type that indicates all possible meanings of the word. Finally, the grammar needs to be defined. At this point, the system needs to be told all the possible combinations of different types, which alternative expressions there are, in which forms the words can occur and how they should be ordered.
The database containing the grammar is called 'resource grammar', and the idea is to make it very easy for a user to extend the grammatical content and add new words. One of the main ideas of the project is that it is open source, meaning that the software shall be accessible to all.
'The purpose of the EU grant is to enable us to use the MOLTO technology to create a system that can be used for translation on the Internet', says Ranta. 'The plan is that producers of web pages should be able to freely download the tool and translate texts into several languages simultaneously. Although the technology does exist already, it is quite cumbersome to use unless you are a computer scientist. In a nutshell, the EU gives us money to modify the tool and make it user friendly for a large number of users.
The project aims at developing the system to suit different areas of applications. One area is translation of patent descriptions. Ultimately, people around the world should be able to take advantage of new technology immediately without having to master the language in which the patent description is written. A large number of translators have long had to be engaged in connection with new patents. Another sub-project aims at meeting the needs of mathematicians for a precise terminology for translation of mathematical teaching material, and then there is one sub-project that concerns descriptions of cultural heritage and museum objects, with a goal that anybody should be able to access these descriptions regardless of native tongue.
The three universities participating in the MOLTO project are the University of Gothenburg, from where the project is coordinated, the University of Helsinki in Finland and the Polytechnic University of Catalonia in Spain. The two participating companies are Ontotext AD, Bulgaria, and Matrix GmbH, Austria.Contact information:
Helena Aaberg | idw
13.11.2018 | Albert-Ludwigs-Universität Freiburg im Breisgau
Improving the understanding of death receptor functions in cells
07.11.2018 | Goethe-Universität Frankfurt am Main
Biochips have been developed at TU Wien (Vienna), on which tissue can be produced and examined. This allows supplying the tissue with different substances in a very controlled way.
Cultivating human cells in the Petri dish is not a big challenge today. Producing artificial tissue, however, permeated by fine blood vessels, is a much more...
Faster and secure data communication: This is the goal of a new joint project involving physicists from the University of Würzburg. The German Federal Ministry of Education and Research funds the project with 14.8 million euro.
In our digital world data security and secure communication are becoming more and more important. Quantum communication is a promising approach to achieve...
On Saturday, 10 November 2018, the research icebreaker Polarstern will leave its homeport of Bremerhaven, bound for Cape Town, South Africa.
When choosing materials to make something, trade-offs need to be made between a host of properties, such as thickness, stiffness and weight. Depending on the application in question, finding just the right balance is the difference between success and failure
Now, a team of Penn Engineers has demonstrated a new material they call "nanocardboard," an ultrathin equivalent of corrugated paper cardboard. A square...
Physicists at ETH Zurich demonstrate how errors that occur during the manipulation of quantum system can be monitored and corrected on the fly
The field of quantum computation has seen tremendous progress in recent years. Bit by bit, quantum devices start to challenge conventional computers, at least...
09.11.2018 | Event News
06.11.2018 | Event News
23.10.2018 | Event News
14.11.2018 | Life Sciences
14.11.2018 | Life Sciences
14.11.2018 | Earth Sciences