Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

Creating linguistic resources for automated translation

10.02.2005


A major difficulty in developing automated language translation is that you need a system with a fairly extensive vocabulary from which it can learn, before any degree of reliability or accuracy is possible. The LC-STAR project developed just such a vocabulary.



“First, we created large lexica for several language databases,” explains project coordinator Ute Ziegenhain of Siemens in Germany. “Secondly, we developed a demonstrator that could automatically translate speech to speech for output to another interface.”

Having finished on 31 January 2005, the IST programme funded-LC-STAR developed vocabularies, called lexica, and bodies of writings, called corpora, for some 13 languages in all, ranging from Italian and Greek to include Arabic, Chinese, Hebrew and Russian. These linguistic databases comprise a minimum of 100,000 entries per language.


The lexica and corpora are needed to train such systems for reliable, automated speech-to-speech translation (SST). Once developed, the various SST components (flexible speech recognition, high-quality text-to-speech synthesis and speech-centred translation) can be integrated into speech-driven interfaces embedded into mobile appliances and network servers.

The team also produced a working demonstrator called ‘Gaia’, which is a telephone server capable of translating between the project partners’ languages of English, Spanish and Catalan within a single register. LC-STAR focused on the tourism register, however Ziegenhain stresses that the system can be opened up to any domain if it is provided with sufficient vocabulary.

Results already in use

LC-STAR project results are already in use by Siemens within its own speech recognition and speech synthesis systems. They have also been supplied to the European Language Resources Association (ELRA) for further dissemination. ELRA makes available a variety of language resources for language engineering and the evaluation of language-engineering technologies.

In addition, LC-STAR vocabularies and machine-translation technology have been incorporated into the ongoing TC-STAR project. TC-STAR is a long-term effort (six years) focused on advanced research into core technologies for speech-to-speech translation – its goal is to make a breakthrough in reducing the gap between human and machine performance.

Tara Morris | alfa
Further information:
http://istresults.cordis.lu/

More articles from Information Technology:

nachricht Stable magnetic bit of three atoms
21.09.2017 | Sonderforschungsbereich 668

nachricht Drones can almost see in the dark
20.09.2017 | Universität Zürich

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: The pyrenoid is a carbon-fixing liquid droplet

Plants and algae use the enzyme Rubisco to fix carbon dioxide, removing it from the atmosphere and converting it into biomass. Algae have figured out a way to increase the efficiency of carbon fixation. They gather most of their Rubisco into a ball-shaped microcompartment called the pyrenoid, which they flood with a high local concentration of carbon dioxide. A team of scientists at Princeton University, the Carnegie Institution for Science, Stanford University and the Max Plank Institute of Biochemistry have unravelled the mysteries of how the pyrenoid is assembled. These insights can help to engineer crops that remove more carbon dioxide from the atmosphere while producing more food.

A warming planet

Im Focus: Highly precise wiring in the Cerebral Cortex

Our brains house extremely complex neuronal circuits, whose detailed structures are still largely unknown. This is especially true for the so-called cerebral cortex of mammals, where among other things vision, thoughts or spatial orientation are being computed. Here the rules by which nerve cells are connected to each other are only partly understood. A team of scientists around Moritz Helmstaedter at the Frankfiurt Max Planck Institute for Brain Research and Helene Schmidt (Humboldt University in Berlin) have now discovered a surprisingly precise nerve cell connectivity pattern in the part of the cerebral cortex that is responsible for orienting the individual animal or human in space.

The researchers report online in Nature (Schmidt et al., 2017. Axonal synapse sorting in medial entorhinal cortex, DOI: 10.1038/nature24005) that synapses in...

Im Focus: Tiny lasers from a gallery of whispers

New technique promises tunable laser devices

Whispering gallery mode (WGM) resonators are used to make tiny micro-lasers, sensors, switches, routers and other devices. These tiny structures rely on a...

Im Focus: Ultrafast snapshots of relaxing electrons in solids

Using ultrafast flashes of laser and x-ray radiation, scientists at the Max Planck Institute of Quantum Optics (Garching, Germany) took snapshots of the briefest electron motion inside a solid material to date. The electron motion lasted only 750 billionths of the billionth of a second before it fainted, setting a new record of human capability to capture ultrafast processes inside solids!

When x-rays shine onto solid materials or large molecules, an electron is pushed away from its original place near the nucleus of the atom, leaving a hole...

Im Focus: Quantum Sensors Decipher Magnetic Ordering in a New Semiconducting Material

For the first time, physicists have successfully imaged spiral magnetic ordering in a multiferroic material. These materials are considered highly promising candidates for future data storage media. The researchers were able to prove their findings using unique quantum sensors that were developed at Basel University and that can analyze electromagnetic fields on the nanometer scale. The results – obtained by scientists from the University of Basel’s Department of Physics, the Swiss Nanoscience Institute, the University of Montpellier and several laboratories from University Paris-Saclay – were recently published in the journal Nature.

Multiferroics are materials that simultaneously react to electric and magnetic fields. These two properties are rarely found together, and their combined...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

“Lasers in Composites Symposium” in Aachen – from Science to Application

19.09.2017 | Event News

I-ESA 2018 – Call for Papers

12.09.2017 | Event News

EMBO at Basel Life, a new conference on current and emerging life science research

06.09.2017 | Event News

 
Latest News

Rainbow colors reveal cell history: Uncovering β-cell heterogeneity

22.09.2017 | Life Sciences

Penn first in world to treat patient with new radiation technology

22.09.2017 | Medical Engineering

Calculating quietness

22.09.2017 | Physics and Astronomy

VideoLinks
B2B-VideoLinks
More VideoLinks >>>