Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

Creating linguistic resources for automated translation

10.02.2005


A major difficulty in developing automated language translation is that you need a system with a fairly extensive vocabulary from which it can learn, before any degree of reliability or accuracy is possible. The LC-STAR project developed just such a vocabulary.



“First, we created large lexica for several language databases,” explains project coordinator Ute Ziegenhain of Siemens in Germany. “Secondly, we developed a demonstrator that could automatically translate speech to speech for output to another interface.”

Having finished on 31 January 2005, the IST programme funded-LC-STAR developed vocabularies, called lexica, and bodies of writings, called corpora, for some 13 languages in all, ranging from Italian and Greek to include Arabic, Chinese, Hebrew and Russian. These linguistic databases comprise a minimum of 100,000 entries per language.


The lexica and corpora are needed to train such systems for reliable, automated speech-to-speech translation (SST). Once developed, the various SST components (flexible speech recognition, high-quality text-to-speech synthesis and speech-centred translation) can be integrated into speech-driven interfaces embedded into mobile appliances and network servers.

The team also produced a working demonstrator called ‘Gaia’, which is a telephone server capable of translating between the project partners’ languages of English, Spanish and Catalan within a single register. LC-STAR focused on the tourism register, however Ziegenhain stresses that the system can be opened up to any domain if it is provided with sufficient vocabulary.

Results already in use

LC-STAR project results are already in use by Siemens within its own speech recognition and speech synthesis systems. They have also been supplied to the European Language Resources Association (ELRA) for further dissemination. ELRA makes available a variety of language resources for language engineering and the evaluation of language-engineering technologies.

In addition, LC-STAR vocabularies and machine-translation technology have been incorporated into the ongoing TC-STAR project. TC-STAR is a long-term effort (six years) focused on advanced research into core technologies for speech-to-speech translation – its goal is to make a breakthrough in reducing the gap between human and machine performance.

Tara Morris | alfa
Further information:
http://istresults.cordis.lu/

More articles from Information Technology:

nachricht Fingerprints of quantum entanglement
16.02.2018 | University of Vienna

nachricht Simple in the Cloud: The digitalization of brownfield systems made easy
07.02.2018 | Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, DFKI

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Demonstration of a single molecule piezoelectric effect

Breakthrough provides a new concept of the design of molecular motors, sensors and electricity generators at nanoscale

Researchers from the Institute of Organic Chemistry and Biochemistry of the CAS (IOCB Prague), Institute of Physics of the CAS (IP CAS) and Palacký University...

Im Focus: Hybrid optics bring color imaging using ultrathin metalenses into focus

For photographers and scientists, lenses are lifesavers. They reflect and refract light, making possible the imaging systems that drive discovery through the microscope and preserve history through cameras.

But today's glass-based lenses are bulky and resist miniaturization. Next-generation technologies, such as ultrathin cameras or tiny microscopes, require...

Im Focus: Stem cell divisions in the adult brain seen for the first time

Scientists from the University of Zurich have succeeded for the first time in tracking individual stem cells and their neuronal progeny over months within the intact adult brain. This study sheds light on how new neurons are produced throughout life.

The generation of new nerve cells was once thought to taper off at the end of embryonic development. However, recent research has shown that the adult brain...

Im Focus: Interference as a new method for cooling quantum devices

Theoretical physicists propose to use negative interference to control heat flow in quantum devices. Study published in Physical Review Letters

Quantum computer parts are sensitive and need to be cooled to very low temperatures. Their tiny size makes them particularly susceptible to a temperature...

Im Focus: Autonomous 3D scanner supports individual manufacturing processes

Let’s say the armrest is broken in your vintage car. As things stand, you would need a lot of luck and persistence to find the right spare part. But in the world of Industrie 4.0 and production with batch sizes of one, you can simply scan the armrest and print it out. This is made possible by the first ever 3D scanner capable of working autonomously and in real time. The autonomous scanning system will be on display at the Hannover Messe Preview on February 6 and at the Hannover Messe proper from April 23 to 27, 2018 (Hall 6, Booth A30).

Part of the charm of vintage cars is that they stopped making them long ago, so it is special when you do see one out on the roads. If something breaks or...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

VideoLinks
Industry & Economy
Event News

2nd International Conference on High Temperature Shape Memory Alloys (HTSMAs)

15.02.2018 | Event News

Aachen DC Grid Summit 2018

13.02.2018 | Event News

How Global Climate Policy Can Learn from the Energy Transition

12.02.2018 | Event News

 
Latest News

Fingerprints of quantum entanglement

16.02.2018 | Information Technology

'Living bandages': NUST MISIS scientists develop biocompatible anti-burn nanofibers

16.02.2018 | Health and Medicine

Hubble sees Neptune's mysterious shrinking storm

16.02.2018 | Physics and Astronomy

VideoLinks
Science & Research
Overview of more VideoLinks >>>