Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

Creating linguistic resources for automated translation

10.02.2005


A major difficulty in developing automated language translation is that you need a system with a fairly extensive vocabulary from which it can learn, before any degree of reliability or accuracy is possible. The LC-STAR project developed just such a vocabulary.



“First, we created large lexica for several language databases,” explains project coordinator Ute Ziegenhain of Siemens in Germany. “Secondly, we developed a demonstrator that could automatically translate speech to speech for output to another interface.”

Having finished on 31 January 2005, the IST programme funded-LC-STAR developed vocabularies, called lexica, and bodies of writings, called corpora, for some 13 languages in all, ranging from Italian and Greek to include Arabic, Chinese, Hebrew and Russian. These linguistic databases comprise a minimum of 100,000 entries per language.


The lexica and corpora are needed to train such systems for reliable, automated speech-to-speech translation (SST). Once developed, the various SST components (flexible speech recognition, high-quality text-to-speech synthesis and speech-centred translation) can be integrated into speech-driven interfaces embedded into mobile appliances and network servers.

The team also produced a working demonstrator called ‘Gaia’, which is a telephone server capable of translating between the project partners’ languages of English, Spanish and Catalan within a single register. LC-STAR focused on the tourism register, however Ziegenhain stresses that the system can be opened up to any domain if it is provided with sufficient vocabulary.

Results already in use

LC-STAR project results are already in use by Siemens within its own speech recognition and speech synthesis systems. They have also been supplied to the European Language Resources Association (ELRA) for further dissemination. ELRA makes available a variety of language resources for language engineering and the evaluation of language-engineering technologies.

In addition, LC-STAR vocabularies and machine-translation technology have been incorporated into the ongoing TC-STAR project. TC-STAR is a long-term effort (six years) focused on advanced research into core technologies for speech-to-speech translation – its goal is to make a breakthrough in reducing the gap between human and machine performance.

Tara Morris | alfa
Further information:
http://istresults.cordis.lu/

More articles from Information Technology:

nachricht Satellite data for agriculture
28.07.2017 | Julius-Maximilians-Universität Würzburg

nachricht Magnetic Quantum Objects in a "Nano Egg-Box"
25.07.2017 | Universität Wien

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Abrupt motion sharpens x-ray pulses

Spectrally narrow x-ray pulses may be “sharpened” by purely mechanical means. This sounds surprisingly, but a team of theoretical and experimental physicists developed and realized such a method. It is based on fast motions, precisely synchronized with the pulses, of a target interacting with the x-ray light. Thereby, photons are redistributed within the x-ray pulse to the desired spectral region.

A team of theoretical physicists from the MPI for Nuclear Physics (MPIK) in Heidelberg has developed a novel method to intensify the spectrally broad x-ray...

Im Focus: Physicists Design Ultrafocused Pulses

Physicists working with researcher Oriol Romero-Isart devised a new simple scheme to theoretically generate arbitrarily short and focused electromagnetic fields. This new tool could be used for precise sensing and in microscopy.

Microwaves, heat radiation, light and X-radiation are examples for electromagnetic waves. Many applications require to focus the electromagnetic fields to...

Im Focus: Carbon Nanotubes Turn Electrical Current into Light-emitting Quasi-particles

Strong light-matter coupling in these semiconducting tubes may hold the key to electrically pumped lasers

Light-matter quasi-particles can be generated electrically in semiconducting carbon nanotubes. Material scientists and physicists from Heidelberg University...

Im Focus: Flexible proximity sensor creates smart surfaces

Fraunhofer IPA has developed a proximity sensor made from silicone and carbon nanotubes (CNT) which detects objects and determines their position. The materials and printing process used mean that the sensor is extremely flexible, economical and can be used for large surfaces. Industry and research partners can use and further develop this innovation straight away.

At first glance, the proximity sensor appears to be nothing special: a thin, elastic layer of silicone onto which black square surfaces are printed, but these...

Im Focus: 3-D scanning with water

3-D shape acquisition using water displacement as the shape sensor for the reconstruction of complex objects

A global team of computer scientists and engineers have developed an innovative technique that more completely reconstructs challenging 3D objects. An ancient...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

Clash of Realities 2017: Registration now open. International Conference at TH Köln

26.07.2017 | Event News

Closing the Sustainability Circle: Protection of Food with Biobased Materials

21.07.2017 | Event News

»We are bringing Additive Manufacturing to SMEs«

19.07.2017 | Event News

 
Latest News

New 3-D imaging reveals how human cell nucleus organizes DNA and chromatin of its genome

28.07.2017 | Health and Medicine

Heavy metals in water meet their match

28.07.2017 | Power and Electrical Engineering

Oestrogen regulates pathological changes of bones via bone lining cells

28.07.2017 | Life Sciences

VideoLinks
B2B-VideoLinks
More VideoLinks >>>