Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

An Ear For Robots

04.07.2005


A fundamentally new approach to computer identification of words was been suggested by Russian scientists. With its help, people will be able to give orders even to the most primitive cellular phones.



A sentient being recognizes without difficulty a familiar word regardless of the voice and intonation it is pronounced with. “Six” or “eight” remain six and eight for a person no matter how they are pronounced – in a loud voice or in a whisper, in an excited or a calm voice, by the voice of an old man or a child, by that of a man or a woman. The brain of a person immediately separates the semantic part from the mass of background sounds.

As for a machine, each variant of voice is unique. That is why the speech recognition program usually has to be taught. As a result of training, an enormous library appears in the memory of the silicon brain, where thousands of possible options of pronunciation of the same words (for example, numerals) are stored. Having heard a word, the computer would look through the library and almost certainly something similar to the heard word will be found in it.


The approach suggested by the scientists from the Institute of Radio Engineering and Electronics, Russian Academy of Sciences, is rather human, than machine one: a computer under the researchers’ guidance filters individual peculiarities, i.e. picks out the most basic things and rejects all immaterial ones. As a result, the machine even acquires the ability to discern individual sounds and to put together in its mind familiar words from these sounds.

As a result, memory of only 1 KB would be sufficient for a processor to confidently recognize all numerals and some simple commands, however, pronounced only in Russian yet. Several dozens of human beings – men and women, with irreproachable and far-from-ideal articulation – tried to confuse a quick-witted program, pronouncing numerals either in a whisper or in a voice trembling with excitement. However, the computer successfully rejected emotional frequencies as immaterial.

“The prototype software interface developed and established by ourspecialists for the system of data and management commands voice input is intended for mass mobile electronic devices, says the project manager, Vyacheslav Anciperov. Perhaps, the most important and fundamentally new about our work is that we have managed to single out essential elements of speech being guided by the notion of hierarchical structure of speech. Like in a musical composition, one can recognize more or less high levels of organization - rhythm, main theme, arrangement, so we have also learned to single out the ranges in the speech flow (i.e. in the wide frequency spectrum), which carry the major semantic loading. It has turned out that this is a very small part of human speech sounds – only up to 1 KHz. All the rest relates to psychophysis. Thus we simplified the task for the computer to the maximum. And one more thing – we have taught the computer to recognize individual sounds, which is sometimes far from easy. As a result, our system wins in processing speed and in processor time and memory consumption as compared to those of all known similar systems. This is the path to efficient speech processors that nobody has passed yet.”

Sergey Komarov | alfa
Further information:
http://www.informnauka.ru

More articles from Information Technology:

nachricht Construction of practical quantum computers radically simplified
05.12.2016 | University of Sussex

nachricht UT professor develops algorithm to improve online mapping of disaster areas
29.11.2016 | University of Tennessee at Knoxville

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Shape matters when light meets atom

Mapping the interaction of a single atom with a single photon may inform design of quantum devices

Have you ever wondered how you see the world? Vision is about photons of light, which are packets of energy, interacting with the atoms or molecules in what...

Im Focus: Novel silicon etching technique crafts 3-D gradient refractive index micro-optics

A multi-institutional research collaboration has created a novel approach for fabricating three-dimensional micro-optics through the shape-defined formation of porous silicon (PSi), with broad impacts in integrated optoelectronics, imaging, and photovoltaics.

Working with colleagues at Stanford and The Dow Chemical Company, researchers at the University of Illinois at Urbana-Champaign fabricated 3-D birefringent...

Im Focus: Quantum Particles Form Droplets

In experiments with magnetic atoms conducted at extremely low temperatures, scientists have demonstrated a unique phase of matter: The atoms form a new type of quantum liquid or quantum droplet state. These so called quantum droplets may preserve their form in absence of external confinement because of quantum effects. The joint team of experimental physicists from Innsbruck and theoretical physicists from Hannover report on their findings in the journal Physical Review X.

“Our Quantum droplets are in the gas phase but they still drop like a rock,” explains experimental physicist Francesca Ferlaino when talking about the...

Im Focus: MADMAX: Max Planck Institute for Physics takes up axion research

The Max Planck Institute for Physics (MPP) is opening up a new research field. A workshop from November 21 - 22, 2016 will mark the start of activities for an innovative axion experiment. Axions are still only purely hypothetical particles. Their detection could solve two fundamental problems in particle physics: What dark matter consists of and why it has not yet been possible to directly observe a CP violation for the strong interaction.

The “MADMAX” project is the MPP’s commitment to axion research. Axions are so far only a theoretical prediction and are difficult to detect: on the one hand,...

Im Focus: Molecules change shape when wet

Broadband rotational spectroscopy unravels structural reshaping of isolated molecules in the gas phase to accommodate water

In two recent publications in the Journal of Chemical Physics and in the Journal of Physical Chemistry Letters, researchers around Melanie Schnell from the Max...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

ICTM Conference 2017: Production technology for turbomachine manufacturing of the future

16.11.2016 | Event News

Innovation Day Laser Technology – Laser Additive Manufacturing

01.11.2016 | Event News

#IC2S2: When Social Science meets Computer Science - GESIS will host the IC2S2 conference 2017

14.10.2016 | Event News

 
Latest News

IHP presents the fastest silicon-based transistor in the world

05.12.2016 | Power and Electrical Engineering

InLight study: insights into chemical processes using light

05.12.2016 | Materials Sciences

High-precision magnetic field sensing

05.12.2016 | Power and Electrical Engineering

VideoLinks
B2B-VideoLinks
More VideoLinks >>>