Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

An Ear For Robots

04.07.2005


A fundamentally new approach to computer identification of words was been suggested by Russian scientists. With its help, people will be able to give orders even to the most primitive cellular phones.



A sentient being recognizes without difficulty a familiar word regardless of the voice and intonation it is pronounced with. “Six” or “eight” remain six and eight for a person no matter how they are pronounced – in a loud voice or in a whisper, in an excited or a calm voice, by the voice of an old man or a child, by that of a man or a woman. The brain of a person immediately separates the semantic part from the mass of background sounds.

As for a machine, each variant of voice is unique. That is why the speech recognition program usually has to be taught. As a result of training, an enormous library appears in the memory of the silicon brain, where thousands of possible options of pronunciation of the same words (for example, numerals) are stored. Having heard a word, the computer would look through the library and almost certainly something similar to the heard word will be found in it.


The approach suggested by the scientists from the Institute of Radio Engineering and Electronics, Russian Academy of Sciences, is rather human, than machine one: a computer under the researchers’ guidance filters individual peculiarities, i.e. picks out the most basic things and rejects all immaterial ones. As a result, the machine even acquires the ability to discern individual sounds and to put together in its mind familiar words from these sounds.

As a result, memory of only 1 KB would be sufficient for a processor to confidently recognize all numerals and some simple commands, however, pronounced only in Russian yet. Several dozens of human beings – men and women, with irreproachable and far-from-ideal articulation – tried to confuse a quick-witted program, pronouncing numerals either in a whisper or in a voice trembling with excitement. However, the computer successfully rejected emotional frequencies as immaterial.

“The prototype software interface developed and established by ourspecialists for the system of data and management commands voice input is intended for mass mobile electronic devices, says the project manager, Vyacheslav Anciperov. Perhaps, the most important and fundamentally new about our work is that we have managed to single out essential elements of speech being guided by the notion of hierarchical structure of speech. Like in a musical composition, one can recognize more or less high levels of organization - rhythm, main theme, arrangement, so we have also learned to single out the ranges in the speech flow (i.e. in the wide frequency spectrum), which carry the major semantic loading. It has turned out that this is a very small part of human speech sounds – only up to 1 KHz. All the rest relates to psychophysis. Thus we simplified the task for the computer to the maximum. And one more thing – we have taught the computer to recognize individual sounds, which is sometimes far from easy. As a result, our system wins in processing speed and in processor time and memory consumption as compared to those of all known similar systems. This is the path to efficient speech processors that nobody has passed yet.”

Sergey Komarov | alfa
Further information:
http://www.informnauka.ru

More articles from Information Technology:

nachricht Smart Computers
21.08.2017 | Albert-Ludwigs-Universität Freiburg im Breisgau

nachricht AI implications: Engineer's model lays groundwork for machine-learning device
18.08.2017 | Washington University in St. Louis

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Fizzy soda water could be key to clean manufacture of flat wonder material: Graphene

Whether you call it effervescent, fizzy, or sparkling, carbonated water is making a comeback as a beverage. Aside from quenching thirst, researchers at the University of Illinois at Urbana-Champaign have discovered a new use for these "bubbly" concoctions that will have major impact on the manufacturer of the world's thinnest, flattest, and one most useful materials -- graphene.

As graphene's popularity grows as an advanced "wonder" material, the speed and quality at which it can be manufactured will be paramount. With that in mind,...

Im Focus: Exotic quantum states made from light: Physicists create optical “wells” for a super-photon

Physicists at the University of Bonn have managed to create optical hollows and more complex patterns into which the light of a Bose-Einstein condensate flows. The creation of such highly low-loss structures for light is a prerequisite for complex light circuits, such as for quantum information processing for a new generation of computers. The researchers are now presenting their results in the journal Nature Photonics.

Light particles (photons) occur as tiny, indivisible portions. Many thousands of these light portions can be merged to form a single super-photon if they are...

Im Focus: Circular RNA linked to brain function

For the first time, scientists have shown that circular RNA is linked to brain function. When a RNA molecule called Cdr1as was deleted from the genome of mice, the animals had problems filtering out unnecessary information – like patients suffering from neuropsychiatric disorders.

While hundreds of circular RNAs (circRNAs) are abundant in mammalian brains, one big question has remained unanswered: What are they actually good for? In the...

Im Focus: RAVAN CubeSat measures Earth's outgoing energy

An experimental small satellite has successfully collected and delivered data on a key measurement for predicting changes in Earth's climate.

The Radiometer Assessment using Vertically Aligned Nanotubes (RAVAN) CubeSat was launched into low-Earth orbit on Nov. 11, 2016, in order to test new...

Im Focus: Scientists shine new light on the “other high temperature superconductor”

A study led by scientists of the Max Planck Institute for the Structure and Dynamics of Matter (MPSD) at the Center for Free-Electron Laser Science in Hamburg presents evidence of the coexistence of superconductivity and “charge-density-waves” in compounds of the poorly-studied family of bismuthates. This observation opens up new perspectives for a deeper understanding of the phenomenon of high-temperature superconductivity, a topic which is at the core of condensed matter research since more than 30 years. The paper by Nicoletti et al has been published in the PNAS.

Since the beginning of the 20th century, superconductivity had been observed in some metals at temperatures only a few degrees above the absolute zero (minus...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

Call for Papers – ICNFT 2018, 5th International Conference on New Forming Technology

16.08.2017 | Event News

Sustainability is the business model of tomorrow

04.08.2017 | Event News

Clash of Realities 2017: Registration now open. International Conference at TH Köln

26.07.2017 | Event News

 
Latest News

Molecular volume control

22.08.2017 | Life Sciences

When fish swim in the holodeck

22.08.2017 | Life Sciences

Biochemical 'fingerprints' reveal diabetes progression

22.08.2017 | Life Sciences

VideoLinks
B2B-VideoLinks
More VideoLinks >>>