Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

An Ear For Robots

04.07.2005


A fundamentally new approach to computer identification of words was been suggested by Russian scientists. With its help, people will be able to give orders even to the most primitive cellular phones.



A sentient being recognizes without difficulty a familiar word regardless of the voice and intonation it is pronounced with. “Six” or “eight” remain six and eight for a person no matter how they are pronounced – in a loud voice or in a whisper, in an excited or a calm voice, by the voice of an old man or a child, by that of a man or a woman. The brain of a person immediately separates the semantic part from the mass of background sounds.

As for a machine, each variant of voice is unique. That is why the speech recognition program usually has to be taught. As a result of training, an enormous library appears in the memory of the silicon brain, where thousands of possible options of pronunciation of the same words (for example, numerals) are stored. Having heard a word, the computer would look through the library and almost certainly something similar to the heard word will be found in it.


The approach suggested by the scientists from the Institute of Radio Engineering and Electronics, Russian Academy of Sciences, is rather human, than machine one: a computer under the researchers’ guidance filters individual peculiarities, i.e. picks out the most basic things and rejects all immaterial ones. As a result, the machine even acquires the ability to discern individual sounds and to put together in its mind familiar words from these sounds.

As a result, memory of only 1 KB would be sufficient for a processor to confidently recognize all numerals and some simple commands, however, pronounced only in Russian yet. Several dozens of human beings – men and women, with irreproachable and far-from-ideal articulation – tried to confuse a quick-witted program, pronouncing numerals either in a whisper or in a voice trembling with excitement. However, the computer successfully rejected emotional frequencies as immaterial.

“The prototype software interface developed and established by ourspecialists for the system of data and management commands voice input is intended for mass mobile electronic devices, says the project manager, Vyacheslav Anciperov. Perhaps, the most important and fundamentally new about our work is that we have managed to single out essential elements of speech being guided by the notion of hierarchical structure of speech. Like in a musical composition, one can recognize more or less high levels of organization - rhythm, main theme, arrangement, so we have also learned to single out the ranges in the speech flow (i.e. in the wide frequency spectrum), which carry the major semantic loading. It has turned out that this is a very small part of human speech sounds – only up to 1 KHz. All the rest relates to psychophysis. Thus we simplified the task for the computer to the maximum. And one more thing – we have taught the computer to recognize individual sounds, which is sometimes far from easy. As a result, our system wins in processing speed and in processor time and memory consumption as compared to those of all known similar systems. This is the path to efficient speech processors that nobody has passed yet.”

Sergey Komarov | alfa
Further information:
http://www.informnauka.ru

More articles from Information Technology:

nachricht World's thinnest hologram paves path to new 3-D world
18.05.2017 | RMIT University

nachricht Internet of things made simple: One sensor package does work of many
11.05.2017 | Carnegie Mellon University

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Turmoil in sluggish electrons’ existence

An international team of physicists has monitored the scattering behaviour of electrons in a non-conducting material in real-time. Their insights could be beneficial for radiotherapy.

We can refer to electrons in non-conducting materials as ‘sluggish’. Typically, they remain fixed in a location, deep inside an atomic composite. It is hence...

Im Focus: Wafer-thin Magnetic Materials Developed for Future Quantum Technologies

Two-dimensional magnetic structures are regarded as a promising material for new types of data storage, since the magnetic properties of individual molecular building blocks can be investigated and modified. For the first time, researchers have now produced a wafer-thin ferrimagnet, in which molecules with different magnetic centers arrange themselves on a gold surface to form a checkerboard pattern. Scientists at the Swiss Nanoscience Institute at the University of Basel and the Paul Scherrer Institute published their findings in the journal Nature Communications.

Ferrimagnets are composed of two centers which are magnetized at different strengths and point in opposing directions. Two-dimensional, quasi-flat ferrimagnets...

Im Focus: World's thinnest hologram paves path to new 3-D world

Nano-hologram paves way for integration of 3-D holography into everyday electronics

An Australian-Chinese research team has created the world's thinnest hologram, paving the way towards the integration of 3D holography into everyday...

Im Focus: Using graphene to create quantum bits

In the race to produce a quantum computer, a number of projects are seeking a way to create quantum bits -- or qubits -- that are stable, meaning they are not much affected by changes in their environment. This normally needs highly nonlinear non-dissipative elements capable of functioning at very low temperatures.

In pursuit of this goal, researchers at EPFL's Laboratory of Photonics and Quantum Measurements LPQM (STI/SB), have investigated a nonlinear graphene-based...

Im Focus: Bacteria harness the lotus effect to protect themselves

Biofilms: Researchers find the causes of water-repelling properties

Dental plaque and the viscous brown slime in drainpipes are two familiar examples of bacterial biofilms. Removing such bacterial depositions from surfaces is...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

AWK Aachen Machine Tool Colloquium 2017: Internet of Production for Agile Enterprises

23.05.2017 | Event News

Dortmund MST Conference presents Individualized Healthcare Solutions with micro and nanotechnology

22.05.2017 | Event News

Innovation 4.0: Shaping a humane fourth industrial revolution

17.05.2017 | Event News

 
Latest News

Scientists propose synestia, a new type of planetary object

23.05.2017 | Physics and Astronomy

Zap! Graphene is bad news for bacteria

23.05.2017 | Life Sciences

Medical gamma-ray camera is now palm-sized

23.05.2017 | Medical Engineering

VideoLinks
B2B-VideoLinks
More VideoLinks >>>