This is an intelligent telephone technology based on speech recognition and synthesis. Put simply, it is an automatic telephone operator that you can talk to, that can find answers to your questions, and that is able not only to listen but also to hear. It cannot be rude on the phone, slam down the receiver or otherwise neglect its duties.
“The core of our technology is the speech recognition system,” says Nina Petukhova, senior staff scientist and one of the developers. “In principle, this is already a commercially available product, which we did not create but only used. Our part is the software and the corresponding hardware that make it possible to carry on a dialogue between a person and a machine within a defined subject area. Of course, we have to provide the machine with a database containing responses to the questions asked, or teach the machine to obtain the data interactively. This was not easy, but we have done it for a number of areas, and we are still working on some projects.”
How does such a conversation take place? On the whole, it is like a conversation with a human operator, but a very patient one. Suppose we need to book a ticket. The machine asks the caller for the date and time the ticket is needed, finds out the caller's preferences and offers the available options. If the machine fails to understand, it repeats the question or rephrases it, offers a choice among the available variants, and so on. This is not an answering machine of the kind that works on the touch-tone selection principle, where all responses are recorded in advance. Here the machine really does recognize speech, picking out the key words that carry the information needed for an adequate answer. Once the machine has received the required response (for example, the customer has given the departure date), it moves on to the next question: it asks about the flight, for instance, and then finds out whether the customer wants an aisle or a window seat.
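The question-by-question flow described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the developers' implementation: the slot names, the prompts, the key-word lists and the limit of two attempts are all hypothetical, and the caller's replies are assumed to arrive as text already produced by a speech recognizer.

```python
import re

# Hypothetical slots for a ticket-booking dialogue. Each slot has a question,
# a rephrased version of it, and a pattern listing the key words to spot.
SLOTS = [
    {"name": "date",
     "ask": "What date do you want to travel?",
     "rephrase": "On which day would you like to leave?",
     "pattern": r"\b(today|tomorrow|monday|tuesday|wednesday|thursday|friday|saturday|sunday)\b"},
    {"name": "flight",
     "ask": "Which flight do you prefer?",
     "rephrase": "Would you like the morning or the evening flight?",
     "pattern": r"\b(morning|evening)\b"},
    {"name": "seat",
     "ask": "Which seat would you like?",
     "rephrase": "Do you want an aisle or a window seat?",
     "pattern": r"\b(aisle|window)\b"},
]


def spot_keyword(pattern, utterance):
    """Return the first key word found in the utterance, or None."""
    match = re.search(pattern, utterance.lower())
    return match.group(1) if match else None


def run_dialogue(utterances, max_attempts=2):
    """Ask each question in turn, rephrasing it when no key word is found.

    `utterances` stands in for recognized caller speech (an assumption).
    Returns the filled slots and the list of prompts the machine spoke.
    """
    replies = iter(utterances)
    filled, transcript = {}, []
    for slot in SLOTS:
        for attempt in range(max_attempts):
            prompt = slot["ask"] if attempt == 0 else slot["rephrase"]
            transcript.append(prompt)
            value = spot_keyword(slot["pattern"], next(replies, ""))
            if value is not None:
                filled[slot["name"]] = value
                break  # answer obtained, move on to the next question
        else:
            filled[slot["name"]] = None  # still not understood after all attempts
    return filled, transcript


if __name__ == "__main__":
    answers = ["I'd like to leave tomorrow please", "hmm, not sure",
               "the evening one", "a window seat"]
    print(run_dialogue(answers))
```

In this sketch the second reply contains no key word, so the machine asks about the flight again in other words before moving on to the seat.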
Of course, the question arises: what about the individuality of a customer, or more precisely, of his or her voice? One person speaks in a deep voice, another squeals, and a third groans the way elderly people do. Will such diversity affect the system's ability to recognize speech? It turns out that it will not. The system does not depend on individual speech peculiarities and disregards them, except in two special cases.
In the first case, if the speech is completely indistinct, with serious speech impediments, the automatic operator may fail, and it will then suggest that the customer contact a human operator. In the second case, on the contrary, individuality is strictly taken into account and checked. For example, if an account status is to be checked by phone, an individual approach is needed: first the machine is trained on a reference sample of your voice, and then, once the identification procedure is complete, the machine will talk to you and answer your questions.
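The two special cases can be illustrated with a small routing sketch. The article does not say how the reference voice pattern is stored or compared, so everything here is an assumption made for illustration: the voice is represented by a hypothetical feature vector, the comparison uses cosine similarity, and the threshold of 0.9 is arbitrary.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class VoiceGate:
    """Stores one reference pattern per customer (hypothetical design)."""

    def __init__(self, threshold=0.9):
        self.threshold = threshold  # assumed value, not from the article
        self.references = {}

    def enroll(self, customer_id, features):
        # "Teaching" step: remember the reference pattern of the voice.
        self.references[customer_id] = list(features)

    def verify(self, customer_id, features):
        # Identification step: compare the caller with the stored reference.
        reference = self.references.get(customer_id)
        if reference is None:
            return False
        return cosine_similarity(reference, features) >= self.threshold


def route_call(gate, customer_id, features, understood, sensitive):
    """Decide what the automatic operator does next."""
    if not understood:
        return "human_operator"          # speech too indistinct to recognize
    if sensitive and not gate.verify(customer_id, features):
        return "identification_failed"   # voice does not match the reference
    return "continue_dialogue"


if __name__ == "__main__":
    gate = VoiceGate()
    gate.enroll("customer-1", [1.0, 0.0, 0.5])
    print(route_call(gate, "customer-1", [1.0, 0.05, 0.5], True, True))
```

Ordinary enquiries skip the verification step entirely, which matches the point that the system otherwise disregards individual voice features.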
Of course, the authors understand that alternatives to such “information self-service” already exist: the Internet and ordinary “human” enquiry services, which operate now and are sometimes very successful. However, Internet access is not available always and everywhere, human telephone operators get tired and fall ill, and few enquiry services operate at night and on days off. This is where the speech servers being developed by the Institute of Management Problems, Russian Academy of Sciences, will help. More precisely, these are automated queuing systems, which do not displace the existing systems but extend them and, consequently, our opportunities as well.
Nadezda Markina | alfa