Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

It makes sense to communicate with computers

27.01.2005


The art of communication becomes a science when dealing with computers. Laying the foundations for future research in human-computer interactions, PF-STAR’s speech and gesture databases, and virtual agents open up new approaches to machine-based communications.



Completed in September 2004, the IST project PF-STAR aimed to lay the foundations for future research efforts in Multilingual and Multisensorial Communication, or MMC for short. Over the project’s two-year term, researchers worked to develop a range of advanced technological baselines, comparative speech and non-verbal communication evaluations, as well as an assessment of the prospects in some key areas of technology.

Machines that can communicate like human beings?


Project coordinator Fabio Pianesi of the Istituto Trentino di Cultura in Italy explains MMC as follows, “It’s the kind of technology that you need if you want to communicate with the same facility to both the PC and other human beings. The PC needs to be capable of interpreting and reproducing your gestures and facial expressions, as well as the emotion expressed in your speech, in the same way as humans do.”

Interpreting such subtle visual and aural cues, as well as the meaning of the spoken word, is a highly complex business. Facial expression, gesture, and even variations in pitch and tone of the voice all play their part in the way human beings interact. We use and respond to such subtle elements of human communication in our day-to-day lives almost without being aware of it, since our training in such communication develops from birth.

The challenge for the researchers is how to get a machine to interpret and reproduce such communication subtleties. Linguists have for many years reckoned the task to be near impossible given the number of channels and the complexity of signals involved. However PF-STAR’s work has provided a promising foundation on which future research can develop.

Virtual agents for intelligent interaction

The project partners in PF-STAR have built on several years of research within a variety of national and international projects, most notably NESPOLE!, C-STAR, Verbmobil and SmartKom. In PF-STAR, work focused on three key technological areas: speech-to-speech translation, the detection and expression of emotional states in both verbal and non-verbal channels, and core speech technologies for children. The partners also worked in five languages: English, German, Italian, Spanish and Swedish.

Two project partners, the Royal Institute of Technology (KTH) in Stockholm and the Istituto Trentino di Cultura, hired professional actors at the start of the project to study how speech tone and facial expressions changed while expressing emotions. This data was then fed into the project databases, which led to the development of a series of on-screen facial images, or ‘talking heads’, that offered a machine-based visual alternative to the human face.

These on-screen talking heads, which could be either 2D or 3D facial images, are designed to act as ‘virtual agents’ that can interact intelligently with human beings, other agents or, depending on their level of autonomy, the environment around them. Such virtual agents are believed to have a huge potential for future man/machine communication, in applications from teaching through helpdesks to entertainment.

The project has also allowed for variations in facial expression resulting from cultural differences, says Pianesi. “We should not forget that the expression of emotion is culturally dependent. We had to adapt the expressions on the talking heads to the language concerned, to see how our hypotheses work in the different countries.”

Speech technologies for children were a key area of research for the participants. Error rates for machine-based translation of children’s speech are believed to be some 100 per cent greater than for adults. To help improve such recognition rates, the partners used on-screen virtual agents based on children’s faces rather than on those of adults.

Strong foundation for future research

PF-STAR has laid strong foundations for further research into MMC, says Pianesi. “Two years ago there were no real databases available covering children’s speech, for example. Now we have such speech databases, as well as visual and gesture databases, that we are making available to partners and others.”

The project has also produced several new approaches to machine-based communication. The virtual agents for example are capable of reproducing the emotions expressed, either verbally or as facial expressions, along with the semantics of the message. They can be set to use either both channels (i.e. verbal and non-verbal), or only one.

And the results are more than just data, stresses Pianesi. Since August 2004 the project has made available the databases, the platform and the software for constructing virtual agents, as well as the code to enable further development to be carried out.

Development continues

While PF-STAR is now complete, the project partners are maintaining their development work in the basic technology of machine-based translation. As well as further improving the virtual agents, they are continuing to distribute the technology to client organisations to gain vital feedback on its use. Some of the partners have also commenced within the Sixth Framework Programme (FP6) a project called TC-STAR, a six-year project focused on exploring and evaluating new approaches to machine-based translation, and for creating the infrastructure needed for accelerating the rate of progress in the field.

The area of children’s speech remains of particular interest, says Pianesi. “How can we develop interfaces for instruction, for entertainment and so on, that are suitable for children? How can we produce suitable outputs for children?” Certain partners have come together within another FP6 project, CHIL, to further research children’s communication in schools.

Tara Morris | alfa
Further information:
http://istresults.cordis.lu/

More articles from Information Technology:

nachricht Supercomputing the emergence of material behavior
18.05.2018 | University of Texas at Austin, Texas Advanced Computing Center

nachricht Keeping a Close Eye on Ice Loss
18.05.2018 | Alfred-Wegener-Institut, Helmholtz-Zentrum für Polar- und Meeresforschung

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: LZH showcases laser material processing of tomorrow at the LASYS 2018

At the LASYS 2018, from June 5th to 7th, the Laser Zentrum Hannover e.V. (LZH) will be showcasing processes for the laser material processing of tomorrow in hall 4 at stand 4E75. With blown bomb shells the LZH will present first results of a research project on civil security.

At this year's LASYS, the LZH will exhibit light-based processes such as cutting, welding, ablation and structuring as well as additive manufacturing for...

Im Focus: Self-illuminating pixels for a new display generation

There are videos on the internet that can make one marvel at technology. For example, a smartphone is casually bent around the arm or a thin-film display is rolled in all directions and with almost every diameter. From the user's point of view, this looks fantastic. From a professional point of view, however, the question arises: Is that already possible?

At Display Week 2018, scientists from the Fraunhofer Institute for Applied Polymer Research IAP will be demonstrating today’s technological possibilities and...

Im Focus: Explanation for puzzling quantum oscillations has been found

So-called quantum many-body scars allow quantum systems to stay out of equilibrium much longer, explaining experiment | Study published in Nature Physics

Recently, researchers from Harvard and MIT succeeded in trapping a record 53 atoms and individually controlling their quantum state, realizing what is called a...

Im Focus: Dozens of binaries from Milky Way's globular clusters could be detectable by LISA

Next-generation gravitational wave detector in space will complement LIGO on Earth

The historic first detection of gravitational waves from colliding black holes far outside our galaxy opened a new window to understanding the universe. A...

Im Focus: Entangled atoms shine in unison

A team led by Austrian experimental physicist Rainer Blatt has succeeded in characterizing the quantum entanglement of two spatially separated atoms by observing their light emission. This fundamental demonstration could lead to the development of highly sensitive optical gradiometers for the precise measurement of the gravitational field or the earth's magnetic field.

The age of quantum technology has long been heralded. Decades of research into the quantum world have led to the development of methods that make it possible...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

VideoLinks
Industry & Economy
Event News

Save the date: Forum European Neuroscience – 07-11 July 2018 in Berlin, Germany

02.05.2018 | Event News

Invitation to the upcoming "Current Topics in Bioinformatics: Big Data in Genomics and Medicine"

13.04.2018 | Event News

Unique scope of UV LED technologies and applications presented in Berlin: ICULTA-2018

12.04.2018 | Event News

 
Latest News

Designer cells: artificial enzyme can activate a gene switch

22.05.2018 | Life Sciences

PR of MCC: Carbon removal from atmosphere unavoidable for 1.5 degree target

22.05.2018 | Earth Sciences

Achema 2018: New camera system monitors distillation and helps save energy

22.05.2018 | Trade Fair News

VideoLinks
Science & Research
Overview of more VideoLinks >>>