Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

It makes sense to communicate with computers

27.01.2005


The art of communication becomes a science when dealing with computers. Laying the foundations for future research in human-computer interactions, PF-STAR’s speech and gesture databases, and virtual agents open up new approaches to machine-based communications.



Completed in September 2004, the IST project PF-STAR aimed to lay the foundations for future research efforts in Multilingual and Multisensorial Communication, or MMC for short. Over the project’s two-year term, researchers worked to develop a range of advanced technological baselines, comparative speech and non-verbal communication evaluations, as well as an assessment of the prospects in some key areas of technology.

Machines that can communicate like human beings?


Project coordinator Fabio Pianesi of the Istituto Trentino di Cultura in Italy explains MMC as follows, “It’s the kind of technology that you need if you want to communicate with the same facility to both the PC and other human beings. The PC needs to be capable of interpreting and reproducing your gestures and facial expressions, as well as the emotion expressed in your speech, in the same way as humans do.”

Interpreting such subtle visual and aural cues, as well as the meaning of the spoken word, is a highly complex business. Facial expression, gesture, and even variations in pitch and tone of the voice all play their part in the way human beings interact. We use and respond to such subtle elements of human communication in our day-to-day lives almost without being aware of it, since our training in such communication develops from birth.

The challenge for the researchers is how to get a machine to interpret and reproduce such communication subtleties. Linguists have for many years reckoned the task to be near impossible given the number of channels and the complexity of signals involved. However PF-STAR’s work has provided a promising foundation on which future research can develop.

Virtual agents for intelligent interaction

The project partners in PF-STAR have built on several years of research within a variety of national and international projects, most notably NESPOLE!, C-STAR, Verbmobil and SmartKom. In PF-STAR, work focused on three key technological areas: speech-to-speech translation, the detection and expression of emotional states in both verbal and non-verbal channels, and core speech technologies for children. The partners also worked in five languages: English, German, Italian, Spanish and Swedish.

Two project partners, the Royal Institute of Technology (KTH) in Stockholm and the Istituto Trentino di Cultura, hired professional actors at the start of the project to study how speech tone and facial expressions changed while expressing emotions. This data was then fed into the project databases, which led to the development of a series of on-screen facial images, or ‘talking heads’, that offered a machine-based visual alternative to the human face.

These on-screen talking heads, which could be either 2D or 3D facial images, are designed to act as ‘virtual agents’ that can interact intelligently with human beings, other agents or, depending on their level of autonomy, the environment around them. Such virtual agents are believed to have a huge potential for future man/machine communication, in applications from teaching through helpdesks to entertainment.

The project has also allowed for variations in facial expression resulting from cultural differences, says Pianesi. “We should not forget that the expression of emotion is culturally dependent. We had to adapt the expressions on the talking heads to the language concerned, to see how our hypotheses work in the different countries.”

Speech technologies for children were a key area of research for the participants. Error rates for machine-based translation of children’s speech are believed to be some 100 per cent greater than for adults. To help improve such recognition rates, the partners used on-screen virtual agents based on children’s faces rather than on those of adults.

Strong foundation for future research

PF-STAR has laid strong foundations for further research into MMC, says Pianesi. “Two years ago there were no real databases available covering children’s speech, for example. Now we have such speech databases, as well as visual and gesture databases, that we are making available to partners and others.”

The project has also produced several new approaches to machine-based communication. The virtual agents for example are capable of reproducing the emotions expressed, either verbally or as facial expressions, along with the semantics of the message. They can be set to use either both channels (i.e. verbal and non-verbal), or only one.

And the results are more than just data, stresses Pianesi. Since August 2004 the project has made available the databases, the platform and the software for constructing virtual agents, as well as the code to enable further development to be carried out.

Development continues

While PF-STAR is now complete, the project partners are maintaining their development work in the basic technology of machine-based translation. As well as further improving the virtual agents, they are continuing to distribute the technology to client organisations to gain vital feedback on its use. Some of the partners have also commenced within the Sixth Framework Programme (FP6) a project called TC-STAR, a six-year project focused on exploring and evaluating new approaches to machine-based translation, and for creating the infrastructure needed for accelerating the rate of progress in the field.

The area of children’s speech remains of particular interest, says Pianesi. “How can we develop interfaces for instruction, for entertainment and so on, that are suitable for children? How can we produce suitable outputs for children?” Certain partners have come together within another FP6 project, CHIL, to further research children’s communication in schools.

Tara Morris | alfa
Further information:
http://istresults.cordis.lu/

More articles from Information Technology:

nachricht Ultra-precise chip-scale sensor detects unprecedentedly small changes at the nanoscale
18.01.2017 | The Hebrew University of Jerusalem

nachricht Data analysis optimizes cyber-physical systems in telecommunications and building automation
18.01.2017 | Fraunhofer-Institut für Algorithmen und Wissenschaftliches Rechnen SCAI

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Traffic jam in empty space

New success for Konstanz physicists in studying the quantum vacuum

An important step towards a completely new experimental access to quantum physics has been made at University of Konstanz. The team of scientists headed by...

Im Focus: How gut bacteria can make us ill

HZI researchers decipher infection mechanisms of Yersinia and immune responses of the host

Yersiniae cause severe intestinal infections. Studies using Yersinia pseudotuberculosis as a model organism aim to elucidate the infection mechanisms of these...

Im Focus: Interfacial Superconductivity: Magnetic and superconducting order revealed simultaneously

Researchers from the University of Hamburg in Germany, in collaboration with colleagues from the University of Aarhus in Denmark, have synthesized a new superconducting material by growing a few layers of an antiferromagnetic transition-metal chalcogenide on a bismuth-based topological insulator, both being non-superconducting materials.

While superconductivity and magnetism are generally believed to be mutually exclusive, surprisingly, in this new material, superconducting correlations...

Im Focus: Studying fundamental particles in materials

Laser-driving of semimetals allows creating novel quasiparticle states within condensed matter systems and switching between different states on ultrafast time scales

Studying properties of fundamental particles in condensed matter systems is a promising approach to quantum field theory. Quasiparticles offer the opportunity...

Im Focus: Designing Architecture with Solar Building Envelopes

Among the general public, solar thermal energy is currently associated with dark blue, rectangular collectors on building roofs. Technologies are needed for aesthetically high quality architecture which offer the architect more room for manoeuvre when it comes to low- and plus-energy buildings. With the “ArKol” project, researchers at Fraunhofer ISE together with partners are currently developing two façade collectors for solar thermal energy generation, which permit a high degree of design flexibility: a strip collector for opaque façade sections and a solar thermal blind for transparent sections. The current state of the two developments will be presented at the BAU 2017 trade fair.

As part of the “ArKol – development of architecturally highly integrated façade collectors with heat pipes” project, Fraunhofer ISE together with its partners...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

Sustainable Water use in Agriculture in Eastern Europe and Central Asia

19.01.2017 | Event News

12V, 48V, high-voltage – trends in E/E automotive architecture

10.01.2017 | Event News

2nd Conference on Non-Textual Information on 10 and 11 May 2017 in Hannover

09.01.2017 | Event News

 
Latest News

Helmholtz International Fellow Award for Sarah Amalia Teichmann

20.01.2017 | Awards Funding

An innovative high-performance material: biofibers made from green lacewing silk

20.01.2017 | Materials Sciences

Ion treatments for cardiac arrhythmia — Non-invasive alternative to catheter-based surgery

20.01.2017 | Life Sciences

VideoLinks
B2B-VideoLinks
More VideoLinks >>>