Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:


It makes sense to communicate with computers


The art of communication becomes a science when dealing with computers. Laying the foundations for future research in human-computer interactions, PF-STAR’s speech and gesture databases, and virtual agents open up new approaches to machine-based communications.

Completed in September 2004, the IST project PF-STAR aimed to lay the foundations for future research efforts in Multilingual and Multisensorial Communication, or MMC for short. Over the project’s two-year term, researchers worked to develop a range of advanced technological baselines, comparative speech and non-verbal communication evaluations, as well as an assessment of the prospects in some key areas of technology.

Machines that can communicate like human beings?

Project coordinator Fabio Pianesi of the Istituto Trentino di Cultura in Italy explains MMC as follows, “It’s the kind of technology that you need if you want to communicate with the same facility to both the PC and other human beings. The PC needs to be capable of interpreting and reproducing your gestures and facial expressions, as well as the emotion expressed in your speech, in the same way as humans do.”

Interpreting such subtle visual and aural cues, as well as the meaning of the spoken word, is a highly complex business. Facial expression, gesture, and even variations in pitch and tone of the voice all play their part in the way human beings interact. We use and respond to such subtle elements of human communication in our day-to-day lives almost without being aware of it, since our training in such communication develops from birth.

The challenge for the researchers is how to get a machine to interpret and reproduce such communication subtleties. Linguists have for many years reckoned the task to be near impossible given the number of channels and the complexity of signals involved. However PF-STAR’s work has provided a promising foundation on which future research can develop.

Virtual agents for intelligent interaction

The project partners in PF-STAR have built on several years of research within a variety of national and international projects, most notably NESPOLE!, C-STAR, Verbmobil and SmartKom. In PF-STAR, work focused on three key technological areas: speech-to-speech translation, the detection and expression of emotional states in both verbal and non-verbal channels, and core speech technologies for children. The partners also worked in five languages: English, German, Italian, Spanish and Swedish.

Two project partners, the Royal Institute of Technology (KTH) in Stockholm and the Istituto Trentino di Cultura, hired professional actors at the start of the project to study how speech tone and facial expressions changed while expressing emotions. This data was then fed into the project databases, which led to the development of a series of on-screen facial images, or ‘talking heads’, that offered a machine-based visual alternative to the human face.

These on-screen talking heads, which could be either 2D or 3D facial images, are designed to act as ‘virtual agents’ that can interact intelligently with human beings, other agents or, depending on their level of autonomy, the environment around them. Such virtual agents are believed to have a huge potential for future man/machine communication, in applications from teaching through helpdesks to entertainment.

The project has also allowed for variations in facial expression resulting from cultural differences, says Pianesi. “We should not forget that the expression of emotion is culturally dependent. We had to adapt the expressions on the talking heads to the language concerned, to see how our hypotheses work in the different countries.”

Speech technologies for children were a key area of research for the participants. Error rates for machine-based translation of children’s speech are believed to be some 100 per cent greater than for adults. To help improve such recognition rates, the partners used on-screen virtual agents based on children’s faces rather than on those of adults.

Strong foundation for future research

PF-STAR has laid strong foundations for further research into MMC, says Pianesi. “Two years ago there were no real databases available covering children’s speech, for example. Now we have such speech databases, as well as visual and gesture databases, that we are making available to partners and others.”

The project has also produced several new approaches to machine-based communication. The virtual agents for example are capable of reproducing the emotions expressed, either verbally or as facial expressions, along with the semantics of the message. They can be set to use either both channels (i.e. verbal and non-verbal), or only one.

And the results are more than just data, stresses Pianesi. Since August 2004 the project has made available the databases, the platform and the software for constructing virtual agents, as well as the code to enable further development to be carried out.

Development continues

While PF-STAR is now complete, the project partners are maintaining their development work in the basic technology of machine-based translation. As well as further improving the virtual agents, they are continuing to distribute the technology to client organisations to gain vital feedback on its use. Some of the partners have also commenced within the Sixth Framework Programme (FP6) a project called TC-STAR, a six-year project focused on exploring and evaluating new approaches to machine-based translation, and for creating the infrastructure needed for accelerating the rate of progress in the field.

The area of children’s speech remains of particular interest, says Pianesi. “How can we develop interfaces for instruction, for entertainment and so on, that are suitable for children? How can we produce suitable outputs for children?” Certain partners have come together within another FP6 project, CHIL, to further research children’s communication in schools.

Tara Morris | alfa
Further information:

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Light-driven atomic rotations excite magnetic waves

Terahertz excitation of selected crystal vibrations leads to an effective magnetic field that drives coherent spin motion

Controlling functional properties by light is one of the grand goals in modern condensed matter physics and materials science. A new study now demonstrates how...

Im Focus: New 3-D wiring technique brings scalable quantum computers closer to reality

Researchers from the Institute for Quantum Computing (IQC) at the University of Waterloo led the development of a new extensible wiring technique capable of controlling superconducting quantum bits, representing a significant step towards to the realization of a scalable quantum computer.

"The quantum socket is a wiring method that uses three-dimensional wires based on spring-loaded pins to address individual qubits," said Jeremy Béjanin, a PhD...

Im Focus: Scientists develop a semiconductor nanocomposite material that moves in response to light

In a paper in Scientific Reports, a research team at Worcester Polytechnic Institute describes a novel light-activated phenomenon that could become the basis for applications as diverse as microscopic robotic grippers and more efficient solar cells.

A research team at Worcester Polytechnic Institute (WPI) has developed a revolutionary, light-activated semiconductor nanocomposite material that can be used...

Im Focus: Diamonds aren't forever: Sandia, Harvard team create first quantum computer bridge

By forcefully embedding two silicon atoms in a diamond matrix, Sandia researchers have demonstrated for the first time on a single chip all the components needed to create a quantum bridge to link quantum computers together.

"People have already built small quantum computers," says Sandia researcher Ryan Camacho. "Maybe the first useful one won't be a single giant quantum computer...

Im Focus: New Products - Highlights of COMPAMED 2016

COMPAMED has become the leading international marketplace for suppliers of medical manufacturing. The trade fair, which takes place every November and is co-located to MEDICA in Dusseldorf, has been steadily growing over the past years and shows that medical technology remains a rapidly growing market.

In 2016, the joint pavilion by the IVAM Microtechnology Network, the Product Market “High-tech for Medical Devices”, will be located in Hall 8a again and will...

All Focus news of the innovation-report >>>



Event News

#IC2S2: When Social Science meets Computer Science - GESIS will host the IC2S2 conference 2017

14.10.2016 | Event News

Agricultural Trade Developments and Potentials in Central Asia and the South Caucasus

14.10.2016 | Event News

World Health Summit – Day Three: A Call to Action

12.10.2016 | Event News

Latest News

Oasis of life in the ice-covered central Arctic

24.10.2016 | Earth Sciences

‘Farming’ bacteria to boost growth in the oceans

24.10.2016 | Life Sciences

Light-driven atomic rotations excite magnetic waves

24.10.2016 | Physics and Astronomy

More VideoLinks >>>