Forum for Science, Industry and Business

Algorithm enables computers to identify actions much more efficiently

14.05.2014

Techniques from natural-language processing enable computers to efficiently search video for actions

With the commodification of digital cameras, digital video has become so easy to produce that human beings can have trouble keeping up with it. Among the tools that computer scientists are developing to make the profusion of video more useful are algorithms for activity recognition, that is, for determining what the people on camera are doing and when.

At the Conference on Computer Vision and Pattern Recognition in June, Hamed Pirsiavash, a postdoc at MIT, and his former thesis advisor, Deva Ramanan of the University of California at Irvine, will present a new activity-recognition algorithm that has several advantages over its predecessors.

One is that the algorithm's execution time scales linearly with the size of the video file it's searching. That means that if one file is 10 times the size of another, the new algorithm will take 10 times as long to search it, not 1,000 times as long, as some earlier algorithms with cubic scaling would.

Another is that the algorithm is able to make good guesses about partially completed actions, so it can handle streaming video. Partway through an action, it will issue a probability that the action is of the type that it's looking for. It may revise that probability as the video continues, but it doesn't have to wait until the action is complete to assess it.

Finally, the amount of memory the algorithm requires is fixed, regardless of how many frames of video it's already reviewed. That means that, unlike many of its predecessors, it can handle video streams of any length (or files of any size).
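The three properties above (linear time, running estimates on partial actions, fixed memory) can be illustrated with a small sketch. This is not the researchers' algorithm: it is a generic left-to-right dynamic program, and the function name `stream_best_scores` and the per-frame score lists are hypothetical. It assumes some upstream model has already scored each frame against each subaction, and it reports unnormalized scores rather than probabilities.

```python
from typing import Iterable, Iterator, List, Sequence

NEG_INF = float("-inf")


def stream_best_scores(
    frame_scores: Iterable[Sequence[float]], num_subactions: int
) -> Iterator[float]:
    """Yield, after every frame, the score of the best labeling so far.

    Assumption: subactions occur in order 0, 1, ..., K-1 with no skipping,
    and frame_scores[t][k] says how well frame t matches subaction k.
    Only K running scores are stored, however many frames arrive.
    """
    best: List[float] = [NEG_INF] * num_subactions
    first_frame = True
    for scores in frame_scores:
        new = [NEG_INF] * num_subactions
        if first_frame:
            # The action has to begin in its first subaction.
            new[0] = scores[0]
            first_frame = False
        else:
            for k in range(num_subactions):
                stay = best[k]
                advance = best[k - 1] if k > 0 else NEG_INF
                new[k] = max(stay, advance) + scores[k]
        best = new
        # A running estimate is available before the action is complete.
        yield max(best)


if __name__ == "__main__":
    # Made-up scores for four frames and two subactions.
    frames = [[1, 0], [1, 0], [0, 2], [0, 2]]
    print(list(stream_best_scores(frames, num_subactions=2)))
```

Each frame costs a fixed amount of work and the state never grows, so a file 10 times as long takes roughly 10 times as long to process in this sketch.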

The grammar of action

Enabling all of these advances is the appropriation of a type of algorithm used in natural language processing, the computer science discipline that seeks techniques for interpreting sentences written in natural language.

"One of the challenging problems they try to solve is, if you have a sentence, you want to basically parse the sentence, saying what is the subject, what is the verb, what is the adverb," Pirsiavash says. "We see an analogy here, which is, if you have a complex action — like making tea or making coffee — that has some subactions, we can basically stitch together these subactions and look at each one as something like verb, adjective, and adverb."

On that analogy, the rules defining relationships between subactions are like rules of grammar. When you make tea, for instance, it doesn't matter whether you first put the teabag in the cup or put the kettle on the stove. But it's essential that you put the kettle on the stove before pouring the water into the cup. Similarly, in a given language, it could be the case that nouns can either precede or follow verbs, but that adjectives must always precede nouns.
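The tea example can be written down as a single ordering rule. The sketch below is only an illustration of the analogy: the step names and the `obeys_grammar` helper are hypothetical, and the rule set contains just the one constraint mentioned above.

```python
from typing import List, Sequence, Tuple

# One rule from the tea example: the kettle goes on the stove
# before the water is poured. The step names are made up.
TEA_RULES: List[Tuple[str, str]] = [("kettle_on_stove", "pour_water")]


def obeys_grammar(steps: Sequence[str], must_precede: List[Tuple[str, str]]) -> bool:
    """Return True if every (earlier, later) rule holds for the observed steps."""
    steps = list(steps)
    for earlier, later in must_precede:
        if later in steps:
            if earlier not in steps or steps.index(earlier) > steps.index(later):
                return False
    return True


if __name__ == "__main__":
    # The teabag may come before or after the kettle ...
    print(obeys_grammar(["teabag_in_cup", "kettle_on_stove", "pour_water"], TEA_RULES))
    print(obeys_grammar(["kettle_on_stove", "teabag_in_cup", "pour_water"], TEA_RULES))
    # ... but pouring before heating breaks the rule.
    print(obeys_grammar(["pour_water", "kettle_on_stove", "teabag_in_cup"], TEA_RULES))
```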

For any given action, Pirsiavash and Ramanan's algorithm must thus learn a new "grammar." And the mechanism that it uses is the one that many natural-language-processing systems rely on: machine learning. Pirsiavash and Ramanan feed their algorithm training examples of videos depicting a particular action, and specify the number of subactions that the algorithm should look for. But they don't give it any information about what those subactions are, or what the transitions between them look like.

Pruning possibilities

The rules relating subactions are the key to the algorithm's efficiency. As a video plays, the algorithm constructs a set of hypotheses about which subactions are being depicted where, and it ranks them according to probability. It can't limit itself to a single hypothesis, as each new frame could require it to revise its probabilities. But it can eliminate hypotheses that don't conform to its grammatical rules, which dramatically limits the number of possibilities it has to canvass.
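A toy version of this pruning shows how much a grammar can shrink the search. The sketch assumes a very simple grammar (the action starts in subaction 0, and each frame either stays in the current subaction or moves to the next one) and made-up per-frame probabilities. It enumerates every labeling by brute force purely for illustration; the names `is_grammatical` and `ranked_hypotheses` are hypothetical and this is not the researchers' implementation.

```python
from itertools import product
from math import prod
from typing import List, Sequence, Tuple


def is_grammatical(labels: Sequence[int]) -> bool:
    """Assumed grammar: start at subaction 0, then stay or advance by one."""
    if labels[0] != 0:
        return False
    return all(b - a in (0, 1) for a, b in zip(labels, labels[1:]))


def ranked_hypotheses(frame_probs: Sequence[Sequence[float]]) -> List[Tuple[int, ...]]:
    """Drop labelings that break the grammar, then rank the rest by likelihood."""
    num_subactions = len(frame_probs[0])
    candidates = product(range(num_subactions), repeat=len(frame_probs))
    survivors = [labels for labels in candidates if is_grammatical(labels)]

    def likelihood(labels: Tuple[int, ...]) -> float:
        return prod(frame_probs[t][k] for t, k in enumerate(labels))

    return sorted(survivors, key=likelihood, reverse=True)


if __name__ == "__main__":
    # Hypothetical probabilities: 4 frames, 3 subactions.
    probs = [
        [0.8, 0.1, 0.1],
        [0.6, 0.3, 0.1],
        [0.2, 0.6, 0.2],
        [0.1, 0.2, 0.7],
    ]
    ranked = ranked_hypotheses(probs)
    print(len(ranked), "of", 3 ** len(probs), "labelings survive")
    print("best hypothesis:", ranked[0])
```

With three subactions and four frames, only 7 of the 81 possible labelings satisfy the assumed grammar, and several ranked hypotheses remain available to be revised as further frames arrive.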

The researchers tested their algorithm on eight different types of athletic endeavor — such as weightlifting and bowling — with training videos culled from YouTube. They found that, according to metrics standard in the field of computer vision, their algorithm identified new instances of the same activities more accurately than its predecessors.

Pirsiavash is particularly interested in possible medical applications of action detection. The proper execution of physical-therapy exercises, for instance, could have a grammar that's distinct from improper execution; similarly, the return of motor function in patients with neurological damage could be identified by its unique grammar. Action-detection algorithms could also help determine whether, for instance, elderly patients remembered to take their medication — and issue alerts if they didn't.

Abby Abazorius | newswise
Further information:
http://www.mit.edu


