Intuitive Visual Control Provides Faster Robot Operation

Using a novel method of integrating video technology and familiar control devices, a research team from Georgia Tech and the Georgia Tech Research Institute (GTRI) is developing a technique to simplify remote control of robotic devices.

The researchers' aim is to enhance a human operator's ability to perform precise tasks using a multi-jointed robotic device such as an articulated mechanical arm. The new approach has been shown to be easier and faster than older methods, especially when the operator controls the robot while watching it on a video monitor.

Known as Uncalibrated Visual Servoing for Intuitive Human Guidance of Robots, the new method uses a special implementation of an existing vision-guided control method called visual servoing (VS). By applying visual-servoing technology in innovative ways, the researchers have constructed a robotic system that responds to human commands more directly and intuitively than older techniques.

"Our approach exploits 3-D video technology to let an operator guide a robotic device in ways that are more natural and time-saving, yet are still very precise," said Ai-Ping Hu, a GTRI senior research engineer who is leading the effort. "This capability could have numerous applications – especially in situations where directly observing the robot's operation is hazardous or not possible – including bomb disposal, handling of hazardous materials and search-and-rescue missions."

A paper on this technology was presented at the 2012 IEEE International Conference on Robotics and Automation held in St. Paul, Minn.

For decades articulated robots have been used by industry to perform precision tasks such as welding vehicle seams or assembling electronics, Hu explained. The user develops a software program that enables the device to cycle through the required series of motions, using feedback from sensors built into the robot.

But such programming can be complex and time-consuming. The robot must typically be maneuvered joint by joint through the numerous actions required to complete a task. Moreover, such technology works only in a structured and unchanging environment, such as a factory assembly line, where spatial relationships are constant.

The Human Operator

In recent years, new techniques have enabled human operators to freely guide remote robots through unstructured and unfamiliar environments to perform such challenging tasks as bomb disposal, Hu said. Operators have controlled the device in one of two ways: by "line of sight" – direct user observation – or by means of a conventional two-dimensional camera that is mounted on the robot to send back an image of both the robot and its target.

But humans guiding robots via either method face some of the same complexities that challenge those who program industrial robots, he added. Manipulating a remote robot into place is generally slow and laborious.

That's especially true when the operator must depend on the imprecise images provided by 2-D video feedback. Manipulating separate controls for each of the robot's multiple joint axes, users have only limited visual information to help them and must maneuver to the target by trial and error.

"Essentially, the user is trying to visualize and reconstruct a 3-D scenario from flat 2-D camera images," Hu said. "The process can become particularly confusing when operators are facing in a different direction from the robot and must mentally reorient themselves to try to distinguish right from left. It's somewhat similar to backing up a vehicle with an attached trailer – you have to turn the steering wheel to the left to get the trailer to move right, which is decidedly non-intuitive."

The Visual Servoing Advantage

To simplify user control, the Georgia Tech team turned to visual servoing (a term synonymous with visual activation). Visual servoing has been studied for years as a way to use video cameras to help robots re-orient themselves within a structured environment such as an assembly line.

Traditional visual servoing is calibrated, meaning that position information generated by a video camera can be transformed into data meaningful to the robot. Using these data, the robot can adjust itself to stay in a correct spatial relationship with target objects.
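As a rough illustration of what "calibrated" means here, the sketch below maps a point measured in a camera's coordinate frame into a robot's coordinate frame using a known rotation and translation. The function name, the rotation matrix and the offsets are all hypothetical values chosen for the example; they are not taken from the Georgia Tech system.

```python
import math

def camera_to_robot(point_cam, rotation, translation):
    """Map a 3-D point from the camera frame into the robot frame.

    rotation is a 3x3 matrix (list of rows) and translation a 3-vector.
    Together they stand in for the calibration (hypothetical values below).
    """
    return [
        sum(rotation[i][j] * point_cam[j] for j in range(3)) + translation[i]
        for i in range(3)
    ]

# Hypothetical calibration: camera rotated 90 degrees about the robot's
# z axis, offset 0.5 m along x and 1.0 m along z.
theta = math.pi / 2
R = [
    [math.cos(theta), -math.sin(theta), 0.0],
    [math.sin(theta),  math.cos(theta), 0.0],
    [0.0,              0.0,             1.0],
]
t = [0.5, 0.0, 1.0]

target_cam = [0.2, 0.0, 0.0]  # target seen 0.2 m along the camera's x axis
target_robot = camera_to_robot(target_cam, R, t)
print(target_robot)  # the same target, expressed in robot coordinates
```

Without such a calibration the rotation and translation are unknown, which is the situation uncalibrated approaches have to work around.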

"Say a conveyor line is accidentally moved a few millimeters," Hu said. "A robot with a calibrated visual servoing capability can automatically detect the movement using the video image and a fixed reference point, and then readjust to compensate."

But visual servoing offers additional possibilities. The research team – which includes Hu, associate professor Harvey Lipkin of the School of Mechanical Engineering, graduate student Matthew Marshall, GTRI research engineer Michael Matthews and GTRI principal research engineer Gary McMurray – has adapted visual-servoing technology in ways that facilitate human control of remote robots.

The new technique takes advantage of both calibrated and uncalibrated techniques. A calibrated 3-D "time of flight" camera is mounted on the robot – typically at the end of a robotic arm, in a gripping device called an end-effector. This approach is sometimes called an eye-in-hand system, because of the camera's location in the robot's "hand."

The camera utilizes an active sensor that detects depth data, allowing it to send back 3-D coordinates that pinpoint the end-effector's spatial location. At the same time, the eye-in-hand camera also supplies a standard, uncalibrated 2-D grayscale video image to the operator's monitor.

The result is that the operator, without seeing the robot, now has a robot's-eye view of the target. Watching this image on a monitor, an operator can visually guide the robot using a gamepad, in a manner somewhat reminiscent of a first-person 3-D video game.

In addition, visual-servoing technology now automatically actuates all the joints needed to complete whatever action the user indicates on the gamepad – rather than the user having to manipulate those joints one by one. In the background, the Georgia Tech system performs the complex computation needed to coordinate the monitor image, the 3-D camera information, the robot's spatial position and the user's gamepad commands.
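The idea of one directional command driving several joints at once can be sketched with a textbook example. The code below is not the team's algorithm: it assumes a hypothetical planar arm with two joints, uses the arm's analytic Jacobian rather than camera feedback, and treats "left" as motion along the negative x axis. It only shows how a single desired tip velocity is converted into coordinated rates for every joint.

```python
import math

LEN1, LEN2 = 0.4, 0.3  # hypothetical link lengths in metres

def jacobian(q1, q2):
    """Analytic Jacobian of a planar two-link arm's tip position."""
    s1, c1 = math.sin(q1), math.cos(q1)
    s12, c12 = math.sin(q1 + q2), math.cos(q1 + q2)
    return [
        [-LEN1 * s1 - LEN2 * s12, -LEN2 * s12],
        [ LEN1 * c1 + LEN2 * c12,  LEN2 * c12],
    ]

def joint_rates(q1, q2, vx, vy):
    """Solve J * qdot = v for the joint rates (2x2 case, Cramer's rule)."""
    (a, b), (c, d) = jacobian(q1, q2)
    det = a * d - b * c
    if abs(det) < 1e-9:
        raise ValueError("arm is at a singular configuration")
    return [(d * vx - b * vy) / det, (-c * vx + a * vy) / det]

# "Left" on the gamepad, taken here as 0.05 m/s along negative x (assumption).
q = (math.radians(30), math.radians(45))
v = (-0.05, 0.0)
qdot = joint_rates(q[0], q[1], v[0], v[1])
print(qdot)  # both joints move together to produce the single leftward motion
```

In an uncalibrated, camera-driven system the Jacobian would typically have to be estimated from image data rather than written down from known geometry, but the operator-facing result is the same: one command in, all joint motions out.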

Testing System Usability

"The guidance process is now very intuitive – pressing 'left' on the gamepad will actuate all the requisite robot joints to effect a leftward displacement," Hu said. "What's more, the robot could be upside down and the controls will still respond in the same intuitive way – left is still left and right is still right."

To judge system usability, the Georgia Tech research team recently conducted trials to test whether the visual-servoing approach enabled faster task-completion times. Using a gamepad that controls an articulated-arm robot with six degrees of freedom, subjects performed four tests: they used visual-servoing guidance as well as conventional joint-based guidance, in both line-of-sight and camera-view modes.

In the line-of-sight test, volunteer participants using visual-servoing guidance averaged task-completion times that were 15 percent faster than when they used joint-based guidance. However, in camera-view mode, participants using visual-servoing guidance averaged 227 percent faster results than with the joint-based technique.
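The release does not give the underlying completion times, so the figures can only be interpreted, not reproduced. One common reading, assumed here and not confirmed by the source, is that "p percent faster" means the task rate rose by p percent, so the completion time shrinks by a factor of 1 / (1 + p/100). The helper below (a hypothetical name) applies that reading to the two reported percentages.

```python
def time_ratio(percent_faster):
    """Completion time relative to the baseline, ASSUMING "p percent faster"
    means the task rate increased by p percent."""
    return 1.0 / (1.0 + percent_faster / 100.0)

for label, pct in [("line of sight", 15), ("camera view", 227)]:
    ratio = time_ratio(pct)
    print(f"{label}: visual servoing takes about {ratio:.0%} of the joint-based time")
```

Under this assumption, visual-servoing guidance would take roughly 87 percent of the joint-based time in line-of-sight mode and roughly 31 percent in camera-view mode.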

Hu noted that the visual-servoing system used in this test scenario was only one of numerous possible applications of the technology. The research team's plans include testing a mobile platform with a VS-guided robotic arm mounted on it. Also underway is a proof-of-concept effort that incorporates visual-servoing control into a low-cost, consumer-level robot.

"Our ultimate goal is to develop a generic, uncalibrated control framework that is able to use image data to guide many different kinds of robots," he said.

Research News & Publications Office
Georgia Institute of Technology
75 Fifth Street, N.W., Suite 309
Atlanta, Georgia 30308 USA
Media Relations Contact: John Toon (404-894-6986)
Writer: Rick Robinson

John Toon | Newswise Science News