A new computer algorithm developed at the University of Washington uses hundreds of thousands of tourist photos to automatically reconstruct an entire city in about a day.
The tool is the most recent in a series developed at the UW to harness the increasingly large digital photo collections available on photo-sharing Web sites. The digital Rome was built from 150,000 tourist photos tagged with the word "Rome" or "Roma" that were downloaded from the popular photo-sharing Web site, Flickr.
Computers analyzed each image and in 21 hours combined them to create a 3-D digital model. With this model a viewer can fly around Rome’s landmarks, from the Trevi Fountain to the Pantheon to the inside of the Sistine Chapel.
"How to match these massive collections of images to each other was a challenge," said Sameer Agarwal, a UW acting assistant professor of computer science and engineering and lead author of a paper being presented in October at the International Conference on Computer Vision in Kyoto, Japan. Until now, he said, "even if we had all the hardware we could get our hands on and then some, a reconstruction using this many photos would take forever."
Earlier versions of the UW photo-stitching technology are known as Photo Tourism. That technology was licensed in 2006 to Microsoft, which now offers it as a free tool called Photosynth.
"With Photosynth and Photo Tourism, we basically reconstruct individual landmarks. Here we're trying to reconstruct entire cities," said co-author Noah Snavely, who developed Photo Tourism as his UW doctoral work and is now an assistant professor at Cornell University.
Other co-authors of the new paper are Rick Szeliski of Microsoft Research, UW computer science professor Steve Seitz and UW graduate student Ian Simon.
In addition to Rome, the team recreated the Croatian coastal city of Dubrovnik, processing 60,000 images in less than 23 hours using a cluster of 350 computers, and Venice, Italy, processing 250,000 images in 65 hours using a cluster of 500 computers. Many historians see Venice as a candidate for digital preservation before water does more damage to the city, the researchers said.
Transitioning from landmarks to cities – going from hundreds of photos to hundreds of thousands of photos – is not trivial. Previous versions of the Photo Tourism software matched each photo to every other photo in the set. But as the number of photos increases, the number of matches explodes, growing with the square of the number of photos. A set of 250,000 images would take at least a year for 500 computers to process, Agarwal said. A million photos would take more than a decade.
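The arithmetic behind those estimates can be checked with a short sketch. It assumes that running time is proportional to the number of photo pairs and takes the article's "at least a year" for 250,000 images as the baseline; the function names are illustrative and do not come from the UW code.

```python
def pair_count(n: int) -> int:
    """Number of unordered pairs when every photo is matched to every other."""
    return n * (n - 1) // 2


def scaled_time(base_time: float, base_n: int, n: int) -> float:
    """Scale a known running time to n photos, assuming time grows with pair count."""
    return base_time * pair_count(n) / pair_count(base_n)


# Baseline from the article: about one year for 250,000 images on 500 computers.
years_for_million = scaled_time(1.0, 250_000, 1_000_000)
print(f"{pair_count(250_000):,} pairs for 250,000 photos")
print(f"about {years_for_million:.0f} years for 1,000,000 photos")
```

Four times as many photos means roughly sixteen times as many pairs, which is consistent with the "more than a decade" figure for a million photos.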
The newly developed code works more than a hundred times faster than the previous version. It first identifies likely matches and then concentrates its effort on those pairs of photos. The code also uses parallel processing techniques, allowing it to run simultaneously on many computers, or even on remote servers connected through the Internet.
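The article does not say how likely matches are found, so the following is only a hedged illustration of the general idea, not the researchers' method. It assumes each photo has already been reduced to a small set of hypothetical "visual word" IDs, and it proposes a pair for detailed matching only when the two photos share enough of those IDs. All names and data here are made up for the example.

```python
from collections import defaultdict
from itertools import combinations


def candidate_pairs(photo_words, min_shared=2):
    """Propose photo pairs that share at least `min_shared` visual-word IDs.

    photo_words maps a photo name to a set of integer IDs (an assumed,
    precomputed summary of the image). Only the proposed pairs would go on
    to the expensive detailed matching step.
    """
    # Inverted index: visual word -> photos that contain it.
    index = defaultdict(set)
    for photo, words in photo_words.items():
        for word in words:
            index[word].add(photo)

    # Count shared words only for photos that co-occur under some word.
    shared = defaultdict(int)
    for photos in index.values():
        for a, b in combinations(sorted(photos), 2):
            shared[(a, b)] += 1

    return {pair for pair, count in shared.items() if count >= min_shared}


# Toy data: five photos, hypothetical word IDs.
photo_words = {
    "trevi_1": {1, 2, 3},
    "trevi_2": {2, 3, 4},
    "pantheon_1": {7, 8, 9},
    "pantheon_2": {8, 9, 10},
    "street": {3, 11},
}
pairs = candidate_pairs(photo_words)
print(sorted(pairs))
```

With five photos there are ten possible pairs, but only the two that share enough visual words are proposed here, so the detailed matching step has far less work to do. Because each candidate pair can be checked independently, that work also divides naturally across many machines.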
The new, faster code makes it possible to tackle more ambitious projects.
"If a city reconstruction took several months, it would be just about building Rome," Seitz said. "But on a timeline of one day you can methodically start going through all the cities and start building models of them."
This technique could create online maps that offer viewers a virtual-reality experience. The software could build cities for video games automatically, instead of doing so by hand. It also might be used in architecture for digital preservation of cities, or integrated with online maps, Seitz said.
In the near term, the “Rome in a Day” code could be used with Photo Tourism, Photosynth or other software designed to view the model output.
The research was supported by the National Science Foundation, the Office of Naval Research and its SPAWAR lab, Microsoft Research, and Google.
For more information, contact Agarwal at 206-543-6876 or email@example.com and Seitz at 206-616-9431 or firstname.lastname@example.org.
The project Web site is http://grail.cs.washington.edu/rome/.
Agarwal | Newswise Science News