The DILIGENT team used the EGEE computing Grid to process 37 million images from the online Flickr database in just 16 weeks. This computation generated approximately 112 million text and image objects (nearly 5 TB of data) containing more than 150 million extracted features. This is equivalent to an average processing rate of over 300,000 images per day.
This unique collection will be used by the SAPIR project to develop new large-scale techniques for content-based data retrieval and automatic data classification that combine text and image content. This goes beyond the limits of conventional search engines, which can only search the text associated with images and audio-visual content.
The computational load required to generate this massive data collection was outsourced to DILIGENT, which in turn delegated it to the EGEE Pre-Production Service (PPS) Grid infrastructure via the gLite middleware. A total of 44,333 gLite jobs were successfully executed by the EGEE PPS infrastructure resource broker. Each job processed approximately 1,000 images.
The data challenge lasted for 116 days, from 16 June to 9 October 2007, and was organized in three phases. During the initial preparation phase, experimental jobs were submitted to some EGEE PPS sites to test the feature extraction application and optimize the number of images to process per day.
The next two phases involved actual execution of the data challenge, exploiting ten EGEE PPS sites that contributed their computational resources: University of Athens, Scuola Normale Superiore, ISTI-CNR, LIP, ESA-ESRIN, CERN, CESGA, University of Macedonia, Ben Gurion University, and CYFRONET. Four of these sites are maintained by DILIGENT partners.
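The headline figures can be cross-checked with a few lines of arithmetic. The following is a minimal sketch in Python that uses only the numbers quoted in this article. The variable names are illustrative, and counting both the first and last day of the challenge is an assumption made here to match the stated 116-day duration.

```python
from datetime import date

# Figures quoted in the article
IMAGES_PROCESSED = 37_000_000
START = date(2007, 6, 16)
END = date(2007, 10, 9)

# Assumption: both the first and the last day are counted,
# which is what reproduces the quoted duration
duration_days = (END - START).days + 1

# Average number of images processed per day over the whole challenge
images_per_day = IMAGES_PROCESSED / duration_days

print(f"Duration: {duration_days} days")
print(f"Average throughput: {images_per_day:,.0f} images per day")
```

Under that assumption the duration comes out at 116 days and the average at roughly 319,000 images per day, consistent with the "over 300,000 images per day" stated above.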