While researchers started the software design seven years ago, it is only now that the music world is beginning to meet the conditions for exploiting what Hugues Vinet, the research coordinator, bills as the “first of its kind” large-scale research project for automatically extracting and classifying audio signals.
The extracted descriptors, known as metadata, can be used to tag audio files so that search engines equipped to handle this kind of information can find them more accurately. Standardising metadata for audiovisual media is the goal of the new MPEG-7 specification, to which the project partners contributed input on descriptors such as musical timbre.
The software could be the next big step in boosting online music sales, as it could allow companies to exploit their archives more thoroughly and help consumers dig out tracks they might not have discovered otherwise.
“We are in concrete discussions with a number of interested companies on using some of the developments from our project,” Vinet says. “We are finally starting to collaborate with companies to market these resources. Such software still does not exist in any way.”
Vinet, who is scientific director at the Paris-based Institute for Music/Acoustic Research and Coordination (IRCAM), was part of a team that included researchers from universities in Spain and Israel, along with companies such as Oracle and Sony. The EU-funded project was called Cuidado.
The packages they developed – consisting of a music browser, an online sound palette and sound authoring software – can analyse and index sound according to the digital patterns displayed by each particular song. To do this, the researchers developed a number of techniques for capturing specific qualities from audio files, such as timbre, energy and rhythm.
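To give a feel for what capturing qualities such as energy and timbre from an audio file can mean in practice, here is a minimal Python sketch. It is not the Cuidado code: the function names (`rms_energy`, `spectral_centroid`, `describe`, `sine`) are hypothetical, the spectral centroid is used only as a common rough proxy for the "brightness" aspect of timbre, and a naive discrete Fourier transform stands in for whatever analysis the researchers actually used.

```python
import cmath
import math


def rms_energy(samples):
    """Root-mean-square energy of one frame of audio samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def spectral_centroid(samples, sample_rate):
    """Magnitude-weighted mean frequency in Hz (a rough brightness cue)."""
    n = len(samples)
    weighted = 0.0
    total = 0.0
    for k in range(n // 2 + 1):
        # Naive DFT coefficient for bin k (slow, but dependency-free).
        coeff = sum(
            s * cmath.exp(-2j * math.pi * k * i / n)
            for i, s in enumerate(samples)
        )
        magnitude = abs(coeff)
        weighted += (k * sample_rate / n) * magnitude
        total += magnitude
    return weighted / total if total else 0.0


def describe(samples, sample_rate):
    """Bundle the two descriptors into a small metadata record."""
    return {
        "energy": rms_energy(samples),
        "centroid_hz": spectral_centroid(samples, sample_rate),
    }


def sine(freq, sample_rate=8000, n=256, amp=1.0):
    """Synthetic test tone, used here in place of a real recording."""
    return [amp * math.sin(2 * math.pi * freq * i / sample_rate) for i in range(n)]


if __name__ == "__main__":
    print(describe(sine(500), 8000))
    print(describe(sine(2000), 8000))
```

A higher-pitched tone yields a higher centroid, and a louder one a higher energy value; records of this kind are what a content-aware search engine would index alongside the usual text tags.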
This system goes far beyond the methods used online by the music industry, which is slowly warming up to selling music over the internet.
Currently, music download sites depend heavily on manual entry of the basic text metadata needed to generate the kind of suggestions that might hook consumers into making a purchase. The Cuidado packages produce complementary metadata based on audio descriptors, so any search engine equipped to handle such information can take the actual sonic content of the tracks into account and return more accurate results.
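How a search engine might use such audio descriptors can be illustrated with a small nearest-neighbour sketch in Python. Everything here is an assumption made for illustration: the track names and descriptor values in `CATALOG` are invented, the helper names (`normalise`, `most_similar`) are hypothetical, and plain Euclidean distance over standardised descriptors is a generic stand-in, not the similarity measure used in Cuidado.

```python
import math

# Invented descriptor values for four imaginary tracks (illustration only).
CATALOG = {
    "track_a": {"energy": 0.80, "centroid_hz": 3000.0, "tempo_bpm": 128.0},
    "track_b": {"energy": 0.75, "centroid_hz": 2800.0, "tempo_bpm": 126.0},
    "track_c": {"energy": 0.20, "centroid_hz": 900.0, "tempo_bpm": 70.0},
    "track_d": {"energy": 0.25, "centroid_hz": 1000.0, "tempo_bpm": 72.0},
}


def normalise(catalog):
    """Scale each descriptor to zero mean and unit variance across the catalogue."""
    keys = sorted(next(iter(catalog.values())))
    stats = {}
    for key in keys:
        values = [desc[key] for desc in catalog.values()]
        mean = sum(values) / len(values)
        std = math.sqrt(sum((v - mean) ** 2 for v in values) / len(values)) or 1.0
        stats[key] = (mean, std)
    return {
        name: [(desc[key] - stats[key][0]) / stats[key][1] for key in keys]
        for name, desc in catalog.items()
    }


def most_similar(catalog, query, top_n=3):
    """Return the names of the tracks closest to `query` in descriptor space."""
    vectors = normalise(catalog)
    target = vectors[query]
    ranked = sorted(
        (math.dist(target, vec), name)
        for name, vec in vectors.items()
        if name != query
    )
    return [name for _, name in ranked[:top_n]]


if __name__ == "__main__":
    print(most_similar(CATALOG, "track_a", top_n=1))
```

Standardising the descriptors first keeps a large-valued feature such as the centroid in hertz from dominating the distance, which matters whenever descriptors are measured on very different scales.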
Of particular interest is the ability of the software to make automatic connections to music tracks that cross over into other categories a listener might not have thought of, and enable new discoveries.
This ability would allow music companies to exploit their vast back catalogues, a lot of which are unavailable at the local music store.
Business tuning into the potential
For example, researchers at Ecoute, a France-based project, are using some of the techniques developed by the Cuidado team to create a portal for electronic music distribution.
IRCAM is also working on audio sample management based on Cuidado indexing and content-based management and retrieval techniques.
The research results obtained by IRCAM are currently being further developed and applied as part of France’s national Sample Orchestrator project. This project is designing a new-generation audio-software sampler, a software instrument based on a database of recorded sounds.
According to Vinet, such techniques would be useful not only for delivering a new generation of musical instruments, but also for designing special effects for cinema and TV, or for the management of databases in specific applications, such as sounds of animals, engines and boats.
The sampler will include advanced content-based search features, built around different approaches initiated by the Cuidado team, including search by perceptual similarity, says Vinet.
What comes to pass
Sony notes on its internet site that its interest in Cuidado, which ran from January 2001 to December 2003, is related to the development of techniques that would allow the sharing of musical tastes and information within online communities.
“The Cuidado project enabled us to gather a core of experts together to develop a vision and a new set of audio extraction technologies,” Vinet says. “It helped establish us as international leaders with multidisciplinary competences in this area. It is evident that what we foresaw, the evolution of the music industry to full digital distribution, is coming to pass.”
If so, full digital distribution of the music industry’s vast archives, coupled with powerful search engines based on Cuidado’s techniques, could put the power of the beat into listeners’ hands.
Christian Nielsen | alfa