Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

The first public electronic Spanish dictionary on the Internet

11.02.2008
The first public domain and freely distributed electronic Spanish dictionary was developed as part of the COES project, led by Santiago Rodríguez, professor of the Universidad Politécnica de Madrid’s School of Computing (FIUPM), and Jesús Carretero, now professor of the Universidad Carlos III de Madrid and former professor of the FIUPM.

COES Spanish language tools are one of the Department of Computer Systems Architecture and Technology’s (DATSI) fields of research at the FIUPM. The key objective of this research is to formalize a set of Spanish grammar rules and apply the rules to check documents written in Spanish for all-round correctness. COES has been distributed as open source software since early 1994. Even though it is over ten years old, the tool is regularly updated and can be consulted at the project web site.

The Spanish dictionary system is composed of a text format electronic dictionary, containing 53,000 terms, a file of Spanish inflectional classes and a script that can generate a binary format expanded dictionary, containing all the inflectional forms of the verbs, nouns, adjectives and the invariable forms, like adverbs and conjunctions, etc., in the dictionary of lemmas.

This set of files constitutes a Spanish dictionary containing a constantly increasing number of terms, although new versions are not released until they have been checked for correct operation. Only properly operating versions are released to the public. The current version of COES includes a spelling checker. Using the public domain ispell tool, the binary format dictionary can be integrated into a Spanish spelling checker system for Unix operating systems.

A text format dictionary of expanded forms (espa~nol.wl) can be generated from the binary format expanded electronic dictionary (espa~nol.hash) and the dictionary of lemmas (espa~nol.words).

As Infoling (an electronic newsletter on Spanish linguistics) reported, the release of the text format expanded electronic dictionary is likely to be an important event for developers—both universities and companies— of Spanish linguistic technologies that need to integrate a dictionary of inflectional forms into specific applications, especially taking into account that the COES project dictionaries are the only public domain and freely distributed electronic Spanish dictionaries.

The whole package of dictionaries and other components is composed of a file of Spanish verb, noun and adjective inflection suffixes; a list of words that appear in the Diccionario de la Real Academia Española de la Lengua (Reference work published by the Royal Academy of the Spanish Language, 21st edition); another list of words that do not appear in the Diccionario de la Real Academia Española de la Lengua, but are commonly used in the Spanish language; a list of words that are routinely used in computing, even though they are not in the Diccionario de la Real Academia Española de la Lengua.

Additionally, this set of dictionaries includes a list of words that appear in the Diccionario de la Real Academia Española de la Lengua whose meanings are in current Spanish usage, a list of expanded words, a script and a makefile file.

Eduardo Martínez | alfa
Further information:
http://www.fi.upm.es/?pagina=597&idioma=english
http://www.datsi.fi.upm.es/~coes/coes.html
http://fmg-www.cs.ucla.edu/fmg-members/geoff/ispell.html

More articles from Information Technology:

nachricht Single-photon detector can count to 4
18.12.2017 | Duke University

nachricht New epidemic management system combats monkeypox outbreak in Nigeria
15.12.2017 | Helmholtz-Zentrum für Infektionsforschung

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Error-free into the Quantum Computer Age

A study carried out by an international team of researchers and published in the journal Physical Review X shows that ion-trap technologies available today are suitable for building large-scale quantum computers. The scientists introduce trapped-ion quantum error correction protocols that detect and correct processing errors.

In order to reach their full potential, today’s quantum computer prototypes have to meet specific criteria: First, they have to be made bigger, which means...

Im Focus: Search for planets with Carmenes successful

German and Spanish researchers plan, build and use modern spectrograph

Since 2016, German and Spanish researchers, among them scientists from the University of Göttingen, have been hunting for exoplanets with the “Carmenes”...

Im Focus: First-of-its-kind chemical oscillator offers new level of molecular control

DNA molecules that follow specific instructions could offer more precise molecular control of synthetic chemical systems, a discovery that opens the door for engineers to create molecular machines with new and complex behaviors.

Researchers have created chemical amplifiers and a chemical oscillator using a systematic method that has the potential to embed sophisticated circuit...

Im Focus: Long-lived storage of a photonic qubit for worldwide teleportation

MPQ scientists achieve long storage times for photonic quantum bits which break the lower bound for direct teleportation in a global quantum network.

Concerning the development of quantum memories for the realization of global quantum networks, scientists of the Quantum Dynamics Division led by Professor...

Im Focus: Electromagnetic water cloak eliminates drag and wake

Detailed calculations show water cloaks are feasible with today's technology

Researchers have developed a water cloaking concept based on electromagnetic forces that could eliminate an object's wake, greatly reducing its drag while...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

See, understand and experience the work of the future

11.12.2017 | Event News

Innovative strategies to tackle parasitic worms

08.12.2017 | Event News

AKL’18: The opportunities and challenges of digitalization in the laser industry

07.12.2017 | Event News

 
Latest News

The body's street sweepers

18.12.2017 | Life Sciences

Fast flowing heat in layered material heterostructures

18.12.2017 | Materials Sciences

Life on the edge prepares plants for climate change

18.12.2017 | Life Sciences

VideoLinks
B2B-VideoLinks
More VideoLinks >>>