Forum for Science, Industry and Business

The first public electronic Spanish dictionary on the Internet

11.02.2008
The first public domain, freely distributed electronic Spanish dictionary was developed as part of the COES project, led by Santiago Rodríguez, a professor at the Universidad Politécnica de Madrid’s School of Computing (FIUPM), and Jesús Carretero, a former FIUPM professor who is now at the Universidad Carlos III de Madrid.

The COES Spanish language tools are one of the research fields of the Department of Computer Systems Architecture and Technology (DATSI) at the FIUPM. The key objective of this research is to formalize a set of Spanish grammar rules and apply them to check documents written in Spanish for all-round correctness. COES has been distributed as open source software since early 1994. Even though it is over ten years old, the tool is regularly updated and can be consulted at the project web site.

The Spanish dictionary system has three components: a text format electronic dictionary of lemmas containing 53,000 terms; a file of Spanish inflectional classes; and a script that generates a binary format expanded dictionary. The expanded dictionary contains all the inflectional forms of the verbs, nouns and adjectives in the dictionary of lemmas, together with the invariable forms such as adverbs and conjunctions.
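The idea of combining a dictionary of lemmas with inflectional classes can be illustrated with a minimal sketch. The `lemma/FLAGS` entry syntax, the flag letters and the suffix rules below are hypothetical simplifications chosen for illustration; they are not the actual COES file format or its inflectional classes.

```python
# Hypothetical, simplified inflectional classes: flag -> list of (strip, add) rules.
# Illustrative only; not the COES inflection file.
INFLECTION_CLASSES = {
    "V": [("ar", "o"), ("ar", "as"), ("ar", "a"), ("ar", "amos"), ("ar", "an")],
    "S": [("", "s")],
}


def expand(entry):
    """Expand one 'lemma/FLAGS' entry into the set of its inflected forms."""
    lemma, _, flags = entry.partition("/")
    forms = {lemma}  # the lemma itself is always a valid form
    for flag in flags:
        for strip, add in INFLECTION_CLASSES.get(flag, []):
            if lemma.endswith(strip):
                stem = lemma[: len(lemma) - len(strip)]
                forms.add(stem + add)
    return forms


def expand_all(entries):
    """Expand every entry and merge the results into one set of forms."""
    expanded = set()
    for entry in entries:
        expanded |= expand(entry)
    return expanded
```

Under these assumed rules, `expand("cantar/V")` yields the lemma plus five present-tense forms, while an invariable entry with no flags, such as `"pero"`, expands only to itself.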

This set of files constitutes a Spanish dictionary whose number of terms is constantly increasing, although new versions are released to the public only after they have been checked for correct operation. The current version of COES includes a spelling checker. Using the public domain ispell tool, the binary format dictionary can be integrated into a Spanish spelling checker system for Unix operating systems.
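At its simplest, a spelling checker built on an expanded dictionary flags any word that is not among the known forms. The following sketch shows only that lookup step; it does not reproduce how ispell works internally, and the function name and the tiny word set used in the usage note are assumptions made for illustration.

```python
import re


def misspelled(text, known_forms):
    """Return the words in text that are absent from known_forms."""
    # Lowercase the text and extract runs of letters (accented letters included).
    words = re.findall(r"[^\W\d_]+", text.lower())
    return [word for word in words if word not in known_forms]
```

For example, with `known_forms = {"canto", "una", "canción"}`, the call `misspelled("Canto una cancion", known_forms)` reports `cancion`, because the unaccented spelling is not in the set.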

A text format dictionary of expanded forms (espa~nol.wl) can be generated from the binary format expanded electronic dictionary (espa~nol.hash) and the dictionary of lemmas (espa~nol.words).

As Infoling (an electronic newsletter on Spanish linguistics) reported, the release of the text format expanded electronic dictionary is likely to be an important event for developers of Spanish linguistic technologies, both universities and companies, that need to integrate a dictionary of inflectional forms into specific applications. This is especially so because the COES project dictionaries are the only public domain and freely distributed electronic Spanish dictionaries.

The whole package of dictionaries and other components is composed of a file of Spanish verb, noun and adjective inflection suffixes; a list of words that appear in the Diccionario de la Real Academia Española de la Lengua (reference work published by the Royal Academy of the Spanish Language, 21st edition); a list of words that do not appear in that dictionary but are commonly used in the Spanish language; and a list of words that are routinely used in computing, even though they are not in that dictionary either.

Additionally, this set of dictionaries includes a list of words that appear in the Diccionario de la Real Academia Española de la Lengua whose meanings are in current Spanish usage, a list of expanded words, a script and a makefile.

Eduardo Martínez | alfa
Further information:
http://www.fi.upm.es/?pagina=597&idioma=english
http://www.datsi.fi.upm.es/~coes/coes.html
http://fmg-www.cs.ucla.edu/fmg-members/geoff/ispell.html
