COES Spanish language tools are one of the Department of Computer Systems Architecture and Technology’s (DATSI) fields of research at the FIUPM. The key objective of this research is to formalize a set of Spanish grammar rules and apply the rules to check documents written in Spanish for all-round correctness. COES has been distributed as open source software since early 1994. Even though it is over ten years old, the tool is regularly updated and can be consulted at the project web site.
The Spanish dictionary system is composed of a text format electronic dictionary, containing 53,000 terms, a file of Spanish inflectional classes and a script that can generate a binary format expanded dictionary, containing all the inflectional forms of the verbs, nouns, adjectives and the invariable forms, like adverbs and conjunctions, etc., in the dictionary of lemmas.
This set of files constitutes a Spanish dictionary containing a constantly increasing number of terms, although new versions are not released until they have been checked for correct operation. Only properly operating versions are released to the public. The current version of COES includes a spelling checker. Using the public domain ispell tool, the binary format dictionary can be integrated into a Spanish spelling checker system for Unix operating systems.
A text format dictionary of expanded forms (espa~nol.wl) can be generated from the binary format expanded electronic dictionary (espa~nol.hash) and the dictionary of lemmas (espa~nol.words).
As Infoling (an electronic newsletter on Spanish linguistics) reported, the release of the text format expanded electronic dictionary is likely to be an important event for developers—both universities and companies— of Spanish linguistic technologies that need to integrate a dictionary of inflectional forms into specific applications, especially taking into account that the COES project dictionaries are the only public domain and freely distributed electronic Spanish dictionaries.
The whole package of dictionaries and other components is composed of a file of Spanish verb, noun and adjective inflection suffixes; a list of words that appear in the Diccionario de la Real Academia Española de la Lengua (Reference work published by the Royal Academy of the Spanish Language, 21st edition); another list of words that do not appear in the Diccionario de la Real Academia Española de la Lengua, but are commonly used in the Spanish language; a list of words that are routinely used in computing, even though they are not in the Diccionario de la Real Academia Española de la Lengua.
Additionally, this set of dictionaries includes a list of words that appear in the Diccionario de la Real Academia Española de la Lengua whose meanings are in current Spanish usage, a list of expanded words, a script and a makefile file.
Eduardo Martínez | alfa
Next Generation Cryptography
20.03.2018 | Fraunhofer-Institut für Sichere Informationstechnologie SIT
TIB’s Visual Analytics Research Group to develop methods for person detection and visualisation
19.03.2018 | Technische Informationsbibliothek (TIB)
A new scenario seeking to explain how Mars' putative oceans came and went over the last 4 billion years implies that the oceans formed several hundred million...
For the first time, an interdisciplinary team from the University of Basel has succeeded in integrating artificial organelles into the cells of live zebrafish embryos. This innovative approach using artificial organelles as cellular implants offers new potential in treating a range of diseases, as the authors report in an article published in Nature Communications.
In the cells of higher organisms, organelles such as the nucleus or mitochondria perform a range of complex functions necessary for life. In the networks of...
Animal photoreceptors capture light with photopigments. Researchers from the University of Göttingen have now discovered that these photopigments fulfill an...
On 15 March, the AWI research aeroplane Polar 5 will depart for Greenland. Concentrating on the furthest northeast region of the island, an international team...
The world’s second-largest ice shelf was the destination for a Polarstern expedition that ended in Punta Arenas, Chile on 14th March 2018. Oceanographers from...
19.03.2018 | Event News
16.03.2018 | Event News
13.03.2018 | Event News
20.03.2018 | Physics and Astronomy
20.03.2018 | Physics and Astronomy
20.03.2018 | Earth Sciences