The COES Spanish language tools are one of the research fields of the Department of Computer Systems Architecture and Technology (DATSI) at the FIUPM. The key objective of this research is to formalize a set of Spanish grammar rules and apply them to check documents written in Spanish for overall correctness. COES has been distributed as open source software since early 1994. Although it is over ten years old, the tool is regularly updated and can be consulted at the project web site.
The Spanish dictionary system consists of three parts: a text format electronic dictionary of lemmas containing 53,000 terms; a file of Spanish inflectional classes; and a script that generates a binary format expanded dictionary. The expanded dictionary contains all the inflectional forms of the verbs, nouns and adjectives in the dictionary of lemmas, together with its invariable forms, such as adverbs and conjunctions.
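The expansion step described above can be illustrated with a minimal Python sketch. It is not the COES script: the `lemma/FLAGS` entry format, the flag letters and the suffix rules below are all hypothetical, chosen only to show how a dictionary of lemmas plus a table of inflectional classes yields a list of expanded forms.

```python
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical inflectional classes: flag -> list of (ending to strip, ending to add).
# Illustrative only; these are NOT the rules shipped with COES.
INFLECTION_CLASSES: Dict[str, List[Tuple[str, str]]] = {
    "S": [("", "s")],                               # plural in -s: casa -> casas
    "V": [("ar", "o"), ("ar", "as"), ("ar", "a")],  # a few present forms of -ar verbs
}


def parse_entry(line: str) -> Tuple[str, str]:
    """Split a 'lemma/FLAGS' entry into (lemma, flags); flags may be empty."""
    lemma, _, flags = line.strip().partition("/")
    return lemma, flags


def expand(lemma: str, flags: str) -> Set[str]:
    """Return the lemma plus every form produced by its inflectional classes."""
    forms = {lemma}
    for flag in flags:
        for strip, add in INFLECTION_CLASSES.get(flag, []):
            if lemma.endswith(strip):
                stem = lemma[: len(lemma) - len(strip)]
                forms.add(stem + add)
    return forms


def expand_dictionary(lines: Iterable[str]) -> List[str]:
    """Expand every non-empty entry and return the sorted set of forms."""
    expanded: Set[str] = set()
    for line in lines:
        if line.strip():
            lemma, flags = parse_entry(line)
            expanded |= expand(lemma, flags)
    return sorted(expanded)


if __name__ == "__main__":
    # Invariable forms such as "pero" carry no flags and pass through unchanged.
    for form in expand_dictionary(["casa/S", "cantar/V", "pero"]):
        print(form)
```

Keeping the rules in a separate table mirrors the split the article describes between the dictionary of lemmas and the file of inflectional classes: new terms can be added to the lemma list without touching the rules.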
This set of files constitutes a Spanish dictionary whose number of terms is constantly increasing, although new versions are released to the public only after they have been checked for correct operation. The current version of COES includes a spelling checker. Using the public domain ispell tool, the binary format dictionary can be integrated into a Spanish spelling checker system for Unix operating systems.
A text format dictionary of expanded forms (espa~nol.wl) can be generated from the binary format expanded electronic dictionary (espa~nol.hash) and the dictionary of lemmas (espa~nol.words).
As Infoling (an electronic newsletter on Spanish linguistics) reported, the release of the text format expanded electronic dictionary is likely to be an important event for developers of Spanish linguistic technologies, both universities and companies, that need to integrate a dictionary of inflectional forms into specific applications. This is especially so because the COES project dictionaries are the only public domain and freely distributed electronic Spanish dictionaries.
The whole package of dictionaries and other components is composed of: a file of Spanish verb, noun and adjective inflection suffixes; a list of words that appear in the Diccionario de la Real Academia Española de la Lengua (the reference work published by the Royal Academy of the Spanish Language, 21st edition); a list of words that do not appear in that dictionary but are commonly used in Spanish; and a list of words that are routinely used in computing, even though they are not in that dictionary either.
Additionally, this set of dictionaries includes a list of words that appear in the Diccionario de la Real Academia Española de la Lengua whose meanings are in current Spanish usage, a list of expanded words, a script and a makefile.
Eduardo Martínez | alfa