Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

The first public electronic Spanish dictionary on the Internet

11.02.2008
The first public domain and freely distributed electronic Spanish dictionary was developed as part of the COES project, led by Santiago Rodríguez, professor of the Universidad Politécnica de Madrid’s School of Computing (FIUPM), and Jesús Carretero, now professor of the Universidad Carlos III de Madrid and former professor of the FIUPM.

COES Spanish language tools are one of the Department of Computer Systems Architecture and Technology’s (DATSI) fields of research at the FIUPM. The key objective of this research is to formalize a set of Spanish grammar rules and apply the rules to check documents written in Spanish for all-round correctness. COES has been distributed as open source software since early 1994. Even though it is over ten years old, the tool is regularly updated and can be consulted at the project web site.

The Spanish dictionary system is composed of a text format electronic dictionary, containing 53,000 terms, a file of Spanish inflectional classes and a script that can generate a binary format expanded dictionary, containing all the inflectional forms of the verbs, nouns, adjectives and the invariable forms, like adverbs and conjunctions, etc., in the dictionary of lemmas.

This set of files constitutes a Spanish dictionary containing a constantly increasing number of terms, although new versions are not released until they have been checked for correct operation. Only properly operating versions are released to the public. The current version of COES includes a spelling checker. Using the public domain ispell tool, the binary format dictionary can be integrated into a Spanish spelling checker system for Unix operating systems.

A text format dictionary of expanded forms (espa~nol.wl) can be generated from the binary format expanded electronic dictionary (espa~nol.hash) and the dictionary of lemmas (espa~nol.words).

As Infoling (an electronic newsletter on Spanish linguistics) reported, the release of the text format expanded electronic dictionary is likely to be an important event for developers—both universities and companies— of Spanish linguistic technologies that need to integrate a dictionary of inflectional forms into specific applications, especially taking into account that the COES project dictionaries are the only public domain and freely distributed electronic Spanish dictionaries.

The whole package of dictionaries and other components is composed of a file of Spanish verb, noun and adjective inflection suffixes; a list of words that appear in the Diccionario de la Real Academia Española de la Lengua (Reference work published by the Royal Academy of the Spanish Language, 21st edition); another list of words that do not appear in the Diccionario de la Real Academia Española de la Lengua, but are commonly used in the Spanish language; a list of words that are routinely used in computing, even though they are not in the Diccionario de la Real Academia Española de la Lengua.

Additionally, this set of dictionaries includes a list of words that appear in the Diccionario de la Real Academia Española de la Lengua whose meanings are in current Spanish usage, a list of expanded words, a script and a makefile file.

Eduardo Martínez | alfa
Further information:
http://www.fi.upm.es/?pagina=597&idioma=english
http://www.datsi.fi.upm.es/~coes/coes.html
http://fmg-www.cs.ucla.edu/fmg-members/geoff/ispell.html

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: A step towards controlling spin-dependent petahertz electronics by material defects

The operational speed of semiconductors in various electronic and optoelectronic devices is limited to several gigahertz (a billion oscillations per second). This constrains the upper limit of the operational speed of computing. Now researchers from the Max Planck Institute for the Structure and Dynamics of Matter in Hamburg, Germany, and the Indian Institute of Technology in Bombay have explained how these processes can be sped up through the use of light waves and defected solid materials.

Light waves perform several hundred trillion oscillations per second. Hence, it is natural to envision employing light oscillations to drive the electronic...

Im Focus: Freiburg researcher investigate the origins of surface texture

Most natural and artificial surfaces are rough: metals and even glasses that appear smooth to the naked eye can look like jagged mountain ranges under the microscope. There is currently no uniform theory about the origin of this roughness despite it being observed on all scales, from the atomic to the tectonic. Scientists suspect that the rough surface is formed by irreversible plastic deformation that occurs in many processes of mechanical machining of components such as milling.

Prof. Dr. Lars Pastewka from the Simulation group at the Department of Microsystems Engineering at the University of Freiburg and his team have simulated such...

Im Focus: Skyrmions like it hot: Spin structures are controllable even at high temperatures

Investigation of the temperature dependence of the skyrmion Hall effect reveals further insights into possible new data storage devices

The joint research project of Johannes Gutenberg University Mainz (JGU) and the Massachusetts Institute of Technology (MIT) that had previously demonstrated...

Im Focus: Making the internet more energy efficient through systemic optimization

Researchers at Chalmers University of Technology, Sweden, recently completed a 5-year research project looking at how to make fibre optic communications systems more energy efficient. Among their proposals are smart, error-correcting data chip circuits, which they refined to be 10 times less energy consumptive. The project has yielded several scientific articles, in publications including Nature Communications.

Streaming films and music, scrolling through social media, and using cloud-based storage services are everyday activities now.

Im Focus: New synthesis methods enhance 3D chemical space for drug discovery

After helping develop a new approach for organic synthesis -- carbon-hydrogen functionalization -- scientists at Emory University are now showing how this approach may apply to drug discovery. Nature Catalysis published their most recent work -- a streamlined process for making a three-dimensional scaffold of keen interest to the pharmaceutical industry.

"Our tools open up whole new chemical space for potential drug targets," says Huw Davies, Emory professor of organic chemistry and senior author of the paper.

All Focus news of the innovation-report >>>

Anzeige

Anzeige

VideoLinks
Industry & Economy
Event News

70th Lindau Nobel Laureate Meeting: Around 70 Laureates set to meet with young scientists from approx. 100 countries

12.02.2020 | Event News

11th Advanced Battery Power Conference, March 24-25, 2020 in Münster/Germany

16.01.2020 | Event News

Laser Colloquium Hydrogen LKH2: fast and reliable fuel cell manufacturing

15.01.2020 | Event News

 
Latest News

"Make two out of one" - Division of Artificial Cells

19.02.2020 | Life Sciences

High-Performance Computing Center of the University of Stuttgart Receives new Supercomuter "Hawk"

19.02.2020 | Information Technology

A step towards controlling spin-dependent petahertz electronics by material defects

19.02.2020 | Power and Electrical Engineering

VideoLinks
Science & Research
Overview of more VideoLinks >>>