Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

Classic grammar model can be used for computerised parsing

31.05.2010
A classic Nordic grammar model can be used for computerised grammatical analyses and technical applications of modern Swedish text, shows a new thesis in the field of language technology from the University of Gothenburg, Sweden.
One such application enables queries answered by a digital text to be generated when it is opened, and then used to search for specific information in the text.

Language researcher Kenneth Wilhelmsson has developed a new method which interprets the grammatical structure of a text, known as parsing, with the help of a computer program.

The method builds on Danish linguist Paul Diderichsen’s traditional sentence structure, which has been adopted for the description of all the Nordic languages and is found in most modern Swedish grammar books.

“The grammatical analysis in the program is performed mostly at the main clause level, which can be seen as a big advantage, as the task is then less complex but still gives usable results,” explains Wilhelmsson at the University of Gothenburg.

Instead of performing the entire analysis in one go, the approach consists of a series of steps which can be performed with high levels of accuracy. It is primarily the main clause’s finite verb and other single-word sentence elements which are identified at the main clause level. This, in turn, paves the way for the identification of complex sentence elements (subject, object/predicative and adverbial), which can rely on exclusion methodologies and similar rule formulations (heuristics) rather than an explicit, complete grammatical description.

Kenneth Wilhelmsson’s newly developed method can also be used by language researchers to search for instances of different grammatical phenomena, which can be described in a more refined fashion than with word and string matching.

Wilhelmsson’s work on the thesis also included the creation of various prototype applications which build on this type of analysis. One of them is a unique system for automatic generation of queries from a Swedish text.

The program has access to the Swedish Wikipedia’s article database and can be used to generate queries when a text is opened. When the user begins to type a query, the text is completed automatically, and only queries that can actually be answered may be asked.

“This is intended as an alternative to most other modern query programs where the user cannot know whether a query can actually be answered by the knowledge base at all, and where variations in the formulation of the query may mean that information that is there is missed,” explains Wilhelmsson.

Title of thesis: Heuristic Analysis with Diderichsen's Sentence Schema – Applications for Swedish Text
Author: Kenneth Wilhelmsson, tel: +46 31 408 211
E-mail: kw@ling.gu.se
Link to thesis: http://hdl.handle.net/2077/22028

Helena Aaberg | idw
Further information:
http://hdl.handle.net/2077/22028
http://www.gu.se/

More articles from Information Technology:

nachricht Cutting edge research for the industries of tomorrow – DFKI and NICT expand cooperation
21.03.2017 | Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, DFKI

nachricht Molecular motor-powered biocomputers
20.03.2017 | Technische Universität Dresden

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Giant Magnetic Fields in the Universe

Astronomers from Bonn and Tautenburg in Thuringia (Germany) used the 100-m radio telescope at Effelsberg to observe several galaxy clusters. At the edges of these large accumulations of dark matter, stellar systems (galaxies), hot gas, and charged particles, they found magnetic fields that are exceptionally ordered over distances of many million light years. This makes them the most extended magnetic fields in the universe known so far.

The results will be published on March 22 in the journal „Astronomy & Astrophysics“.

Galaxy clusters are the largest gravitationally bound structures in the universe. With a typical extent of about 10 million light years, i.e. 100 times the...

Im Focus: Tracing down linear ubiquitination

Researchers at the Goethe University Frankfurt, together with partners from the University of Tübingen in Germany and Queen Mary University as well as Francis Crick Institute from London (UK) have developed a novel technology to decipher the secret ubiquitin code.

Ubiquitin is a small protein that can be linked to other cellular proteins, thereby controlling and modulating their functions. The attachment occurs in many...

Im Focus: Perovskite edges can be tuned for optoelectronic performance

Layered 2D material improves efficiency for solar cells and LEDs

In the eternal search for next generation high-efficiency solar cells and LEDs, scientists at Los Alamos National Laboratory and their partners are creating...

Im Focus: Polymer-coated silicon nanosheets as alternative to graphene: A perfect team for nanoelectronics

Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are less stable. Now researchers at the Technical University of Munich (TUM) have, for the first time ever, produced a composite material combining silicon nanosheets and a polymer that is both UV-resistant and easy to process. This brings the scientists a significant step closer to industrial applications like flexible displays and photosensors.

Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are...

Im Focus: Researchers Imitate Molecular Crowding in Cells

Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to simulate these confined natural conditions in artificial vesicles for the first time. As reported in the academic journal Small, the results are offering better insight into the development of nanoreactors and artificial organelles.

Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

International Land Use Symposium ILUS 2017: Call for Abstracts and Registration open

20.03.2017 | Event News

CONNECT 2017: International congress on connective tissue

14.03.2017 | Event News

ICTM Conference: Turbine Construction between Big Data and Additive Manufacturing

07.03.2017 | Event News

 
Latest News

Northern oceans pumped CO2 into the atmosphere

27.03.2017 | Earth Sciences

Fingerprint' technique spots frog populations at risk from pollution

27.03.2017 | Life Sciences

Big data approach to predict protein structure

27.03.2017 | Life Sciences

VideoLinks
B2B-VideoLinks
More VideoLinks >>>