Abstract: The paper reports on preliminary results of an ongoing research aiming at development of an
automatic procedure for recognition of discourse-compositional structure of scientific and technical texts, which is
required in many NLP applications. The procedure exploits as discourse markers various domain-independent
words and expressions that are specific for scientific and technical texts and organize scientific discourse. The
paper discusses features of scientific discourse and common scientific lexicon comprising such words and
expressions. Methodological issues of development of a computer dictionary for common scientific lexicon are
concerned; basic principles of its organization are described as well. Main steps of the discourse-analyzing
procedure based on the dictionary and surface syntactical analysis are pointed out.
- The work is supported by the grant № 06-01-00571 of Russian Fond of Fundamental Researches (RFFI).
International Journal "Information Theories & Applications" Vol.15 / 2008
190
Keywords: scientific and technical prose, common scientific words and expressions, discourse markers, scientific
discourse operations, discourse-compositional analysis.
ACM Classification Keywords: I.2.7 Artificial Intelligence: Natural language processing – Text analysis
Link:
COMMON SCIENTIFIC LEXICON FOR AUTOMATIC DISCOURSE ANALYSIS OF SCIENTIFIC AND TECHNICAL TEXTS *
Elena Bolshakova
http://www.foibg.com/ijita/vol15/ijita15-2-p15.pdf