Morphological Patterns of Scientific Terminology in Thesauri Database of IRANDOC
Paper ID : 1088-ICIL
Authors:
Moluksadat Hosseini Beheshti *
No. 1090. Enqelab Eslami Ave
Abstract:
The IranDoc database of scientific terms includes thesauri for the basic sciences, agriculture, and engineering. From this database, 39,000 terms were used as the linguistic corpus for analysis. The corpus consists of terms from chemistry, physics, geology, biology, and mathematics; merging these five thesauri involved removing the duplicate terms shared among them and synchronizing the entries. These terms were extracted from published scientific documents, such as dissertations, scientific articles, research project reports, and the indexes of academic books.
Persian scientific terms in the basic sciences thesauri were categorized into simple and complex words, and their frequency in the database was determined. The complex words were classified into compounds, derivatives, and compound-derivatives, and the frequency of each category was calculated. These terms, which were the most frequent in the indexing of scientific documents, were analyzed with the aid of a computer. The frequency of prefixes and suffixes in Persian scientific terms, together with their morphological patterns, was determined from these analyses. These terms are preferred by researchers and specialists, who are the main users of scientific information and therefore play the main role in updating the IranDoc thesauri.
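The abstract does not describe the software used for the computer-aided analysis. The Python sketch below is only an illustration of the kind of counting described above: it classifies terms as simple, compound, derivative, or compound-derivative, and tallies prefix and suffix frequencies. It assumes each term has already been segmented by hand into morphemes tagged as root, prefix, or suffix. The romanized sample terms, their segmentations, and the names `TERMS`, `classify`, and `summarize` are hypothetical and are not taken from the IranDoc data.

```python
from collections import Counter

# Hypothetical, hand-segmented toy entries (romanized); NOT from the IranDoc corpus.
# Each term maps to a list of (morpheme, kind) pairs,
# where kind is one of "root", "prefix", or "suffix".
TERMS = {
    "ab": [("ab", "root")],
    "ketabkhane": [("ketab", "root"), ("khane", "root")],
    "hamkar": [("ham", "prefix"), ("kar", "root")],
    "daneshmand": [("danesh", "root"), ("mand", "suffix")],
    "daneshjuyi": [("danesh", "root"), ("ju", "root"), ("i", "suffix")],
}


def classify(morphemes):
    """Return the morphological category of one segmented term."""
    roots = sum(1 for _, kind in morphemes if kind == "root")
    affixes = len(morphemes) - roots
    if roots >= 2 and affixes:
        return "compound-derivative"
    if roots >= 2:
        return "compound"
    if affixes:
        return "derivative"
    return "simple"


def summarize(terms):
    """Count terms per category and occurrences of each prefix and suffix."""
    categories = Counter(classify(m) for m in terms.values())
    prefixes = Counter(
        morph for ms in terms.values() for morph, kind in ms if kind == "prefix"
    )
    suffixes = Counter(
        morph for ms in terms.values() for morph, kind in ms if kind == "suffix"
    )
    return categories, prefixes, suffixes


if __name__ == "__main__":
    categories, prefixes, suffixes = summarize(TERMS)
    print("categories:", dict(categories))
    print("prefixes:", dict(prefixes))
    print("suffixes:", dict(suffixes))
```

Keeping segmentation as a manual input, rather than guessing morpheme boundaries automatically, is a deliberate simplification here; a real analysis of the thesauri would need a Persian morphological segmenter or expert annotation.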
Keywords:
terminology, morphological patterns, concept system, thesaurus, ontology, IranDoc database
Status : Paper Conditionally Accepted
10th International Iranian Conference on Linguistics