This book highlights an innovative approach for extracting terminological cores from subject domain-bounded collections of professional texts. The approach is based on exploiting the phenomenon of terminological saturation. The book presents the formal framework for the method of detecting and measuring terminological saturation as a successive approximation process. It further offers the suite of the algorithms that implement the method in the software and comprehensively evaluates all the aspects of the method and possible input configurations in the experiments on synthetic and real collections of texts in several subject domains. The book demonstrates the use of the developed method and software pipeline in industrial and academic use cases. It also outlines the potential benefits of the method for the adoption in industry.
This book highlights an innovative approach for extracting terminological cores from subject domain-bounded collections of professional texts. The approach is based on exploiting the phenomenon of terminological saturation. The book presents the formal framework for the method of detecting and measuring terminological saturation as a successive approximation process. It further offers the suite of the algorithms that implement the method in the software and comprehensively evaluates all the aspects of the method and possible input configurations in the experiments on synthetic and real collections of texts in several subject domains. The book demonstrates the use of the developed method and software pipeline in industrial and academic use cases. It also outlines the potential benefits of the method for the adoption in industry.
Victoria Kosa
Ontology Engineering Ontology Learning Automated Term Extraction Methods (ATE) Text Similarity Measurement Terminological Saturation Terminological Difference Function (thd) Optimized Computation Pipeline Accumulated Regular Noise Removal Document Ordering Term Grouping Optimized Term Extraction Pipeline Gartner Trend Prediction