Gregor Thurmair Thurmair Knowledge-Driven Multilingual Text Analysis and Transparent Information Retrieval

Knowledge-Driven Multilingual Text Analysis and Transparent Information Retrieval

von Gregor Thurmair

Language Technology for Industrial Applications

Preis unbekannt

Buch in deiner Nähe kaufen


...oder deine aktuelle Postleitzahl eingeben:
oder

Beschreibung

This book presents all components and knowledge sources required for Transparent Information Retrieval. Depending on the respective topic and taking care of their interoperability, both deep and shallow technology is used. The processing starts from the analysis of the text data and collects its results in a multilingual conceptual network, this way enabling Transparent Information Retrieval where users communicate with the system in their native language while the documents could be in a different language, transparent to the users.   To do so, the author investigates all text analysis components required for multilingual indexing, starting from preparatory work like language and topic identification, continuing with sentence splitting and tokenization (including Chinese), and describing lexical analysis, also for multiword entries and Named Entities. Entries are then disambiguated both on syntactic (by a tagger) and semantic level (by multilingual word sense disambiguation). The analysis results are collected in a dynamic multilingual ConceptNet, which is an index structure extended by monolingual relations (like synonyms, or head-modifier links) as well as multilingual ones (translations). In addition to many European languages also Turkish, Arabic, Persian, and Chinese are treated.   The book concludes with a description of components needed to build the required resources, like crawlers, bilingual term extraction, and tools for defaulting linguistic annotations. For each component, readers will find a technology overview, a discussion of its main challenges in computational treatment, a description of the technical solution selected, and evaluation information.
This book presents all components and knowledge sources required for Transparent Information Retrieval. Depending on the respective topic and taking care of their interoperability, both deep and shallow technology is used. The processing starts from the analysis of the text data and collects its results in a multilingual conceptual network, this way enabling Transparent Information Retrieval where users communicate with the system in their native language while the documents could be in a different language, transparent to the users.   To do so, the author investigates all text analysis components required for multilingual indexing, starting from preparatory work like language and topic identification, continuing with sentence splitting and tokenization (including Chinese), and describing lexical analysis, also for multiword entries and Named Entities. Entries are then disambiguated both on syntactic (by a tagger) and semantic level (by multilingual word sense disambiguation). The analysis results are collected in a dynamic multilingual ConceptNet, which is an index structure extended by monolingual relations (like synonyms, or head-modifier links) as well as multilingual ones (translations). In addition to many European languages also Turkish, Arabic, Persian, and Chinese are treated.   The book concludes with a description of components needed to build the required resources, like crawlers, bilingual term extraction, and tools for defaulting linguistic annotations. For each component, readers will find a technology overview, a discussion of its main challenges in computational treatment, a description of the technical solution selected, and evaluation information.
Presents all components needed for text analysis with approach, implementation, and evaluation results Extends the language coverage beyond the usual European ones and offers analyses for Turkish, Arabic, Persian or Chinese Introduces the concept of Transparent Information Retrieval (TIR) and search using a multilingual Conceptual network

Autor*in

Gregor Thurmair

Themen in »Knowledge-Driven Multilingual Text Analysis and Transparent Information Retrieval«

Text Analysis Conceptual Network Multilingual Indexing Information Retrieval TINA Transparent Information Retrieval (TIR) Word Sense Disambiguation LtConceptNet Named Entity Recognition Lexical Analysis Text Segmentation

Stimmen zu »Knowledge-Driven Multilingual Text Analysis and Transparent Information Retrieval«

Details

ISBN: 9783031917417
Verlag: Springer International Publishing
Erscheinung: 09.10.2025

Link teilen


Über buchnah.de | Die Buchhandlungen | Die Verlage | Impressum & Kontakt | Datenschutz | Presse


Auf dieser Seite kannst Du Buchhandlungen in der Nähe finden