Skadiņa Using Comparable Corpora for Under-Resourced Areas of Machine Translation

Using Comparable Corpora for Under-Resourced Areas of Machine Translation

von

Preis unbekannt

Buch in deiner Nähe kaufen


...oder deine aktuelle Postleitzahl eingeben:
oder

Beschreibung

This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains.

The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics.



This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains.

The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics.




Describes a step-by-step method for collecting comparable corpora and processing it for usage in machine translation Demonstrates how data from comparable corpora can improve the quality of machine translation Proposes novel methods for measuring the comparability of multilingual corpora Describes algorithms and techniques for alignment and extraction of lexical and terminological data from comparable corpora in order to provide training and customization data for machine translation

Autor*in

Inguna Skadiņa

Themen in »Using Comparable Corpora for Under-Resourced Areas of Machine Translation«

Comparable corpora Under-resourced languages Multilingual processing Comparability metric Parallel data extraction from comparable corpora Term extraction Machine translation Domain adaptation

Stimmen zu »Using Comparable Corpora for Under-Resourced Areas of Machine Translation«

Details

ISBN: 9783319990040
Verlag: Springer International Publishing
Erscheinung: 06.02.2019

Link teilen


Über buchnah.de | Die Buchhandlungen | Die Verlage | Impressum & Kontakt | Datenschutz | Presse


Auf dieser Seite kannst Du Buchhandlungen in der Nähe finden