Query Translation for Cross-lingual Search in the Academic Search Engine PubPsych
This article is a preprint and has not been certified by peer review [What does this mean?].
Author(s) / Creator(s)
España-Bonet, Cristina
Stiller, Juliane
Ramthun, Roland
van Genabith, Josef
Petras, Vivien
Abstract / Description
We describe a lexical resource-based process for query translation of a domain-specific and multilingual academic search engine in psychology, PubPsych. PubPsych queries are diverse in language with a high amount of informational queries and technical terminology. We present an approach for translating queries into English, German, French, and Spanish. We build a quadrilingual lexicon with aligned terms in the four languages using MeSH, Wikipedia and Apertium as our main resources. Our results show that using the quadlexicon together with some simple translation rules, we can automatically translate 85% of translatable tokens in PubPsych queries with mean adequacy over all the translatable text of 1.4 when measured on a 3-point scale [0,1,2].
Keyword(s)
machine translation information retrieval translation evaluationPersistent Identifier
Date of first publication
2018
Publisher
PsychArchives
Citation
España-Bonet, C., Stiller, J., Ramthun, R., Van Genabith, J., & Petras, V. (2018). Query Translation for Cross-lingual Search in the Academic Search Engine PubPsych. PsychArchives. https://doi.org/10.23668/psycharchives.1062
-
query_translation_academic_search_2018.pdfAdobe PDF - 258.71KBMD5: c71f7f573d1fd6b809b32434a9d71be0
-
There are no other versions of this object.
-
Author(s) / Creator(s)España-Bonet, Cristina
-
Author(s) / Creator(s)Stiller, Juliane
-
Author(s) / Creator(s)Ramthun, Roland
-
Author(s) / Creator(s)van Genabith, Josef
-
Author(s) / Creator(s)Petras, Vivien
-
PsychArchives acquisition timestamp2018-11-15T12:34:26Z
-
Made available on2018-11-15T12:34:26Z
-
Date of first publication2018
-
Abstract / DescriptionWe describe a lexical resource-based process for query translation of a domain-specific and multilingual academic search engine in psychology, PubPsych. PubPsych queries are diverse in language with a high amount of informational queries and technical terminology. We present an approach for translating queries into English, German, French, and Spanish. We build a quadrilingual lexicon with aligned terms in the four languages using MeSH, Wikipedia and Apertium as our main resources. Our results show that using the quadlexicon together with some simple translation rules, we can automatically translate 85% of translatable tokens in PubPsych queries with mean adequacy over all the translatable text of 1.4 when measured on a 3-point scale [0,1,2].en_US
-
Publication statusother
-
Review statusreviewed
-
SponsorshipThis research was supported by the Leibniz-Gemeinschaft under grant SAW-2016-ZPID-2.en_US
-
CitationEspaña-Bonet, C., Stiller, J., Ramthun, R., Van Genabith, J., & Petras, V. (2018). Query Translation for Cross-lingual Search in the Academic Search Engine PubPsych. PsychArchives. https://doi.org/10.23668/psycharchives.1062en
-
Persistent Identifierhttps://hdl.handle.net/20.500.12034/870
-
Persistent Identifierhttps://doi.org/10.23668/psycharchives.1062
-
Language of contentengen_US
-
PublisherPsychArchivesen_US
-
Is related tohttps://doi.org/10.23668/psycharchives.928
-
Keyword(s)machine translationen_US
-
Keyword(s)information retrievalen_US
-
Keyword(s)translation evaluationen_US
-
Dewey Decimal Classification number(s)150
-
TitleQuery Translation for Cross-lingual Search in the Academic Search Engine PubPsychen_US
-
DRO typepreprinten_US