Query Translation for Cross-lingual Search in the Academic Search Engine PubPsych
Author(s) / Creator(s)
España-Bonet, Cristina
Stiller, Juliane
Ramthun, Roland
van Genabith, Josef
Petras, Vivien
Abstract / Description
We describe a lexical resource-based process for query translation in PubPsych, a domain-specific, multilingual academic search engine for psychology. PubPsych queries are diverse in language and contain a high proportion of informational queries and technical terminology. We present an approach for translating queries into English, German, French, and Spanish. We build a quadrilingual lexicon with aligned terms in the four languages using MeSH, Wikipedia, and Apertium as our main resources. Our results show that, using the quadlexicon together with some simple translation rules, we can automatically translate 85% of translatable tokens in PubPsych queries, with a mean adequacy of 1.4 over all the translatable text when measured on a 3-point scale [0,1,2].
Keyword(s)
machine translation
information retrieval
Date of first publication
2018
Is part of
12th International Conference on Metadata and Semantics Research, 2018, Limassol, Cyprus
Publisher
PsychArchives
-
Presentation_JS_RR_MTSR2018.pdf
Adobe PDF - 233.4 KB
MD5: 80f4931a2448b718e40e7a5717895cae
Description: Conference Talk
-
PsychArchives acquisition timestamp: 2018-11-05T10:52:49Z
-
Made available on: 2018-11-05T10:52:49Z
-
Publication status: publishedVersion
-
Review status: reviewed
-
Persistent Identifier: https://hdl.handle.net/20.500.12034/735
-
Persistent Identifier: https://doi.org/10.23668/psycharchives.928
-
Language of content: eng
-
Is related to: https://doi.org/10.23668/psycharchives.1062
-
Dewey Decimal Classification number(s): 150
-
DRO type: conferenceObject
-
Leibniz institute name(s) / abbreviation(s): ZPID