DSpace logo

Please use this identifier to cite or link to this item: http://142.54.178.187:9060/xmlui/handle/123456789/978
Title: Multi-Label Question Classification for Factoid and List Type Questions in Biomedical Question Answering
Authors: Wasim, Muhammad
Mahmood, Waqar
Asim, Muhammad Nabeel
Khan, Muhammad Usman
Keywords: Medical and Health Sciences
Knowledge discovery
Feature extraction
Corpus generation
Binary relevance
Semantics
Issue Date: 25-Dec-2018
Publisher: IEEE Access
Abstract: Biomedical experts and bio-curators are unable to quickly find short and precise information using typical search engines as the amount of biomedical literature is increasing exponentially. The research community is focusing on biomedical question answering (QA) systems so that anyone can find precise information nuggets from the massive amount of biomedical literature. Generally, the user queries fall under different categories such as factoid, list, yes/no, or summary. The existing state-of-the-art question answering systems deal with most of these question types. However, the research to improve the performance of individual question types is also on the rise. To improve QA system performance, question classification plays a vital role for factoid and list type questions as it allows the answer processing stage to narrow down the candidate answer space and assigns a higher rank to the correct answers. A single biomedical answer or entity may be associated with more than one biomedical category or semantic type, e.g., Coenzyme Q(10) is classified under two categories in Unified Medical Language System (UMLS): organic chemical and biologically active substance . This inherent characteristic of biomedical entities makes question classification in the biomedical domain a multi-label classification problem, where one question might expect answers belonging to more than one semantic type. To the best of our knowledge, several QA systems deal with question classification as a multi-class classification problem and only one state-of-the-art system – OAQA – deals with it as a multi-label classification problem. In this paper, we analyze the pipeline of the OAQA system for factoid and list type questions, emphasizing the multi-label question classification. We use an improved question classification dataset with the copy transformation technique to improve the performance of list type questions. Moreover, we introduce a binary transformation in the pipeline of factoid
URI: http://142.54.178.187:9060/xmlui/handle/123456789/978
ISSN: 2169-3536
Appears in Collections:Journals

Files in This Item:
File Description SizeFormat 
keywords#.htm125 BHTMLView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.