International Research Journal of Engineering and Technology (IRJET) Volume: 04 Issue: 03 | March -2017
e-ISSN: 2395 -0056 p-ISSN: 2395-0072
www.irjet.net
Question Retrieval in Community Question Answering via NONNegative Matrix Factorization Deshmukh Ashvini B., Shelke Pooja P., Kokare Sayali A., Taware Saksha S. B.E. Students, Department of Information Technology, SVPM’s C.O.E. Malegaon (Bk), 413115, Savitribai Phule, Pune University, Maharashtra, India --------------------------------------------------------------****-------------------------------------------------------------Abstract - CQA helpful in answering real world question. CQA provide answer to human. Question retrieval in CQA can automatically find the most relevant and recent questions that have been solved by other users. We propose an alternative way to address the word ambiguity and word mismatch problems by taking advantage of potentially rich semantic information drawn from other languages. The translated words from other languages via non-negative matrix factorization. Contextual information is exploited during the translation from one language to another language by using Google Translate. Thus, word ambiguity can be solved based on the contextual information when questions are translated. Multiple words that have similar meanings in one language may be translated into a unique word or a few words in a foreign language. It is a word-based translation language model for retrieval with query likelihood model for answer. We use a translated representation by alternative enriching the original question with the words from other language in CQA. We translate the English question into other four language using Google translate which take into account contextual information during translation. If we translate the question word by word, it discard the contextual information. We would expect that such a translation would not be able to solve word ambiguity problem.
which are related to the asked question but asked in other language as well as the questions that are related to the topic but not having similar keywords. The proposed system shows that this can be achieved when these questions are retrieved semantically instead of using keywords. It is found that, in most cases, automated approach cannot obtain results that are as good as those generated by human intelligence. Along with the proliferation and improvement of underlying communication technologies, community Question Answering (CQA) has emerged as an extremely popular alternative to acquire information online, owning to the following facts. a. Information seekers are able to post their specific questions on any topic and obtain answers provided by other participants. By leveraging community efforts, they are able to get better answers than simply using search engines. In comparison with automated CQA systems, CQA usually receives answers with better quality as they are generated based on human intelligence. c. Over times, a tremendous number of QA pairs have been accumulated in their repositories, and it facilitates the preservation and search of answered questions.
Key Words: Community Question Answering, Statical Machine Translation, Non Matrix Factorization, Google Translator, Recursive Neural Network.
User Enter question in CQA.CQA check the question in dataset.CQA Factorize the question in query format. We translate the English questions into other four languages using Google Translate, which takes into account contextual information during translation. Remove word mismatch and word ambiguity. Use the algorithm SMT+NMF for optimization. Use Map Reduce on optimize matrix. Using ranking get expected result with best answer.
1. INTRODUCTION To make community question answering portals more useful, it is necessary for the system to be able to fetch the questions asked in other languages as well. This will give the user a wide range of pre answered questions to look for solution of his/her problem. Current systems fail to do so. Also these systems fetches related questions based on the keywords in it. Thus, if there is a question which is related to the topic but having other keywords, then that question is not retrieved, this is a major drawback of a system as there can be many circumstances where a semantically related question but not having similar keywords is not retrieved. The proposed system shows a way to retrieve questions © 2017, IRJET
|
Impact Factor value: 5.181
|
2. SYSTEM DESCRIPTION 2.1 Functionality summary
User: User first do the registration to the system. He login to the system by entering userid and password. Then, he ask the question in the system. CQA: The CQA gives the users answer in textual format. After that it select the answer medium for particular question. ISO 9001:2008 Certified Journal
|
Page 8