International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395 -0056
Volume: 04 Issue: 05 | May -2017
p-ISSN: 2395-0072
www.irjet.net
Model for semantic processing in information retrieval systems Ph.D Roberto Passailaigue Baquerizo1, MSc. Hubert Viltres Sala2, Ing. Paúl Rodríguez Leyva3, Ph.D Vivian Estrada Sentí4 1Canciller
Universidad Tecnológica (ECOTEC) Guayaquil, Ecuador 2Departamento de Práctica Profesional, Universidad de las Ciencias Informáticas, La Habana, Cuba 3Departamento de Soluciones Informáticas para Internet, Universidad de las Ciencias Informáticas, La Habana, Cuba 4Departamento Metodológico de Postgrado Universidad de las Ciencias Informáticas, La Habana, Cuba
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - The processing of information with semantic annotation allows to identify the intention of search of the users and to adjust the result according to the context of the information. The present research proposes a model for the retrieval of information with semantic annotation that allows to help the user to recover the most relevant information among all the information available on the web. In the model, three components (Trace-Indexing, Processing and Presentation) are developed that allow identifying the need for user information through the processing, selection and subsequent publication of the retrieved information. The crawling and indexing component allows the identification of available web sites to extract information and perform semantic annotation by applying different information processing techniques. The processing component analyzes the preferences of the user and processes the query performed to calculate the similarity of the indexed information. Subsequently the results are sorted according to the relevance to show in the Presentation component a quantity of information that can be assimilated by the users. For the validation of the proposal, the metrics of precision and completeness were used to demonstrate the quality and relevance of the information retrieval with semantic annotation.
enabled a large volume of web content to be generated. The information available on the web is dispersed, poorly structured or invisible to the common user, making it difficult to access information of high quality and value to the user. In this context, users when they access the Internet are overwhelmed by information overload and do not easily and quickly obtain the information that best suits their needs, limit their experience in the use of information retrieval systems. There are more than a trillion websites on the Internet and every day there is an exponential increase in the amount of information available. Generating new opportunities and different challenges for users when they try to obtain relevant information. Due to the large amount of information available on the Internet and the difficulty of assimilating it, users rely on information retrieval systems (IRS) to find what they are looking for. Information retrieval systems using different tools, methods and techniques retrieve public information from the web for later analysis, selecting and ordering the most relevant information for the user's needs. Among the main sources of information are the component repositories, databases and search engines that allow to simplify and group relevant information, using certain concepts of information organization. The main objective of an SRI as proposed in [1] is to satisfy the user's need for information in a natural language query specified through a set of key words (see figure 1), which help identify the most relevant to the user.
Key Words: Semantic Web, information retrieval, processing, relevance, semantic annotation, similarity 1. INTRODUCTION The development of society, the emergence of technologies and tools to improve access to information and the rapid growth of the Internet in recent years, has
© 2017, IRJET
|
Impact Factor value: 5.181
|
ISO 9001:2008 Certified Journal
|
Page 1