Advanced text documents information retrieval system for search services

Chiranjeevi, H S and Shenoy, Manjula K (2020) Advanced text documents information retrieval system for search services. Cogent Engineering. ISSN 2331-1916

[img] PDF
11251.pdf - Published Version
Restricted to Registered users only

Download (3MB) | Request a copy
Official URL:


Information technology has explored the growth of text documents data in many organizations and the structural arrangement of voluminous data is a complex task. Handling the text document data is a challenging process involving not only the training of models but also numerous additional procedures, e.g., data pre-processing, transformation, and dimensionality reduction. In this paper, we describe the system’s architecture, the technical challenges, and the novel solution we have built. We propose a Recurrent Convolutional Neural network (RCNN), based text information retrieval system which efficiently retrieves the text documents and information for the user query. Pre-processing using tokenization and stemming, retrieval using TF-IDF (Term Frequency-Inverse Document Frequency), and RCNN classifier which captures the contextual information is implemented. A real-time advanced search system is developed on a huge set of MAHE University dataset. The performance of the proposed text document retrieval system is compared with other existing algorithms and the efficacy of the method is discussed. The proposed RCNN-based text document information retrieval model performs better in terms of precision, recall, and F-measure. A high-quality and high-performance text docu�ment retrieval search system is presented

Item Type: Article
Uncontrolled Keywords: : information technology; text documents; search engine; information retrieval; tokenization; recurrent convolutional neural network; retrieval efficiency
Subjects: Engineering > MIT Manipal > Information and Communication Technology
Depositing User: MIT Library
Date Deposited: 23 Apr 2021 09:11
Last Modified: 23 Apr 2021 09:11

Actions (login required)

View Item View Item