Please use this identifier to cite or link to this item:
http://localhost:8080/xmlui/handle/123456789/1790
Title: | Category Classification of the Training Set Combined with Sentence Multiplication for Semantic Data Extraction Using GENI Algorithm |
Authors: | Saravana Kumar Coimbatore Shanmugam Santhosh Rajendran Amudhavalli Padmanabhan Kalaiarasan Chellan |
Issue Date: | 2020 |
Publisher: | Bentham Sciences Publishers B.V |
Citation: | Recent Patents on Computer Science |
Abstract: | Background: Increase in the internet data has increased the priority in the data extraction accuracy. Accuracy here lies with what data the user has requested for and what has been retrieved. The same large data sets that need to be analyzed make the required information retrieval a challenging task. Objective: To propose a new algorithm in an improved way than the traditional methods to classify the category or group to which each training sentence belongs. Method: Identifying the category to which the input sentence belongs is achieved by analyzing the Noun and Verb of each training sentence. NLP is applied to each training sentence and the group or category classification is achieved using the proposed GENI algorithm so that the classifier is trained efficiently to extract the user requested information. Results: The input sentences are transformed into a data table by applying GENI algorithm for group categorization. Plotting the graph in R tool, the accuracy of the group extracted by the Classifier involving GENI approach is higher than that of Naive Bayes & Decision Trees. |
URI: | http://localhost:8080/xmlui/handle/123456789/1790 |
Appears in Collections: | Computer Science Engineering Department |
Files in This Item:
File | Size | Format | |
---|---|---|---|
CSE-41.docx | 16.53 kB | Microsoft Word XML | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.