Bonfring International Journal of Software Engineering and Soft Computing

Impact Factor: 0.375 | International Scientific Indexing(ISI) calculate based on International Citation Report(ICR)


Enhanced Automatically Mining Facets for Queries and Clustering with Side Information Model

K. Vidhya and N. Saravanan


Abstract:

In this paper describe a specific type of summaries that Query facet the main topic of given text. Existing summarization algorithms are classified into different categories in terms of their summary construction methods (abstractive or extractive), the number of sources for the summary (single document or multiple documents), types of information in the summary (indicative or informative), and the relationship between summary and query (generic or query-based). QD Miner aims to offer the possibility of finding the main points of multiple documents and thus save users? time on reading whole documents. The difference is that most existing summarization systems dedicate themselves to generating summaries using sentences extracted from documents. In addition, return multiple groups of semantically related items, while they return a flat list of sentences. In this paper, adding these lists may improve both accuracy and recall of query facets. Part-of-speech information can be used to check the homogeneity of lists and improve the quality of query facets. The side-information could not be incorporate into the mining process, because it can either improve the quality of the representation for the mining process, or can add noise to the process. Therefore, a principle way is required to perform the mining process, so as to maximize the advantages from using this side information. This dissertation proposes an algorithm which combines classical partitioning algorithms with probabilistic models in order to create an effective clustering approach.

Keywords: Data Mining, Classification, TF-IDF, K-Mean Clustering, Statistical Mean Validation.

Volume: 8 | Issue: 2

Pages: 01-06

Issue Date: April , 2018

DOI: 10.9756/BIJSESC.8387

Full Text

Email

Password

 


This Journal is an Open Access Journal to Facilitate the Research Community