Web information mining and semantic analysis in heterogeneous unstructured text data using enhanced latent Dirichlet allocation. (18th October 2022)
- Record Type:
- Journal Article
- Title:
- Web information mining and semantic analysis in heterogeneous unstructured text data using enhanced latent Dirichlet allocation. (18th October 2022)
- Main Title:
- Web information mining and semantic analysis in heterogeneous unstructured text data using enhanced latent Dirichlet allocation
- Authors:
- Venugopal, Madamanchi
Sharma, Virendra K.
Sharma, Kalpana - Abstract:
- Summary: Information mining and semantic analysis have gained significant attention over recent years to obtain appropriate information from unstructured data. Several approaches have been introduced for web information mining. However, the expected accuracy is not reached by these approaches. Therefore, hybrid fuzzy clustering and enhanced latent Dirichlet allocation (ELDA) are proposed for the accuracy increment in this work. The information clustering process is performed using the hybrid fuzzy clustering algorithm called fuzzy‐C‐medoids optimized using improved whale algorithm. The clustering procedure entails grouping data points into more similar clusters than data points from other clusters. Finally, the context of the text is recognized by analyzing the semantic information with ELDA, which offers a suitable index for accurate and fast data extraction. PYTHON tool is used to develop the proposed web mining model, and the simulation analysis of the proposed model is carried out using the BibTex dataset and compared with baseline models. The performance of the proposed method is evaluated using purity, normalized mutual information, accuracy, and precision metrics. Also, it is compared with different existing algorithms. Thus, the analysis of the results proved that the proposed approach achieves better outcomes than the existing approaches.
- Is Part Of:
- Concurrency and computation. Volume 35:Number 1(2023)
- Journal:
- Concurrency and computation
- Issue:
- Volume 35:Number 1(2023)
- Issue Display:
- Volume 35, Issue 1 (2023)
- Year:
- 2023
- Volume:
- 35
- Issue:
- 1
- Issue Sort Value:
- 2023-0035-0001-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2022-10-18
- Subjects:
- clustering -- optimization algorithm -- preprocessing -- semantic analysis -- text mining -- web mining
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cpe.7410 ↗
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 24678.xml