Text segmentation of health examination item based on character statistics and information measurement. Issue 1 (29th March 2018)
- Record Type:
- Journal Article
- Title:
- Text segmentation of health examination item based on character statistics and information measurement. Issue 1 (29th March 2018)
- Main Title:
- Text segmentation of health examination item based on character statistics and information measurement
- Authors:
- An, Hui
Wang, Dahui
Pan, Zhigeng
Chen, Meiling
Wang, Xinting - Abstract:
- Abstract : This study explores the segmentation algorithm of item text data, especially of single long length data in health examination. In the specific implementation, a large amount of historical health examination data is analysed. Using the method of character statistics, the connection tightness values T AB s between two adjacent characters are calculated. Three parameters, the candidate number N, the best position BP, and balance weight BW are set. The total segmentation indexes SIs are calculated, thus determined the segmentation position Pos. The optimal parameter values are determined by the method of information measurement. Experimental results show that the accuracy rate is 78.6% and reaches 82.9% in the most frequently appeared text item. The complexity of the algorithm is O ( n ). Using no existing domain knowledge, it is very simple and fast. By executed repeatedly, it is convenient to obtain the characteristics of each single item of text data, furthermore, to distinguish respective express preference of different physicians to the same item. The assumption is verified that without professional domain knowledge, a large amount of historical data can provide valuable clues for the text understanding. The results of this research are being applied and verified in the following research works in the field of health examination.
- Is Part Of:
- CAAI transactions on intelligence technology. Volume 3:Issue 1(2018)
- Journal:
- CAAI transactions on intelligence technology
- Issue:
- Volume 3:Issue 1(2018)
- Issue Display:
- Volume 3, Issue 1 (2018)
- Year:
- 2018
- Volume:
- 3
- Issue:
- 1
- Issue Sort Value:
- 2018-0003-0001-0000
- Page Start:
- 28
- Page End:
- 32
- Publication Date:
- 2018-03-29
- Subjects:
- image segmentation -- text analysis -- data mining -- graph theory -- data analysis
text segmentation -- health examination item -- character statistics -- information measurement -- segmentation algorithm -- item text data -- single long length data -- historical health examination data -- connection tightness values -- adjacent characters -- position BP -- balance weight BW -- total segmentation indexes SIs -- segmentation position Pos -- optimal parameter values -- frequently appeared text item -- existing domain knowledge -- single item -- professional domain knowledge -- historical data -- text understanding -- domain knowledge graph -- intelligent method -- automated method -- automatic data analysis -- information classification -- health assessment
C1160 Combinatorial mathematics -- C6130 Data handling techniques -- C6170K Knowledge engineering techniques -- C7330 Biology and medical computing
Artificial intelligence -- Periodicals
Computer science -- Periodicals
Artificial intelligence
Computer science
Electronic journals
Periodicals
006.305 - Journal URLs:
- https://digital-library.theiet.org/content/journals/trit ↗
https://ietresearch.onlinelibrary.wiley.com/journal/24682322 ↗
http://search.ebscohost.com/login.aspx?direct=true&site=edspub-live&scope=site&type=44&db=edspub&authtype=ip, guest&custid=ns011247&groupid=main&profile=eds&bquery=AN%2010129651 ↗
http://www.sciencedirect.com/ ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1049/trit.2018.0005 ↗
- Languages:
- English
- ISSNs:
- 2468-6557
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2943.720000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 16698.xml