Filtering Chinese Image Spam Using Pseudo‐OCR. Issue 1 (1st January 2015)
- Record Type:
- Journal Article
- Title:
- Filtering Chinese Image Spam Using Pseudo‐OCR. Issue 1 (1st January 2015)
- Main Title:
- Filtering Chinese Image Spam Using Pseudo‐OCR
- Authors:
- Xu, Bin
Li, Ruiguang
Liu, Yashu
Yan, Hanbing
Li, Siyuan
Zhang, Honggang
- Abstract:
- For image spam filtering, optical character recognition (OCR)‐based methods often achieve better performance owing to their more complex structure, which recognizes the corresponding text. However, applying traditional OCR techniques usually introduces shortcomings such as expensive computational cost and vulnerability to image noise and artificial interference, especially for Chinese image spam filtering. By optimizing the recognition procedure of traditional OCR, we therefore propose pseudo‐OCR, an approach better suited to Chinese image spam filtering, in which discriminating potential image spam character features from ham ones is sufficient and the characters need not be recognized. In addition, a novel Chinese key‐point‐based character feature specific to pseudo‐OCR is devised and extracted using a carefully designed algorithm, which outperforms classic corner detection methods in finding such key‐points. Experimental results show that the proposed system usually performs better than the traditional OCR‐based method while maintaining a low false positive rate.
- Is Part Of:
- Chinese journal of electronics. Volume 24:Issue 1(2015)
- Journal:
- Chinese journal of electronics
- Issue:
- Volume 24:Issue 1(2015)
- Issue Display:
- Volume 24, Issue 1 (2015)
- Year:
- 2015
- Volume:
- 24
- Issue:
- 1
- Issue Sort Value:
- 2015-0024-0001-0000
- Page Start:
- 134
- Page End:
- 139
- Publication Date:
- 2015-01-01
- Subjects:
- feature extraction -- filtering theory -- optical character recognition -- unsolicited e‐mail
Chinese image spam filtering -- pseudoOCR techniques -- optical character recognition -- text recognition -- image noises -- artificial interferences -- image spam character feature extraction -- Chinese key‐point based character feature extraction -- corner detection methods -- low false positive rate
Electronics -- Periodicals
Electronics -- China -- Periodicals
Electronics
China
Periodicals
621.38105
- Journal URLs:
- https://ietresearch.onlinelibrary.wiley.com/journal/20755597
http://ieeexplore.ieee.org/servlet/opac?punumber=7479413
http://ieeexplore.ieee.org/Xplore/home.jsp
- DOI:
- 10.1049/cje.2015.01.022
- Languages:
- English
- ISSNs:
- 1022-4653
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3180.317180
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 16468.xml