一种多样变换的手写验证码自动识别算法的研究及应用
Research on Automatically Identifying Algorithm and Application about Handwritten CAPTCHA of Kinds of Diverse Transformation
摘要: 研究验证码自动识别技术可以进一步提升人识别验证码的可读性,增强机器识别的难度,从而提高网络安全性。针对目前提出的验证码识别方法基本都是采用光学字符识别(OCR)方法对机器写的标准字符进行识别,本文提出了一种多样变换的手写验证码自动识别算法,对彩色验证码进行识别主要包括彩色验证码的二值化、手写字符的区域分割、同一字符的区域连接、使用卷积神经网络对手写字符进行训练、手写字符识别。本文的实现结果明显优于OCR的识别结果。结果表明通过该网站的测试,基本上能自动识别该网站的验证码。
Abstract: Research on technique of automatically identifying CAPTCHA can promote people to identify the readability of verifying the code further and strengthen the difficulty that the machine identifies and raises a network safety thus. Currently aiming at the CAPTCHA identification methods are basically used optical character recognition (OCR) method to identify the standard characters written by the machine. The paper puts forward color CAPTCHA identifies to mainly include color verification code binary by threshold, the connect district segmentation of the handwritten character list, the near district of the same character list links, use convolution neural network to train a character and to identify handwritten character. The paper realization obviously surpasses an identifying of OCR result. The result shows CAPTCHA of the website that is basically passed by the website test; the website can automatically identify CAPTCHA of the website.
文章引用:王春才, 孙媛媛. 一种多样变换的手写验证码自动识别算法的研究及应用[J]. 计算机科学与应用, 2017, 7(11): 1059-1066. https://doi.org/10.12677/CSA.2017.711120

参考文献

[1] Yan, J., El Ahmad, A.S., et al. (2008) A Low-Cost Attack on a Microsoft CAPTCHA School of Computing Science. Newcastle University, UK.
[2] 程治国, 刘允才. 一种通用的去除文字图像中干扰线的算法[J]. 上海交通大学学报, 2005, 39(8): 1288-1291.
[3] 李兴国, 高炜, 黄江林. 基于滴水算法的验证码中粘连字符分割方法[J]. 计算机工程与应用, 2014, 50(1): 163-166.
[4] 汪洋, 许映秋, 彭艳兵. 基于KNN技术的校内网验证码识别[J]. 计算机与现代化, 2017(2): 93-97.
[5] 殷光. 基于SVM的验证码识别算法研究[D]: [硕士学位论文]. 合肥: 安徽大学, 2010: 1-54.
[6] 陈超, 毛坚桓, 刘寅. 基于卷积神经网络的铁路货运网站验证码识别[J]. 指挥信息系统与技术, 2016, 7(4): 91-96.
[7] 田怀川. 基于神经网络的图形验证码识别及防识别的研究与应用[D]: [硕士学位论文]. 哈尔滨: 哈尔滨工业大学, 2010: 1-73.