Research and Implementation of a Bill-Oriented OCR Recognition Algorithm
Abstract: As bills come into ever wider use, storing and managing them and searching the information they contain has become increasingly cumbersome. When recognizing the information on bills, we found that the characters in round seals cannot be recognized accurately. This work therefore studies the accurate recognition of ring-arranged characters and seal text, and implements a bill-oriented OCR recognition algorithm. Using Canny edge detection, the Hough transform, a polar coordinate transformation, and an algorithm for determining the starting point of the polar coordinate transformation, the method unwraps the seal in the order in which its text is arranged and successfully recognizes the text it contains. Experimental results show that the recognition accuracy for seal text reaches 83.84%.
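The polar coordinate transformation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the seal's center and radii are already known (for example, from circle detection with a Hough transform) and that a starting angle has been chosen by some other means. The function name `unwrap_ring` and all of its parameters are hypothetical, and nearest-neighbour sampling is used only to keep the example short.

```python
import math


def unwrap_ring(image, cx, cy, r_inner, r_outer, start_deg=0.0, n_angles=360):
    """Unwrap a ring of a grayscale image into a rectangle (illustrative only).

    image     : 2D list of pixel values, indexed as image[y][x]
    cx, cy    : assumed seal center in pixel coordinates
    r_inner,
    r_outer   : integer radii bounding the ring that holds the text
    start_deg : assumed starting angle of the unwrapping, in degrees
    n_angles  : number of angular samples (output columns)

    Output rows run from r_outer (top) down to r_inner (bottom), so that
    characters whose tops point outward come out upright. Columns sweep
    clockwise on screen, because the image y axis points downward.
    """
    height = len(image)
    width = len(image[0])
    rows = []
    for r in range(r_outer, r_inner - 1, -1):
        row = []
        for k in range(n_angles):
            theta = math.radians(start_deg + 360.0 * k / n_angles)
            # Nearest-neighbour lookup of the source pixel.
            x = int(round(cx + r * math.cos(theta)))
            y = int(round(cy + r * math.sin(theta)))
            inside = 0 <= x < width and 0 <= y < height
            row.append(image[y][x] if inside else 0)
        rows.append(row)
    return rows


# Tiny synthetic example: four marked pixels on a circle of radius 5.
demo = [[0] * 21 for _ in range(21)]
demo[10][15] = 1  # right of center  (0 degrees)
demo[15][10] = 2  # below center     (90 degrees)
demo[10][5] = 3   # left of center   (180 degrees)
demo[5][10] = 4   # above center     (270 degrees)

from_right = unwrap_ring(demo, 10, 10, 5, 5, start_deg=0.0, n_angles=4)
from_bottom = unwrap_ring(demo, 10, 10, 5, 5, start_deg=90.0, n_angles=4)
```

Changing `start_deg` only rotates the columns of the unwrapped strip. This is why the choice of starting point matters: if the cut falls in the middle of the text, the recognizer sees the string split in two and out of order.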