• Your current location :Home> Solutions> Government enterprises
    Solutions

    Hanvon OCR government&office e-government information acquisition system solutions

    AddTime:2016-08-23 09:22:16     Hits:0
      Bill/form/documents recognition---The application of“Hanvon OCR SDK”in the field of government office
      With the pace of society information continually accelerating,people have unprecedented desires for the fast and ordered information.Especially the fast development of Internet/Intranet technology provides guarantees for the exchange and share of information as well as collaborative operations of team and it also brings fresh exchanges of information and application modes.Our construction of e-government affairs also enters a brand-new stage.With the improvement of information degree of all industries,the applications of OA and ERP become more and more popularized in government agency and enterprise and public institutions.In recent years,both state organs,public institutions and large and medium enterprises all puts the most money into the infrastructure of network hardware.However,many enterprises have already set up OA systems with no resource or fragmentary resources that troubles leaders can not make decisions according to complete information.Now there are some strange phenomena in many enterprises that on the one hand,with high cost,OA system can not be made full use due to the incomplete information,on the other hand,secretaries have original documents in hand walking among each department.With the further application of Hanvon OCR technology in office automation system,this technology has laid a firm foundation for state organs and enterprises and public institutions to truly achieve paperless office.
      The foundation of OA is the understanding to management and accumulation of information while technology is only tools of OA.Only when we have profound understanding of management as well as management services,will OA comes to play.Only when the information occurring in the office process has ordered accumulation and sedimentary accretion,will OA plays a role.
      The application of Hanvon OCR technology can achieve documents’automatic type-in,storage and management,which not only thoroughly works out the problem of resource starvation and incompleteness,but also makes paperless office possible.
      The application of Hanvon OCR technology completes and enriches OA resources which makes leaders can make decisions on the basis of complete information and lets secretaries relax from tedious typing.It really achieves rapid delivery of e-documents in LAN or WAN.
      1、user requirements
      Information accumulation is the foundation to truly realize OA.Office,actually,refers to the process of documents’making,modification,delivery,identification,saving,destroying and archiving.Traditional mode mainly uses paper-based medium,but in the tide of information revolution,it obviously can not meet the needs of highly efficient and fast-paced modern work and life.How to realize automation of information processing and paperless OA has gradually attracted people’s attention.
     

     
      Traditional office modes leave us millions of paper documents.How to rapidly transfer the paper information into computer-readable digital information,the inputting way seems to be especially important.OA users will achieve the overall goal of paper document input way in the following aspects:
      1)Realize automatic type-in,storage and management of various documents
      2)Realize resource sharing truly and fully
      3)Eliminate and avoid“dead space”of document circulation to make information operation flexible vertically and horizontally.
      4)Offer leaders sufficient and complete data to provide a basis for decisions.
      5)Improve work efficiency,lower cost and avoid overlapping investment.
      6)Easy to use;try to combine with traditional office mode
      2、Hanvon OCR technology in government office/E-government affairs system
      In the information era,people generally use a computer to improve personal work efficiency.The notion of OA has been raised for years,but its effect is not obvious.People still remain at OA’s primary stage of single-machine word and form processing.How to increase the overall productivity?The key point is to establish a safe,reliable,open and efficient information networks,OA,and information management computerization system.This system provides modern daily office conditions for administrative department and ample integrated information services.It realizes archives management automation and office TP automation that improves office efficiency and management level.Besides,this system also makes daily business work of each department become standardization,electronization and normalization and enhances manageability to documentary archives,personnel archives,technical record,finance files etc.In addition,the system realizes information on-line inquiry and on-line borrowing and finally achieves paperless office.
     

     
      OA system integrator integrates“Hanvon standard print form OCR SDK development kit”with its developed OA system by organic integration of OCR technology and scanners,which efficiently solves the bottleneck problem of information inputting in other related fields including information-based government administration and enterprise information etc.
      Flow chart of“Hanvon OCR technology”application in government office/E-government affairs system is as shown in the figure below:
     
      Date sorting:
      For ease of scanning and data classification,dismounting,naming and numbering for further inquiring and searching.
      Scanning:
      Scanning is a process to input paper manuscript images into computer.Generally,the related manuscripts are orderly scanned and after automatic detection and amendment of scanning quality control procedure,the scanned manuscripts will automatically save into data base.
      Image processing:
      In order to increase recognition rate,images will be eliminated blue and cleaned,that is,to remove noise that may effect image recognition rate,such as pocking marks,underline etc.Image quality control procedure will automatically monitor picture processing quality.
      Layout Analysis:
      Automatically understand and locate layouts,distinguish restricted places from horizontal text area,vertical text area,form area and image area,automatic layout analysis runs in background and operator can affirm on the front end with manual intervention when necessary.
      Recognition:
      Transfer character images into computer language code;recognize Chinese in block letter(simplified Chinese character&complicated Chinese character),form,Chinese and English hybrid alignment;the recognized character encodes include GB,BIG5,GBK.Recognition process runs in the background.
      Vertical Proof-reading:
      Very strong error checking and correcting capabilities;it is to recognize one image or some images into a same character image to display in rank and highlight colors to mark doubtful words for ease of finding and modifying mistakes.
      Horizontal Proof-reading:
      It is a traditional manual proofreading method that operator can directly compare recognition results and original images for finding error characters.System will automatically call out corresponding image and make comparison,at the same time,use striking color to mark characters with low reliability.
      Layout Restore:
      Restore the modified text into the same layout as the original one for online reading and inquiring digitization documents in RTF、PDF、HTML formats.
      Data Loading:
      Storage of digitization documents
      The process of OCR inputting data base includes scanning&inputting,image processing,layout analysis,recognition,vertical proof-reading,horizontal proof-reading,layout restore etc.
      Inputting functions of common paper documents:
      The supported document layout is leading in China that support division and recognition of complex layout.Legends are as follow:
     





     
      Inputting functions of official document:
      It can exactly recognize areas of title,main body and official seal;and accurately restore recognized results according to original layout.Legends are as follow:


     
      Inputting functions of form:
      Hanvon OCR form recognition technology has strong recognition function and can provides operations relations between data logic and arithmetic.Particular processing function has been designed for forms that can meet various complex recognition needs and supports modification before data loading,which ensures the final loading data meet the demand.


     
      The realization of all above inputting work must completely be managed by the whole set e-government system.People in different levels will be limited to have rights to operate,use and manage resources by permission settings to departments and people.
      3、Comparison between“Hanvon standard print form OCR SDK development kit”recognized results and original images
      As mentioned above in the function introduction of“Hanvon standard print form OCR SDK development kit”,adopting“Hanvon standard print form OCR SDK development kit”can realize recognition of text information in designated areas,acquire text information(area in the image,typeface,reliability and candidate characters),save recognition results in forms of TXT、RTF、PDF etc.thereinto,RTF and PDF support layout restore of original layout.The following is the comparison between original image and specimen image as well as its recognition results used Hanvon OCR recognition technology
     
      Contrast between RTF restore result and original image
      
    Contrast between PDF restore result and original image

     
      Contrast between PDF official document restore result and original image


     


    Previous Article :Hanvon OCR VAT invoice automatic input system solutions
    Next Article :Hanvon “Easy Recognition” almighty image-text recognition system application scheme

    Copyright © 2017  Beijing Hanvon ZhiXue Technology Co.,Ltd    京ICP备05039526   京公网安备11010802012909