• Title/Summary/Keyword: AI-OCR

Deep Learning OCR based document processing platform and its application in financial domain (금융 특화 딥러닝 광학문자인식 기반 문서 처리 플랫폼 구축 및 금융권 내 활용)

  • Dongyoung Kim;Doohyung Kim;Myungsung Kwak;Hyunsoo Son;Dongwon Sohn;Mingi Lim;Yeji Shin;Hyeonjung Lee;Chandong Park;Mihyang Kim;Dongwon Choi
    • Journal of Intelligence and Information Systems / v.29 no.1 / pp.143-174 / 2023
  • With the development of deep learning technologies, Artificial Intelligence-powered Optical Character Recognition (AI-OCR) has evolved to accurately read multiple languages from many kinds of images. In the financial industry, where a large number of diverse documents are processed manually, the potential for AI-OCR is great. In this study, we present the configuration and design of an AI-OCR modality for use in the financial industry and discuss the construction of the platform with application cases. Since the use of financial domain data is prohibited under the Personal Information Protection Act, we developed a deep learning-based data generation approach and used it to train the AI-OCR models. The AI-OCR models are trained for image preprocessing, text recognition, and language processing, and are deployed on a platform with a microservice architecture to process a broad variety of documents. We demonstrated the AI-OCR platform by applying it to the financial domain tasks of document sorting, document verification, and typing assistance. The demonstrations confirm improved work efficiency and convenience.

Study on OCR Enhancement of Homomorphic Filtering with Adaptive Gamma Value

  • Heeyeon Jo;Jeongwoo Lee;Hongrae Lee
    • Journal of the Korea Society of Computer and Information / v.29 no.2 / pp.101-108 / 2024
  • AI-OCR (Artificial Intelligence Optical Character Recognition) combines OCR technology with Artificial Intelligence to overcome limitations that required human intervention. To enhance the performance of AI-OCR, training on diverse data sets is essential. However, the recognition rate declines when image colors have similar brightness levels. To solve this issue, this study employs Homomorphic filtering as a preprocessing step to clearly differentiate color levels, thereby increasing text recognition rates. While Homomorphic filtering is ideal for text extraction because of its ability to adjust the high and low frequency components of an image separately using a gamma value, it has the downside of requiring manual adjustments to the gamma value. This research proposes a range for gamma threshold values based on tests involving image contrast, brightness, and entropy. Experimental results using the proposed range of gamma values in Homomorphic filtering suggest a high likelihood for effective AI-OCR performance.
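
As an illustration of the preprocessing step described in the abstract above, the following is a minimal NumPy sketch of homomorphic filtering. It is not the authors' implementation: the Gaussian high-emphasis transfer function, the function name `homomorphic_filter`, and the default values of `gamma_low`, `gamma_high`, `cutoff`, and `sharpness` are assumptions chosen for illustration, and the gamma range proposed in the paper is not reproduced here.

```python
import numpy as np


def homomorphic_filter(image, gamma_low=0.5, gamma_high=1.5,
                       cutoff=0.1, sharpness=1.0):
    """Apply a homomorphic filter to a 2-D grayscale image.

    gamma_low scales low frequencies (slowly varying illumination) and
    gamma_high scales high frequencies (edges, character strokes).
    All parameter defaults are illustrative assumptions.
    """
    img = np.asarray(image, dtype=np.float64)

    # The log turns the multiplicative illumination * reflectance model
    # into a sum, so the two parts can be separated by frequency.
    log_img = np.log1p(img)
    spectrum = np.fft.fft2(log_img)

    # Squared distance from the zero frequency, in cycles per pixel,
    # laid out to match the unshifted FFT output.
    fy = np.fft.fftfreq(img.shape[0])[:, None]
    fx = np.fft.fftfreq(img.shape[1])[None, :]
    dist_sq = fx ** 2 + fy ** 2

    # Gaussian high-emphasis transfer function: equals gamma_low at the
    # zero frequency and approaches gamma_high at high frequencies.
    high_pass = 1.0 - np.exp(-sharpness * dist_sq / cutoff ** 2)
    transfer = (gamma_high - gamma_low) * high_pass + gamma_low

    filtered = np.real(np.fft.ifft2(spectrum * transfer))

    # Undo the log to return to the intensity domain.
    return np.expm1(filtered)
```

In this sketch, setting `gamma_high` above 1 emphasizes fine detail such as character strokes, while setting `gamma_low` below 1 compresses slowly varying illumination. Choosing those two values by hand is the manual tuning step that the abstract describes as the method's drawback.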

Proposal Record Automation Service Based on AI by Using OCR and Pattern Analysis Algorithm (OCR과 패턴분석 알고리즘을 활용한 인공지능 기반 기록 자동화 서비스 제안)

  • Hwang, Yun-Young
    • Proceedings of the Korea Information Processing Society Conference / 2019.10a / pp.530-532 / 2019
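  • The proposed service uses OCR (Optical Character Recognition) and a deep learning pattern analysis algorithm to manage documents efficiently, and it provides features for users who take many handwritten notes. Although machine learning-based OCR has recently seen growing use in various fields, existing applications combine pattern analysis algorithms with statistics-based OCR, so their recognition rate for handwriting is not high. This paper therefore proposes a service that uses OCR and a pattern analysis algorithm to provide a high recognition rate for handwriting.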

A Case Study on the Application of AI-OCR for Data Transformation of Paper Records (종이기록 데이터화를 위한 AI-OCR 적용 사례연구)

  • Ahn, Sejin;Hwang, Hyunho;Yim, Jin Hee
    • Journal of the Korean Society for Information Management / v.39 no.3 / pp.165-193 / 2022
  • Digital technology is at the center of change in the modern work environment. In particular, for public institutions that document their work through records produced by business management systems and document production systems, the records management system is itself the work environment. Gimpo City applied for the 2021 public cloud leading project of the National Information Society Agency (NIA) to respond proactively to the era of Fourth Industrial Revolution technologies, and carried out a public cloud-based AI-OCR technology enhancement project with 330 million won in support. Through this project, paper records were converted into data, overcoming the limitations of non-electronic records, which had been restricted to search and image viewing that depend on standardized index values. In addition, a 98% recognition rate was achieved by applying AI-OCR. Because digital technology has been used to improve work efficiency, productivity, development cost, and the level of records management service for internal and external users, we share directions for strengthening expertise in records management and for innovating the work environment.

A Study on the Automatic Recognition of AI-based Port Documents Using OCR - Based on the application of KNN algorithm- (OCR을 이용한 AI기반 항만서류 자동인식에 관한 연구 -KNN 알고리즘 적용을 중심으로-)

  • Kim, Jong-Eun
    • Proceedings of the Korea Information Processing Society Conference / 2019.10a / pp.872-875 / 2019
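  • Most of Korea's import and export cargo volume is handled through ports, and the diversity of cargo handled and the growing size of vessels have intensified competition among the ports of neighboring countries, driving up port costs. Making port operations more efficient and productive can be expected to reduce these costs, and this can be achieved by applying artificial intelligence technologies (OCR, AI algorithms, machine learning, RPA, etc.), which are key technologies of the Fourth Industrial Revolution. This study applies such technologies to actual port operations and technically verifies the resulting gains in efficiency and productivity, with the aim of strengthening port competitiveness and advancing national logistics technology.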

Pet Disease Prediction Service and Integrated Management Application (반려동물 질병예측서비스 및 통합관리 어플리케이션)

  • Ki-Du Pyo;Dong-Young Lee;Won-Se Jung;Oh-Jun Kwon;Kyung-Suk Han
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.23 no.6 / pp.133-137 / 2023
  • In this paper, we developed a 'comprehensive pet management application' that combines pet AI diagnosis, animal hospital search, smart household accounts, and community functions. The application removes the inconvenience of having to use a separate application for each function: users can easily access pet AI diagnosis services through photos, view animal hospital information collected by crawling, find nearby animal hospitals, and keep smart household accounts by scanning receipts with OCR text extraction. With this application, the information needed to raise pets, such as their health and spending details, can be managed in one system.

A Design and Implementation of Generative AI-based Advertising Image Production Service Application

  • Chang Hee Ok;Hyun Sung Lee;Min Soo Jeong;Yu Jin Jeong;Ji An Choi;Young-Bok Cho;Won Joo Lee
    • Journal of the Korea Society of Computer and Information / v.29 no.5 / pp.31-38 / 2024
  • In this paper, we propose ASAP (AI-driven Service for Advertisement Production), an application that provides a generative AI-based automatic advertising image production service. The application uses GPT-3.5 Turbo Instruct to generate a suitable background mood and promotional copy from user-entered keywords. It then uses OpenAI's DALL·E 3 model and Stability AI's SDXL model to generate background images and text images from these outputs. OCR technology is employed to improve the accuracy of the text images, and all generated outputs are composited to create the final advertisement. Additionally, using the PILLOW and OpenCV libraries, text boxes are implemented to insert details such as phone numbers and business hours at the edges of promotional materials. This application offers a simple and cost-effective solution to small business owners who have difficulty producing advertisements.

Trends in Deep Learning-based Medical Optical Character Recognition (딥러닝 기반의 의료 OCR 기술 동향)

  • Sungyeon Yoon;Arin Choi;Chaewon Kim;Sumin Oh;Seoyoung Sohn;Jiyeon Kim;Hyunhee Lee;Myeongeun Han;Minseo Park
    • The Journal of the Convergence on Culture Technology / v.10 no.2 / pp.453-458 / 2024
  • Optical Character Recognition (OCR) is a technology that recognizes text in images and converts it into digital format. Because of its high recognition performance, deep learning-based OCR is used in many industries that hold large quantities of recorded data. The medical industry has actively introduced deep learning-based OCR to improve medical services. In this paper, we discuss trends in OCR engines and medical OCR and provide a roadmap for the development of medical OCR. Current medical OCR has improved its recognition performance by applying natural language processing to detected text data. However, recognition performance remains limited, especially for non-standard handwriting and modified text. Developing advanced medical OCR requires building databases of medical data, image pre-processing, and natural language processing.

Recognition of Korean Menu for Online to Offline Stores : VGG-ResNet Fusion Model with Attention Mechanism (Online to Offline 상점을 위한 한글 메뉴판 인식 : 어텐션 메커니즘을 적용한 VGG-ResNet 융합 모델)

  • Jongwook Si;Sangjin Lee;Sungyoung Kim
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.17 no.4 / pp.190-197 / 2024
  • The O2O store model dissolves the boundaries between online and offline platforms, providing significant convenience to customers. To effectively operate such platforms, small business owners must provide necessary information in digital format. Specifically, the process of digitizing Korean menus manually can lead to multiple issues, and the use of OCR technology often results in high error rates due to the low accuracy in recognizing Korean. In response, this paper proposes an enhanced OCR model based on the popular EasyOCR framework, aimed at improving the recognition accuracy of Korean. The proposed model integrates the structural advantages of VGG and ResNet, and incorporates an attention mechanism to significantly improve the recognition performance of Korean. Moreover, experimental results indicate that the proposed model achieved approximately a 3.5% improvement in accuracy and around a 1% improvement in both confidence score and normalized edit distance compared to EasyOCR. Therefore, this demonstrates that the proposed method effectively addresses the existing challenges.
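
The abstract above reports normalized edit distance as one of its evaluation metrics but does not define it. The sketch below assumes one common convention, a similarity score of 1 minus the Levenshtein distance divided by the length of the longer string, which may differ from the definition used in the paper. The function names `levenshtein` and `normalized_edit_similarity` and the menu strings in the usage example are hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, and
    substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute or match
            ))
        previous = current
    return previous[-1]


def normalized_edit_similarity(predicted: str, truth: str) -> float:
    """Assumed convention: 1 - distance / max(len); 1.0 is a perfect match."""
    longest = max(len(predicted), len(truth))
    if longest == 0:
        return 1.0  # two empty strings are identical
    return 1.0 - levenshtein(predicted, truth) / longest


# Hypothetical example: one wrong syllable in a four-syllable menu item.
score = normalized_edit_similarity("김치찌게", "김치찌개")
```

Because the comparison is made per Unicode code point, each precomposed Hangul syllable counts as one character here; a metric computed at the jamo level would give different values.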

Case Studies for Insurance Service Marketing Using Artificial Intelligence(AI) in the InsurTech Industry. (인슈어테크(InsurTech)산업에서의 인공지능(AI)을 활용한 보험서비스 마케팅사례 연구)

  • Jo, Jae-Wook
    • Journal of Digital Convergence / v.18 no.10 / pp.175-180 / 2020
  • Through case studies of insurance service marketing using artificial intelligence (AI) in the insurtech industry, this study investigated how innovative technologies (artificial intelligence, machine learning, etc.) are being used in insurance ecosystems. In particular, through domestic and international cases, it examined Lemonade's service for insurance contracting and indemnity payment and an AI company's service that calculates compensation from OCR-processed medical certificate images, both of which brought disruptive innovation through artificial intelligence. The case analysis shows that these services drastically shortened the lead time for insurance contracts and payments by applying machine learning to large volumes of customer data. They also calculated accurate and reasonable compensation when estimating indemnity, an area with many disputes between customers and insurance companies. As a result, they were able to increase customer satisfaction and customer value.