• Title/Summary/Keyword: Handwritten Data

Search Result 90, Processing Time 0.026 seconds

Strategies and Challenges in Digitizing Archaeological Data (고고 디지털 아카이브 구축의 과제와 전략)

  • KIM Bumcheol
    • Korean Journal of Heritage: History & Science
    • /
    • v.56 no.1
    • /
    • pp.6-19
    • /
    • 2023
  • As data management and intelligence capability become proxy indicators of national power, the risk provoked by high depending on digital technology ironically increases. The quicker the changes come to be, the more important digitizing existing data and management of digital data are. The management of archaeological data could not be exceptional. It has to be performed in a more comprehensive, systematic and rapid manner. In order to perform the task, the nature of archaeological data contained in the digital archive should be properly recognized in advance: the primary data are generated by excavation as a process destroying their sources, the data are enormous in type and quantity, including long-term and various human experience, and the natural extinction of primary data in handwritten form is likely to be more crucial than in any other discipline. These characteristics of archaeological data unimaginably devastated the possibility of recovering archives, when we face a digital dark age. Considering both recent trend and the nature of archaeological data mentioned above, we can derive strategies for building a sustainable archaeological digital archive. As an archaeology-major consumer of the digital data, I propose four strategic considerations: ① establishing a system of digital data literacy; ② enhancing evaluation and capability of data reuse; ③ building an international data sharing system; ④ developing it into the platform for digital archaeology.

Extraction of Important Areas Using Feature Feedback Based on PCA (PCA 기반 특징 되먹임을 이용한 중요 영역 추출)

  • Lee, Seung-Hyeon;Kim, Do-Yun;Choi, Sang-Il;Jeong, Gu-Min
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.13 no.6
    • /
    • pp.461-469
    • /
    • 2020
  • In this paper, we propose a PCA-based feature feedback method for extracting important areas of handwritten numeric data sets and face data sets. A PCA-based feature feedback method is proposed by extending the previous LDA-based feature feedback method. In the proposed method, the data is reduced to important feature dimensions by applying the PCA technique, one of the dimension reduction machine learning algorithms. Through the weights derived during the dimensional reduction process, the important points of data in each reduced dimensional axis are identified. Each dimension axis has a different weight in the total data according to the size of the eigenvalue of the axis. Accordingly, a weight proportional to the size of the eigenvalues of each dimension axis is given, and an operation process is performed to add important points of data in each dimension axis. The critical area of the data is calculated by applying a threshold to the data obtained through the calculation process. After that, induces reverse mapping to the original data in the important area of the derived data, and selects the important area in the original data space. The results of the experiment on the MNIST dataset are checked, and the effectiveness and possibility of the pattern recognition method based on PCA-based feature feedback are verified by comparing the results with the existing LDA-based feature feedback method.

A Study on Improvement of Korean OCR Accuracy Using Deep Learning (딥러닝을 이용한 한글 OCR 정확도 향상에 대한 연구)

  • Kang, Ga-Hyeon;Ko, Ji-Hyun;Kwon, Yong-Jun;Kwon, Na-Young;Koh, Seok-Ju
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.05a
    • /
    • pp.693-695
    • /
    • 2018
  • In this paper, we propose the improvement of Hangul OCR accuracy through deep learning. OCR is a program that senses printed and handwritten characters in an optical way and encodes them digitally. In the case of the most commonly used Tesseract OCR, the accuracy of English recognition is high. However, Hangul has lower accuracy because it has less learning data for a complex structure. Therefore, in this study, we propose a method to improve the accuracy of Hangul OCR by extracting the character region from the desired image through image processing and using deep learning using it as learning data. It is expected that OCR, which has been developed only by existing alphanumeric and several languages, can be applied to various languages.

  • PDF

A Developing a Machine Leaning-Based Defect Data Management System For Multi-Family Housing Unit (기계학습 알고리즘 기반 하자 정보 관리 시스템 개발 - 공동주택 전용부분을 중심으로 -)

  • Park, Da-seul;Cha, Hee-sung
    • Korean Journal of Construction Engineering and Management
    • /
    • v.24 no.5
    • /
    • pp.35-43
    • /
    • 2023
  • Along with the increase in Multi-unit housing defect disputes, the importance of defect management is also increased. However, previous studies have mostly focused on the Multi-unit housing's 'common part'. In addition, there is a lack of research on the system for the 'management office', which is a part of the subject of defect management. These resulted in the lack of defect management capability of the management office and the deterioration of management quality. Therefore, this paper proposes a machine learning-based defect data management system for management offices. The goal is to solve the inconvenience of management by using Optical Character Recognition (OCR) and Natural Language Processing (NLP) modules. This system converts handwritten defect information into online text via OCR. By using the language model, the defect information is regenerated along with the form specified by the user. Eventually, the generated text is stored in a database and statistical analysis is performed. Through this chain of system, management office is expected to improve its defect management capabilities and support decision-making.

Research on the Construction of an Automation Model for Maintenance Managers Based on Smart Devices (스마트 디바이스 기반 유지보수 관리자용 자동화 모델 구축에 관한 연구)

  • Park, Jihwan;Chung, Suwan;Lee, Seojoon;Song, Jinwoo;Kwon, Soonwook
    • Korean Journal of Construction Engineering and Management
    • /
    • v.22 no.1
    • /
    • pp.72-80
    • /
    • 2021
  • Based on the previous year's statistics, 37% of buildings in South Korea are aged over 30 years. As the number of the aging buildings increases, so does the need for maintenance. Building maintenance involves a significant number of works; the work of 'maintenance manager' accounting for the largest part. Currently, the maintenance history record is mostly in drawing or handwritten form which makes reviewing the data highly time consuming. Therefore, to improve the convenience of maintenance works and optimize historical data management, the existing maintenance process was analyzed. Problems were derived and a smart device-based automation model was established. In order to establish a smart device-based automation model, ① general flow of facility management process was analyzed and related articles were reviewed, ② current maintenance process was optimized, ③ functional block diagram of BIM Data, COBie Data, IoT, and AR-based automated maintenance management model was created, ④ a smart device-based automated maintenance management model was constructed, ⑤ finally, the above system was verified by testing the aforementioned model in the field site, evaluating the time required for the maintenance process and reviewing maintenance history data against the current one.

A High Order Product Approximation Method based on the Minimization of Upper Bound of a Bayes Error Rate and Its Application to the Combination of Numeral Recognizers (베이스 에러율의 상위 경계 최소화에 기반한 고차 곱 근사 방법과 숫자 인식기 결합에의 적용)

  • Kang, Hee-Joong
    • Journal of KIISE:Software and Applications
    • /
    • v.28 no.9
    • /
    • pp.681-687
    • /
    • 2001
  • In order to raise a class discrimination power by combining multiple classifiers under the Bayesian decision theory, the upper bound of a Bayes error rate bounded by the conditional entropy of a class variable and decision variables obtained from training data samples should be minimized. Wang and Wong proposed a tree dependence first-order approximation scheme of a high order probability distribution composed of the class and multiple feature pattern variables for minimizing the upper bound of the Bayes error rate. This paper presents an extended high order product approximation scheme dealing with higher order dependency more than the first-order tree dependence, based on the minimization of the upper bound of the Bayes error rate. Multiple recognizers for unconstrained handwritten numerals from CENPARMI were combined by the proposed approximation scheme using the Bayesian formalism, and the high recognition rates were obtained by them.

  • PDF

Feature extraction motivated by human information processing method and application to handwritter character recognition (인간의 정보처리 방법에 기반한 특징추출 및 필기체 문자인식에의 응용)

  • 윤성수;변혜란;이일병
    • Korean Journal of Cognitive Science
    • /
    • v.9 no.1
    • /
    • pp.1-11
    • /
    • 1998
  • In this paper, the features which are thought to be used by humans based on the psychological experiment of human information processing are applied to character recognition problem. Man will deal with a little large area information as well as pixel by pixel information. Therefore we define the feature that represents a little wide region I information called region feature, and combine the features derived from region feature and pixel by pixel features that have been used by now. The features we used are the result of region feature based preanalysis, mesh with region attributes, cross distance difference and gradient. The training and test data in the experiment are handwritten Korean alphabets, digits and English alphabets, which are trained on neural network using back propagation algorithm and recognition results are 90.27-93.25%, 98.00% and 79.73-85.75%, respectively Experimental results show that the feature we are suggesting in this paper is 1-2% better than UDLRH feature similar in attribute to region feature, and the tendency of misrecognition is more easily acceptable by humans.

  • PDF

A Study on Character Recognition using Wavelet Transformation and Moment (웨이브릿 변환과 모멘트를 이용한 문자인식에 관한 연구)

  • Cho, Meen-Hwan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.10
    • /
    • pp.49-57
    • /
    • 2010
  • In this thesis, We studied on hand-written character recognition, that characters entered into a digital input device and remove noise and separating character elements using preprocessing. And processed character images has done thinning and 3-level wavelet transform for making normalized image and reducing image data. The structural method among the numerical Hangul recognition methods are suitable for recognition of printed or hand-written characters because it is usefull method deal with distortion. so that method are applied to separating elements and analysing texture. The results show that recognition by analysing texture is easily distinguished with respect to consonants. But hand-written characters are tend to decreasing successful recognition rate for the difficulty of extraction process of the starting point, of interconnection of each elements, of mis-recognition from vanishing at the thinning process, and complexity of character combinations. Some characters associated with the separation process is more complicated and sometime impossible to separating elements. However, analysis texture of the proposed character recognition with the exception of the complex handwritten is aware of the character.

A Recognition Algorithm for Handwritten Logic Circuit Diagrams Using Neural Network (신경회로망을 이용한 손으로 작성된 논리회로 도면 인식 알고리듬)

  • Kim, Dug-Ryung;Park, Sung-Han
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.27 no.10
    • /
    • pp.68-77
    • /
    • 1990
  • In this paper, a neural patten recognition method for the automatic circuit diagram reading system is proposed. The proposed procedure to recognize a deformed logic symbols is composed of three stages: feature detection, log mapping, and pattern classification. In the feature detection stage, a modified competitive learning algorithm where each pattern has the inhibition weight as well as the activation weight is developed. The global information of hand-written logic symbols is obtained by the feature detection neural network having both the inhibition and activation weights. The obtained global data is then transformed into a log space by the conformal mapping where according to the Schwartz's theory about the human visual signal process-ing, the degree of rotation and the scale change are mapped into the translation change. Logic symbols are finally classified by a three layer perceptron trained by the error back propagation algorithm. The computer simulation demonstrates that the proposed multistage neural network system can recognize well the deformed patterns of hand-written logic circuit diagrams.

  • PDF

Optimization of Structure-Adaptive Self-Organizing Map Using Genetic Algorithm (유전자 알고리즘을 사용한 구조적응 자기구성 지도의 최적화)

  • 김현돈;조성배
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.3
    • /
    • pp.223-230
    • /
    • 2001
  • Since self-organizing map (SOM) preserves the topology of ordering in input spaces and trains itself by unsupervised algorithm, it is Llsed in many areas. However, SOM has a shortcoming: structure cannot be easily detcrmined without many trials-and-errors. Structure-adaptive self-orgnizing map (SASOM) which can adapt its structure as well as its weights overcome the shortcoming of self-organizing map: SASOM makes use of structure adaptation capability to place the nodes of prototype vectors into the pattern space accurately so as to make the decision boundmies as close to the class boundaries as possible. In this scheme, the initialization of weights of newly adapted nodes is important. This paper proposes a method which optimizes SASOM with genetic algorithm (GA) to determines the weight vector of newly split node. The leanling algorithm is a hybrid of unsupervised learning method and supervised learning method using LVQ algorithm. This proposed method not only shows higher performance than SASOM in terms of recognition rate and variation, but also preserves the topological order of input patterns well. Experiments with 2D pattern space data and handwritten digit database show that the proposed method is promising.

  • PDF