• Title/Summary/Keyword: E-OCR

Search Result 28, Processing Time 0.023 seconds

Identification of Novel Cupredoxin Homologs Using Overlapped Conserved Residues Based Approach

  • Goyal, Amit;Madan, Bharat;Hwang, Kyu-Suk;Lee, Sun-Gu
    • Journal of Microbiology and Biotechnology
    • /
    • v.25 no.1
    • /
    • pp.127-136
    • /
    • 2015
  • Cupredoxin-like proteins are mainly copper-binding proteins that conserve a typical rigid Greek-key arrangement consisting of an eight-stranded β-sandwich, even though they share as little as 10-15% sequence similarity. The electron transport function of the Cupredoxins is critical for respiration and photosynthesis, and the proteins have therapeutic potential. Despite their crucial biological functions, the identification of the distant Cupredoxin homologs has been a difficult task due to their low sequence identity. In this study, the overlapped conserved residue (OCR) fingerprint for the Cupredoxin superfamily, which consists of conserved residues in three aspects (i.e., the sequence, structure, and intramolecular interaction), was used to detect the novel Cupredoxin homologs in the NCBI non-redundant protein sequence database. The OCR fingerprint could identify 54 potential Cupredoxin sequences, which were validated by scanning them against the conserved Cupredoxin motif near the Cu-binding site. This study also attempted to model the 3D structures and to predict the functions of the identified potential Cupredoxins. This study suggests that the OCR-based approach can be used efficiently to detect novel homologous proteins with low sequence identity, such as Cupredoxins.

Improved Spam Filter via Handling of Text Embedded Image E-mail

  • Youn, Seongwook;Cho, Hyun-Chong
    • Journal of Electrical Engineering and Technology
    • /
    • v.10 no.1
    • /
    • pp.401-407
    • /
    • 2015
  • The increase of image spam, a kind of spam in which the text message is embedded into attached image to defeat spam filtering technique, is a major problem of the current e-mail system. For nearly a decade, content based filtering using text classification or machine learning has been a major trend of anti-spam filtering system. Recently, spammers try to defeat anti-spam filter by many techniques. Text embedding into attached image is one of them. We proposed an ontology spam filters. However, the proposed system handles only text e-mail and the percentage of attached images is increasing sharply. The contribution of the paper is that we add image e-mail handling capability into the anti-spam filtering system keeping the advantages of the previous text based spam e-mail filtering system. Also, the proposed system gives a low false negative value, which means that user's valuable e-mail is rarely regarded as a spam e-mail.

A Korean CAPTCHA Study: Defeating OCRs In a New CAPTCHA Context By Using Korean Syllables

  • Yang, Tae-Cheon;Ince, Ibrahim Furkan;Salman, Yucel Datu
    • International Journal of Contents
    • /
    • v.5 no.3
    • /
    • pp.50-56
    • /
    • 2009
  • Internet is being used for several activities by a great range of users. These activities include communication, e-commerce, education, and entertainment. Users are required to register regarding website in order to enroll web activities. However, registration can be done by automated hacking software. That software make false enrollments which occupy the resources of the website by reducing the performance and efficiency of servers, even stop the entire web service. It is crucial for the websites to have a system which has the capability of differing human users and computer programs in reading images of text. Completely Automated Public Turing Test to Tell Computers and Human Apart (CAPTCHA) is such a defense system against Optical Character Recognition (OCR) software. OCR can be defined as software which work for defeating CAPTCHA images and make countless number of registrations on the websites. This study proposes a new CAPTCHA context that is Korean CAPTCHA by means of the method which is splitting CAPTCHA images into several parts with random rotation values, and drawing random lines on a grid background by using Korean characters only. Lines are in the same color with the CAPTCHA text and they provide a distortion of image with grid background. Experimental results show that Korean CAPTCHA is a more secure and effective CAPTCHA type for Korean users rather than current CAPTCHA types due to the structure of Korean letters and the algorithm we are using: rotation and splitting. In this paper, the algorithm of our method is introduced in detail.

Application of Fault Location Method to Improve Protect-ability for Distributed Generations

  • Jang Sung-Il;Lee Duck-Su;Choi Jung-Hwan;Kang Yong-Cheol;Kang Sang-Hee;Kim Kwang-Ho;Park Yong-Up
    • Journal of Electrical Engineering and Technology
    • /
    • v.1 no.2
    • /
    • pp.137-144
    • /
    • 2006
  • This paper proposes novel protection schemes for grid-connected distributed generation (DG) units using the fault location algorithm. The grid-connected DG would be influenced by abnormal distribution line conditions. Identification of the fault location for the distribution lines at the relaying point of DG helps solve the problems of the protection relays for DG. The proposed scheme first identifies fault locations using currents and voltages measured at DG and source impedance of distribution networks. Then the actual faulted feeder is identified, applying time-current characteristic curves (TCC) of overcurrent relay (OCR). The method considering the fault location and TCC of OCR might improve the performance of the conventional relays for DG. Test results show that the method prevents the superfluous operations of protection devices by discriminating the faulted feeder, whether it is a distribution line where DG is integrated or out of the line emanated from the substation to which the DGs are connected.

Design and Implementation of the Search Inside Middleware System by using XML (XML 기반의 본문검색 미들웨어 시스템 설계 및 구현)

  • Kim, Hyo-Nam
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2011.01a
    • /
    • pp.229-232
    • /
    • 2011
  • 최근 스마트 디바이스 기반의 다양한 콘텐츠 제작 공급에 대한 새로운 마켓 시장 형성과 태블릿 디바이스 기반의 전자책 시장 규모의 확대에 따른 새로운 유형의 디지털 콘텐츠 시장이 형성되고 있다. 디지털 미디어는 정보환경 범위를 종이의 범위에서 벗어나 매체와 연계한 다양한 형태로의 변화뿐만 아니라 서술 방식과 소통방식의 방법에도 변화를 발생하였다. 그리고 지면에서 국한된 아날로그 매체의 물리적, 공간적, 시간적인 한계를 뛰어넘어 다양한 콘텐츠를 손쉽게 접근할 수 있게 만들었다. 이런 환경에서 본문검색 서비스는 아날로그와 디지털 매체의 상호 공존관계를 형성할 수 있다. 본 논문에서는 그림종이문서를 본문검색이 가능한 이미지형태의 디지털문서로 변환해주는 디지털라이징 시스템으로 문자위치정보를 포함하는 광학문자인식(OCR)기능과 인식된 문자의 오류를 수정하는 에디터기능을 통해 추출된 내용을 XML형태로 제공하는 본문검색 시스템을 제안한다. 특히, 문자인식 후처리 공정에서 복수의 관학문자인식(OCR)엔진을 통해 결과 비교와 문자위치 정보 확인 및 편집, 맞춤법 검사 등의 특화된 기능 등은 본 논문에서 가지는 강점으로 디지털문서 구축에 소요되는 시간과 비용을 혁신적으로 절감시켜준다.

  • PDF

Effect of oxygen distribution for hot spot and carbon deposition minimization in a methane autothermal reforming reactor

  • Lee, Shin-Ku;Bae, Joong-Myeon;Kim, Yong-Min;Park, Joong-Uen;Lim, Sung-Kwang
    • Proceedings of the KSME Conference
    • /
    • 2008.11b
    • /
    • pp.1996-2000
    • /
    • 2008
  • In autothermal reforming reaction, oxygen to carbon ratio (OCR) and steam to carbon ratio (SCR) are significant factors, which control temperature and carbon deposition into the reactor. The OCR is more sensitive than the SCR to affect the temperature distribution and reforming efficiency. In conventional operation, hydrocarbon fuel, steam, and oxygen was homogeneously mixed and injected into the reactor in order to get hydrogen-rich gas. The temperature was abruptly raised due to fast oxidation reaction in the former part of the reactor. Deactivation of packed catalysts can be accelerated there. In the present study, therefore, the effect of the oxygen distribution is introduced and investigated to suppress the carbon deposition and to maintain the reactor in the mild operating temperature (e.g., $700{\sim}800^{\circ}C$). In order to investigate the effect numerically, the following models are adopted; heterogeneous reaction model and two-medium model for heat balance.

  • PDF

The way to make training data for deep learning model to recognize keywords in product catalog image at E-commerce (온라인 쇼핑몰에서 상품 설명 이미지 내의 키워드 인식을 위한 딥러닝 훈련 데이터 자동 생성 방안)

  • Kim, Kitae;Oh, Wonseok;Lim, Geunwon;Cha, Eunwoo;Shin, Minyoung;Kim, Jongwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.1-23
    • /
    • 2018
  • From the 21st century, various high-quality services have come up with the growth of the internet or 'Information and Communication Technologies'. Especially, the scale of E-commerce industry in which Amazon and E-bay are standing out is exploding in a large way. As E-commerce grows, Customers could get what they want to buy easily while comparing various products because more products have been registered at online shopping malls. However, a problem has arisen with the growth of E-commerce. As too many products have been registered, it has become difficult for customers to search what they really need in the flood of products. When customers search for desired products with a generalized keyword, too many products have come out as a result. On the contrary, few products have been searched if customers type in details of products because concrete product-attributes have been registered rarely. In this situation, recognizing texts in images automatically with a machine can be a solution. Because bulk of product details are written in catalogs as image format, most of product information are not searched with text inputs in the current text-based searching system. It means if information in images can be converted to text format, customers can search products with product-details, which make them shop more conveniently. There are various existing OCR(Optical Character Recognition) programs which can recognize texts in images. But existing OCR programs are hard to be applied to catalog because they have problems in recognizing texts in certain circumstances, like texts are not big enough or fonts are not consistent. Therefore, this research suggests the way to recognize keywords in catalog with the Deep Learning algorithm which is state of the art in image-recognition area from 2010s. Single Shot Multibox Detector(SSD), which is a credited model for object-detection performance, can be used with structures re-designed to take into account the difference of text from object. But there is an issue that SSD model needs a lot of labeled-train data to be trained, because of the characteristic of deep learning algorithms, that it should be trained by supervised-learning. To collect data, we can try labelling location and classification information to texts in catalog manually. But if data are collected manually, many problems would come up. Some keywords would be missed because human can make mistakes while labelling train data. And it becomes too time-consuming to collect train data considering the scale of data needed or costly if a lot of workers are hired to shorten the time. Furthermore, if some specific keywords are needed to be trained, searching images that have the words would be difficult, as well. To solve the data issue, this research developed a program which create train data automatically. This program can make images which have various keywords and pictures like catalog and save location-information of keywords at the same time. With this program, not only data can be collected efficiently, but also the performance of SSD model becomes better. The SSD model recorded 81.99% of recognition rate with 20,000 data created by the program. Moreover, this research had an efficiency test of SSD model according to data differences to analyze what feature of data exert influence upon the performance of recognizing texts in images. As a result, it is figured out that the number of labeled keywords, the addition of overlapped keyword label, the existence of keywords that is not labeled, the spaces among keywords and the differences of background images are related to the performance of SSD model. This test can lead performance improvement of SSD model or other text-recognizing machine based on deep learning algorithm with high-quality data. SSD model which is re-designed to recognize texts in images and the program developed for creating train data are expected to contribute to improvement of searching system in E-commerce. Suppliers can put less time to register keywords for products and customers can search products with product-details which is written on the catalog.

Development of Korean-to-English and English-to-Korean Mobile Translator for Smartphone (스마트폰용 영한, 한영 모바일 번역기 개발)

  • Yuh, Sang-Hwa;Chae, Heung-Seok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.3
    • /
    • pp.229-236
    • /
    • 2011
  • In this paper we present light weighted English-to-Korean and Korean-to-English mobile translators on smart phones. For natural translation and higher translation quality, translation engines are hybridized with Translation Memory (TM) and Rule-based translation engine. In order to maximize the usability of the system, we combined an Optical Character Recognition (OCR) engine and Text-to-Speech (TTS) engine as a Front-End and Back-end of the mobile translators. With the BLEU and NIST evaluation metrics, the experimental results show our E-K and K-E mobile translation equality reach 72.4% and 77.7% of Google translators, respectively. This shows the quality of our mobile translators almost reaches the that of server-based machine translation to show its commercial usefulness.

Small Strain Measurements of Sands in Plane Strain Compression (평면 변형률 압축상태에서의 모래의 미소 변형률 측정)

  • 박춘식;장정욱
    • Geotechnical Engineering
    • /
    • v.10 no.1
    • /
    • pp.27-46
    • /
    • 1994
  • It has been demonstrated in plane strain compression tests performed on dense Toyoura sand and Silver Leighton Buzzard sand, that the newly developed instrumentation for small strain measurements was capable of measuring the altering stiffness of sands for a wide range of shear strain from ($10^{-6}$to $10^{-2}$. It was found that for the range of shear strain($\gamma$) from $10^{-5}$ to those at peak, the Rowe's stressiilatancy relation seemed to be a good approximation for Toyoura sand and Silver Leighton Buzzard sand. However, the value of K and Poisson's ratio(at elastic range:${\nu}_{psc}^e$) varied with sand types. It was also found that the value of ${\nu}_{psc}^e$ and stress -dilatancy relation was irrespective of overconsolidation ratio(OCR).

  • PDF

Prediction of Residual Settlement of Ground Improved by Vertical Drains Using the Elasto-Viscous Consolidation Model - Application for Field Condition - (탄-점성 압밀이론에 의한 버티칼 드레인 타설지반의 잔류침하 예측 (II) - 현장조건에의 적용 -)

  • Baek, Won-Jin;Lee, Kang-Il;Kim, Woo-Jin
    • Journal of the Korean Geotechnical Society
    • /
    • v.23 no.6
    • /
    • pp.85-95
    • /
    • 2007
  • In this study, in order to propose the prediction method of the residual settlement of clayey ground improved by vertical drains, a series of numerical analyses for a model ground were carried out using the elasto-viscous consolidation model. And the effects of ground improvement conditions of the ratio of effective radii $(r_e/r_w)$, consolidation pressure $({\Delta}p)$ on normally consolidated state, and the OCR (overconsolidation ratio) on overconsolidated state to reduce the residual settlement in three-dimensional consolidation by vertical drains were investigated by performing a series of numerical analyses. Furthermore, based on the results of a series of numerical analyses for the model ground, the predicting method of the residual settlement of clayey ground with vertical drains and the determination method of the value of OCR required to control the residual settlement within an acceptable value are proposed.