• Title/Summary/Keyword: CJK

Search Result 37, Processing Time 0.024 seconds

Improving Elasticsearch for Chinese, Japanese, and Korean Text Search through Language Detector

  • Kim, Ki-Ju;Cho, Young-Bok
    • Journal of information and communication convergence engineering
    • /
    • v.18 no.1
    • /
    • pp.33-38
    • /
    • 2020
  • Elasticsearch is an open source search and analytics engine that can search petabytes of data in near real time. It is designed as a distributed system horizontally scalable and highly available. It provides RESTful APIs, thereby making it programming-language agnostic. Full text search of multilingual text requires language-specific analyzers and field mappings appropriate for indexing and searching multilingual text. Additionally, a language detector can be used in conjunction with the analyzers to improve the multilingual text search. Elasticsearch provides more than 40 language analysis plugins that can process text and extract language-specific tokens and language detector plugins that can determine the language of the given text. This study investigates three different approaches to index and search Chinese, Japanese, and Korean (CJK) text (single analyzer, multi-fields, and language detector-based), and identifies the advantages of the language detector-based approach compared to the other two.

Effect of local wall thinning on ratcheting behavior of pressurized 90° elbow pipe under reversed bending using finite element analysis

  • Chen, Xiaohui;Chen, Xu
    • Steel and Composite Structures
    • /
    • v.20 no.4
    • /
    • pp.931-950
    • /
    • 2016
  • Ratcheting deformation of pressurized Z2CND18.12N stainless steel $90^{\circ}$ elbow pipe with local wall thinning subjected to constant internal pressure and reversed bending was studied using finite element analysis. Chen-Jiao-Kim (CJK) kinematic hardening model, which was used to simulate ratcheting behavior of pressurized $90^{\circ}$ elbow pipe with local wall thinning at extrados, flanks and intrados, was implemented into finite element software ANSYS. The local wall thinning was located at extrados, flanks and intrados of $90^{\circ}$ elbow pipe, whose geometry was rectangular cross-section. The effect of depth, axial length and circumferential angle of local wall thinning at extrados, flanks and intrados on the ratcheting behaviors of $90^{\circ}$ elbow pipe were studied in this paper. Three-dimensional elastic-plastic analysis with Chen-Jiao-Kim (CJK) kinematic hardening model was carried out to evaluate structural ratcheting behaviors. The results indicated that ratcheting strain was generated mainly along the hoop direction, while axial ratcheting strain was relatively small.

Support on Ideograph Characters Search of Unicode Based Information System (정보 시스템의 유니코드 기반 한자 검색 지원)

  • Yoon, So-Young
    • Journal of the Korean Society for information Management
    • /
    • v.24 no.4
    • /
    • pp.375-391
    • /
    • 2007
  • Unicode Han ideograph character set differed from the our principle of the phonetic value ordering in that it followed the principle of KangXi radical-stroke ordering of the characters. Therefore, information system should support ideograph search on precise analysis of materials which consist of korean character (hangul) and ideograph character (hanja). History Information system has been maintaining Hanja(Chinese Character) to Hangul Dictionary, Terminology Dictionary for composition, borrowing, non-ideographic principles, Variant Forms Dictionary, and Recently discovered Chinese Characters List.

Problems with Chinese Ideographs Search in Unicode and Solutions to Them (유니코드 한자 검색의 문제점 및 개선방안)

  • Lee, Jeong-hyeon
    • Informatization Policy
    • /
    • v.19 no.3
    • /
    • pp.50-63
    • /
    • 2012
  • This thesis is designed to analyze how the search for Chinese ideographs is done in Koreanology-related domestic databases, domestic library databases, domestic academic databases, and overseas library databases, with a view to identifying problems and suggesting solutions to them. The major reasons that impede Chinese ideographs search in Unicode are classified as 'multicode characters', 'simplified characters', and 'variant characters', and three characters are chosen as samples to describe the current practice. Thirteen Koreanology-related databases, five domestic library databases, five domestic academic databases and two overseas library databases are analyzed in terms of Chinese ideographs search. To support search for multicode characters, the open source of the Unicode consortium must be applied. To improve search for simplified and variant characters, a matching table must be standardized and proposed to the Unicode consortium.

  • PDF

정보보호기술(ITU-T SG17 중심) 국제표준화 동향

  • O, Heung-Ryong;Kim, Yeong-Hwa;Yeom, Heung-Yeol
    • Information and Communications Magazine
    • /
    • v.31 no.5
    • /
    • pp.34-38
    • /
    • 2014
  • 정보보호 분야 국제표준화는 정보통신 관점에서 ITU-T SG17, 원천기술 관점에서 ISO/IEC JTC1/SC27, 인터넷 서비스 보안 관점에서 IETF Security Area에서 국제표준을 개발하고 있다. 또한, 아시아 지역 협력을 위해 ASTAP, RAISE Forum, CJK Security WG 등의 협의체를 구성하여 국제표준화기구에 대한 아시아 공동 대응체계를 구축해서 활동하고 있다. 본고에서는 2014년 1월, ITU-T SG17 국제회의 주요 결과와 향후 추진전망에 대해 알아본다.

Analysis on the Inter-Industry Network between the Service Industry in the Korean Capital Region and 10 Industrial Sectors in 20 City-Regions of China-Japan-Korea (한국 수도권 서비스업과 한·중·일 20개 도시지역 내 10개 산업부문과의 산업 간 네트워크 분석)

  • Han, Jihye;Kim, Kabsung;Jung, Hayoung
    • Journal of the Korean Regional Science Association
    • /
    • v.32 no.4
    • /
    • pp.51-73
    • /
    • 2016
  • Considering the intensified ties between Korean service industry and the other industries in China and Japan, this study empirically analyzes the inter-industry network between the service industry in the Korean capital region and 10 industrial sectors in 20 city-regions of China, Japan, and Korea(CJK). Firstly, unit structures are constructed based on the estimated CJK interregional input-output tables to understand the production connection. Moreover, the reorganized unit structures are visualized as networks and examined from various angles. As the results of the analysis, the inter-industry network of the service industry in the Seoul Metropolitan Area is still mostly dependent on domestic industries, especially on manufacturing industry, while it shows the tendency to be weakly connected to the industries in Chinese and Japanese city-regions.

Consideration of CJK Joint Hanja Unicode when is used in AMI/HDB-3 Line Coding (AMI/HDB-3 회선부호화와 한·중·일 한자 유니코드 체계 고찰)

  • Tai, Dong-Zhen;Hong, Wan Pyo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.8 no.7
    • /
    • pp.1011-1015
    • /
    • 2013
  • This paper analyses the violation rate of CJK joint Chines character Unicode to the source code rule. In the paper, Chinese character 150ea in Chinese Unicode which have relatively a higher frequency in use of a character was chosen to study. The frequency rate in use of the 150ea characters is about 50% of the total frequency rate of the Chinese characters. The study was applied the AMI/HDB-3 line coding/scrambling and HDLC protocol, According to the analyses, the number of violated characters were 77ea of 150 ea, frequency rate in use 29%. Therefore, when the violated 77ea characters are replaced to the matched character codes to the source coding rule, the processing rate of the line coder can be improved about 37%.