• Title/Summary/Keyword: 정형로그

Search Result 20, Processing Time 0.029 seconds

Key Expressions in Editorial Texts: Determining the Unithood and Termhood of Word Sequences based on a 2009 Newspaper Corpus (신문 사설의 특징적 표현들에 대한 연구)

  • Kim, Hye-Young;Kang, Beom-Mo
    • Annual Conference on Human and Language Technology
    • /
    • 2012.10a
    • /
    • pp.185-190
    • /
    • 2012
  • 본 논문은 동아, 조선, 중앙, 한겨레 신문의 2009년 신문 사설의 제목과 본문에서 나타나는 n-gram에 대한 논의이다. 구체적으로 자주 출현하는 단어들의 연속 단위 3~6개의 형태소를 추출하여 신문 사설에서 나타난 고빈도 형태소 연속체를 살펴본다. 또한 이들을 기사문에서 추출한 패턴과 로그공산비로 비교하여 신문 사설에서 더 특징적인 의미로 사용되는 어휘들을 살펴본다. 그 결과, 사설 본문에서는 3-gram은 '아야 한다'. 4-gram은 'ㄹ 것이다', 5-gram은 'ㄹ 수밖에 없다', 6-gram은 '아야 할 것이다' 등이, 사설 제목은 '것인가, 안 된다'가 하나의 용어처럼 사용되고 있었다. 이러한 형태소 연속체를 살펴봄으로써, 신문사설의 텍스트 특징과 정형적인 표현에 대해서 살펴볼 수 있다.

  • PDF

A Study on Distributed Processing of Big Data and User Authentication for Human-friendly Robot Service on Smartphone (인간 친화적 로봇 서비스를 위한 대용량 분산 처리 기술 및 사용자 인증에 관한 연구)

  • Choi, Okkyung;Jung, Wooyeol;Lee, Bong Gyou;Moon, Seungbin
    • Journal of Internet Computing and Services
    • /
    • v.15 no.1
    • /
    • pp.55-61
    • /
    • 2014
  • Various human-friendly robot services have been developed and mobile cloud computing is a real time computing service that allows users to rent IT resources what they want over the internet and has become the new-generation computing paradigm of information society. The enterprises and nations are actively underway of the business process using mobile cloud computing and they are aware of need for implementing mobile cloud computing to their business practice, but it has some week points such as authentication services and distributed processing technologies of big data. Sometimes it is difficult to clarify the objective of cloud computing service. In this study, the vulnerability of authentication services on mobile cloud computing is analyzed and mobile cloud computing model is constructed for efficient and safe business process. We will also be able to study how to process and analyze unstructured data in parallel to this model, so that in the future, providing customized information for individuals may be possible using unstructured data.

Analysis of Factors for Korean Women's Cancer Screening through Hadoop-Based Public Medical Information Big Data Analysis (Hadoop기반의 공개의료정보 빅 데이터 분석을 통한 한국여성암 검진 요인분석 서비스)

  • Park, Min-hee;Cho, Young-bok;Kim, So Young;Park, Jong-bae;Park, Jong-hyock
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.10
    • /
    • pp.1277-1286
    • /
    • 2018
  • In this paper, we provide flexible scalability of computing resources in cloud environment and Apache Hadoop based cloud environment for analysis of public medical information big data. In fact, it includes the ability to quickly and flexibly extend storage, memory, and other resources in a situation where log data accumulates or grows over time. In addition, when real-time analysis of accumulated unstructured log data is required, the system adopts Hadoop-based analysis module to overcome the processing limit of existing analysis tools. Therefore, it provides a function to perform parallel distributed processing of a large amount of log data quickly and reliably. Perform frequency analysis and chi-square test for big data analysis. In addition, multivariate logistic regression analysis of significance level 0.05 and multivariate logistic regression analysis of meaningful variables (p<0.05) were performed. Multivariate logistic regression analysis was performed for each model 3.

Investigations on Techniques and Applications of Text Analytics (텍스트 분석 기술 및 활용 동향)

  • Kim, Namgyu;Lee, Donghoon;Choi, Hochang;Wong, William Xiu Shun
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.2
    • /
    • pp.471-492
    • /
    • 2017
  • The demand and interest in big data analytics are increasing rapidly. The concepts around big data include not only existing structured data, but also various kinds of unstructured data such as text, images, videos, and logs. Among the various types of unstructured data, text data have gained particular attention because it is the most representative method to describe and deliver information. Text analysis is generally performed in the following order: document collection, parsing and filtering, structuring, frequency analysis, and similarity analysis. The results of the analysis can be displayed through word cloud, word network, topic modeling, document classification, and semantic analysis. Notably, there is an increasing demand to identify trending topics from the rapidly increasing text data generated through various social media. Thus, research on and applications of topic modeling have been actively carried out in various fields since topic modeling is able to extract the core topics from a huge amount of unstructured text documents and provide the document groups for each different topic. In this paper, we review the major techniques and research trends of text analysis. Further, we also introduce some cases of applications that solve the problems in various fields by using topic modeling.

Ligament Injuries of Knee in the Recreational Skiers (스키에 의한 슬관절 인대 손상)

  • Lee Dong Chul;Ko Jin Hyeok;Kim Dong Han
    • Journal of Korean Orthopaedic Sports Medicine
    • /
    • v.2 no.1
    • /
    • pp.37-43
    • /
    • 2003
  • Purpose: This study is to analyse the injury patterns of knee ligament and the factors influencing ligament injuries of knee, and to evaluate the changes of knee function and activity after ski injury. Materials and Methods: Thirty cases of ligament injuries of knee were studied with a questionaire, stress radiographs, magnetic resolution imaging, and physical examination. Mean age was 28.6 years old and mean follow up was 29.6 months. Clinical evaluation of Lysholm knee scoring scale was used for knee function and Tegner activity score was used for activity change after ski injury. Results: The anterior cruciate ligament (ACL) injury accompanied with medial collateral ligament(MCL) injury was most in 11 cases and isolated MCL injury was the next in 9 cases. The common types of injury mechanism were Phantom foot phenomenon (13 cases, 43$\%$) and valgus external rotation injury (12 cases, 40$\%$), which constitute 83$\%$ of all case. At the last follow up, the mean Lysholm score was 93.4 and mean Tegner activity score was 4.2. The reduced Tegner activity score after injury was 1.9. Among several injury groups, the evaluation of knee function and activity was best in the isolated MCL injury group and worst in the ACL injury accompanied with MCL injury group. The factors to influence ski injury were participation to ski class, release of binding, and skiing long time more than 2 hours. Conclusions: Lysholm score at last follow up revealed good grading, but sports activity after ski injury was reduced when compared with pre-injury state. It seems to need a active, systemic sports rehabilitation program after sports injury.

  • PDF

A Study on the Data Collection Methods based Hadoop Distributed Environment (하둡 분산 환경 기반의 데이터 수집 기법 연구)

  • Jin, Go-Whan
    • Journal of the Korea Convergence Society
    • /
    • v.7 no.5
    • /
    • pp.1-6
    • /
    • 2016
  • Many studies have been carried out for the development of big data utilization and analysis technology recently. There is a tendency that government agencies and companies to introduce a Hadoop of a processing platform for analyzing big data is increasing gradually. Increased interest with respect to the processing and analysis of these big data collection technology of data has become a major issue in parallel to it. However, study of the collection technology as compared to the study of data analysis techniques, it is insignificant situation. Therefore, in this paper, to build on the Hadoop cluster is a big data analysis platform, through the Apache sqoop, stylized from relational databases, to collect the data. In addition, to provide a sensor through the Apache flume, a system to collect on the basis of the data file of the Web application, the non-structured data such as log files to stream. The collection of data through these convergence would be able to utilize as a basic material of big data analysis.

A Study on Big Data Anti-Money Laundering Systems Design through A Bank's Case Analysis (A 은행 사례 분석을 통한 빅데이터 기반 자금세탁방지 시스템 설계)

  • Kim, Sang-Wan;Hahm, Yu-Kun
    • The Journal of Bigdata
    • /
    • v.1 no.1
    • /
    • pp.85-94
    • /
    • 2016
  • Traditional Anti-Money Laundering (AML) software applications monitor bank customer transactions on a daily basis using customer historical information and account profile data to provide a "whole picture" to bank management. With the advent of Big Data, these applications could be benefited from size, variety, and speed of unstructured data, which have not been used in AML applications before. This study analyses the weaknesses of a bank's current AML systems and proposes an AML systems taking advantage of Big Data. For example, early warning of AML risk can be improved by exposing identities and uncovering hidden relationships through predictive and entity analytics on real-time and outside data such as SNS data.

  • PDF

A Stochastic Work-Handover Relationship Model in Workflow-supported Social Networks (워크플로우 기반 소셜 네트워크의 확률적 업무전달 관계 모델)

  • Ahn, Hyun;Kim, Kwanghoon
    • Journal of Internet Computing and Services
    • /
    • v.16 no.5
    • /
    • pp.59-66
    • /
    • 2015
  • A stochastic modeling approach as a mathematical method for workflow intelligence is widely used for analyzing and simulating workflow models in the literature. In particular, as a resource-centric modeling approach, this paper proposes a stochastic model to represent work-handover relationships between performers in a workflow-supported social network. Calculating probabilities for the work-handover relationships are determined by two types of probabilities. One is the work-transition probability between activities, and the other is the task assignment probability between activities and performers. In this paper, we describe formal definitions of stochastic workflow models and stochastic work-handover relationship models, as well. Then, we propose an algorithm for extracting a stochastic work-handover relationship model from a stochastic workflow model. As a consequence, the proposed model ought to be useful in performing resource-centric workflow simulations and model-log comparison analyses.

A Tombstone Filtered LSM-Tree for Stable Performance of KVS (키밸류 저장소 성능 제어를 위한 삭제 키 분리 LSM-Tree)

  • Lee, Eunji
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.22 no.4
    • /
    • pp.17-22
    • /
    • 2022
  • With the spread of web services, data types are becoming more diversified. In addition to the form of storing data such as images, videos, and texts, the number and form of properties and metadata expressing the data are different for each data. In order to efficiently process such unstructured data, a key-value store is widely used for state-of-the-art applications. LSM-Tree (Log Structured Merge Tree) is the core data structure of various commercial key-value stores. LSM-Tree is optimized to provide high performance for small writes by recording all write and delete operations in a log manner. However, there is a problem in that the delay time and processing speed of user requests are lowered as batches of deletion operations for expired data are inserted into the LSM-Tree as special key-value data. This paper presents a Filtered LSM-Tree (FLSM-Tree) that solves the above problem by separating the deleted key from the main tree structure while maintaining all the advantages of the existing LSM-Tree. The proposed method is implemented in LevelDB, a commercial key-value store and it shows that the read performance is improved by up to 47% in performance evaluation.

The Metabolic Effects of FGF21: From Physiology to Pharmacology (생리, 약학적 관점에서 fibroblast growth factor 21 (FGF21)의 대사 효과 고찰)

  • Song, Parkyong
    • Journal of Life Science
    • /
    • v.30 no.7
    • /
    • pp.640-650
    • /
    • 2020
  • Fibroblast growth factor 21 (FGF21) is an atypical member of the FGF protein family which is highly synthesized in the liver, pancreas, and adipose tissue. Depending on the expression tissue, FGF21 uses endo- or paracrine features to regulate several metabolic pathways including glucose metabolism and energy homeostasis. Different physiologically stressful conditions such as starvation, a ketogenic diet, extreme cold, and mitochondrial dysfunction are known to induce FGF21 synthesis in various tissues to exert either adaptive or defensive mechanisms. More specifically, peroxisome proliferator-activated receptor gamma and peroxisome proliferator-activated receptor alpha control FGF21 expression in adipose tissue and liver, respectively. In addition, the pharmacologic administration of FGF21 has been reported to decrease the body weight and improve the insulin sensitivity and lipoprotein profiles of obese mice and type 2 diabetes patients meaning that FGF21 has attracted huge interest as a therapeutic agent for type 2 diabetes, obesity, and non-alcoholic fatty liver disease. However, understanding FGF21 remains complicated due to the paradoxical condition of its tissue-dependent expression. For example, nutrient deprivation largely increases hepatic FGF21 levels whereas adipose tissue-derived FGF21 is increased under feeding condition. This review discusses the issues of interest that have arisen from existing publications, including the tissue-specific function of FGF21 and its action mechanism. We also summarize the current stage of a clinical trial using several FGF21 analogs.