• Title/Summary/Keyword: S&T information service

Search Result 816, Processing Time 0.027 seconds

A Study on the Dietary Behaviors of Female Baby Boomers and the Needs for Future Perspectives of Dietary Life (여성 베이비부머들의 식생활 태도와 미래 식생활 요구도 조사)

  • Nam, Haewon;Myung, Choonok;Park, Youngsim
    • The Korean Journal of Food And Nutrition
    • /
    • v.26 no.4
    • /
    • pp.895-908
    • /
    • 2013
  • The purpose of this study is to examine female baby boomers' dietary habits and their attitudes together with their needs for future perspectives of dietary life. Our aim is to use these findings as a basic data when forecasting for food-related industries or policy making. A survey is being carried out for a total of 358 female baby boomers and analyzed by SPSS 12.0. The following is a summary of this study. The average age is 52.6 years old, most of them graduated from highschool (63.1%) and had a nuclear type of family (76.1%). Only 39.0% is composed of housewives, others had either full-time or part-time jobs. Self-assessment of stress is not so high and only 8.1% are dissatisfied with their lives. 38.2% are either overweight or obese in terms of BMI, and most of them are non-smokers (97.2%) or non-drinkers (63.0%). Their mean dietary habit scores are $70.6{\pm}11.8$, and the scores show significant relations with their education levels (p<0.01), monthly income (p<0.01), life satisfaction rates (p<0.001), stress levels (p<0.001), smoking habits (p<0.05), drinking habits (p<0.05), regular exercises (p<0.001) and regular health check-ups (p<0.05). The rate of skipping breakfast, lunch and dinner are 18.2%, 1.1%, 5.2% respectively. The main reason for skipping breakfast is the 'lack of time'. With regards to the frequency of grocery shopping, almost half of the subjects (55.7%) said '1~2 times per week' and bought mainly raw food sources such as vegetables, fruits, and meats. The majority of the subjects (91.3%) report that they cooked meals at homes, and took about 1 hour of time. The subjects also point out that cooking was a bothering task, and only 46.4% would prepare meals at home, while others would rather eat out or eat convenience foods. The main reasons for not wanting meal services in the elderly welfare facility are because they didn't want to live such places (48.4%) and the meals are tasteless (31.3%). As for delivery meal services, 60.1% are aware of it, and 39.9% would consider using it in the future. Factors to be considered when using the delivery meal service are sanitation (43.7%), nutrition (28.7%), taste (18.4%), price (6.3%), and brand name (2.9%). This study is expected to be used as useful information when developing food-related strategies for baby boomers in the future.

The Financial Stability of Before-and-after Retirement -Expectation and Support for the Quality of the Elderly Life- (은퇴 전후 세대 재정안정성 -노년기 삶의 수준에 대한 기대와 지원-)

  • Kim, Eunyoung
    • Korean Journal of Social Welfare
    • /
    • v.66 no.1
    • /
    • pp.61-85
    • /
    • 2014
  • Lately, as the problem of the aged poverty becomes a big social issue, this paper studies the problem of financial stability with respect to the income and consumption of before-and-after retirement generations. After dividing the data in Korean Longitudinal Study of Ageing(KLoSA) into retired and non-retired groups, this study compares the difference among ageing groups by cross analysis and t-test. First, the result tells that the total personal income of retired group is lower than the one of non-retired group. Second, the public pension income benefit ratio of retired group is only 30% of it's total income, and the amount of public pension appears to be 40% of the total pension income. The benefit ratio of the personal pension income is low as about 1% in both groups, The private transfer income of the retired group takes relatively large portion in its total personal income. Third, as people gets older, financial stability gets worse because consumption does not decrease as much as the income decreases. Fourth, it is turned out that the expectation of old life supporting from nation is low in both groups. Fifth, the factors that affects the income of the public pension in the retired group are gender, age, education, and health status, when compared with the factors to the personal total income of the non-retired group. In terms of policy, this paper emphasizes the needs of the intensification of the public pension and the support for the revitalization of the personal pension.

  • PDF

A Study of Nurses' Knowledge, Attitude on the Nurses' Law and Nurses' Perception on the Causes, Coping Patterns with the Nursing Accidents (간호사의 간호관련법에 대한 지식$\cdot$태도 및 간호사고에 대한 원인$\cdot$대처양상에 관한 연구)

  • Mun Heui Ja;Lee Mi Aie
    • Journal of Korean Public Health Nursing
    • /
    • v.13 no.1
    • /
    • pp.41-62
    • /
    • 1999
  • Recently the request of the patients to participate in the medical courses has been expanding due to elevated sense of right on the people's health. merchandised medical treatment by mass supply, human right declaration of the patients, generalized medical informations by the mass media and the change of human relation between the medical personnels and the patients. Under these phenomena the accident by the nurses have been increasing by the area of the nurses having been expanded and their independent roles having been increased. Such nursing accidents are the important subject which the professional occupation of the nurses has been facing but legal protective capability of the nurses has been very weak. Therefore this study has examined the degree of the experience of the nursing accident that happens in the clinical nursing scenes in the general hospital to provide the basic materials for the protection and the counter measures of the nursing accidents. The following is the conclusion based by the above examination. 1) The general characters of the subjects of this study is that they are mostly single in their twenties and graduate from nursing college. Their total clinical career is above 5 years$(44.8\%)$ and their current clinical parts' career is between 1-3 years$(40.1\%)$. So these facts suggest that most hospitals has taken the working rotation policy on nurses. 2) The level of nurses' knowledge on the nursing law is accurate partially but isn't it patially. So it is suggested that nurses need the accurate information and education about the nursing law. But the nurses' attitude is very approved of the establishment of a unilateral nursing law. 3) The relation between the demographic characters of the subjects and their attitudes on the nursing law shows that there is no significant differences except the relation between the attitude 6(the sufficient level of education on nursing law in formal education course) and age. total clinical career. 4) The perception of the nurses shows that the cause of the nursing accident has been due to the heavy work$(78.2\%)$. short of professional knowledge and skill$(60.2\%)$, discordance with Doctors. patients and patients' families. They report the accident to the head nurse first$(81.8\%)$ and within 30 minute$(75.1\%)$. The hour of nursing accident frequently happened is regardless of service hour with $49.4\%$ in response rate. the highest rate. and the nursing accident happens in the night more than the daytime. Even though most nurses think that they are themselves responsible for nursing accident. it is found that the chief cause of the nursing accident is due to the nurses' heavy work$(78.2\%)$. So the causes of nursing accidents is analysed. it may be suggested that the endeavor of hospital and nursing organizations to decrease nursing accidents is very important. 5) The coping patterns of patients with nursing accidents are mostly active attitude such as a violent words$(69\%)$. sue or accusation$(36.4\%)$, monetary compensation $(35.6\%)$ except a understanding cases$(38.7\%)$. But the coping patterns of hospitals with nursing accidents are mostly to investigate the accurate cause.

  • PDF

<New material> A Historical Study on the Memorandum Record of 『Gyeongja(庚子)·Daetongryeok(大統曆)』 (<신자료> 『경자년(庚子年) 대통력(大統曆)』에 관한 고증 연구 - 비망 기록을 중심으로 -)

  • RO Seungsuk
    • Korean Journal of Heritage: History & Science
    • /
    • v.56 no.2
    • /
    • pp.12-26
    • /
    • 2023
  • Recently, 『Gyeongja(庚子)/Daetongryeok(大統曆)』(1600), a memorandum record of Yu Seong-ryong during the reign of King Seonjo(宣祖) of the Joseon Dynasty, was returned to Korea from Japan, and about 4,000 letters in cursive characters have been interpreted by Ro Seung-suk. The contents for 203 days written in the margin of 『Gyeongja(庚子)/Daetongryeok(大統曆)』 are mostly new, and are meaningful in understanding Yu's life and social association circumstances. There are daily routines of each day, contemporary figures, diseases and oriental medicine prescriptions. In particular, the combat record of Admiral Yi Sun-shin in 83 letters on the cover is very important to understand the situation in those days. It seems that the reason for writing the combat situations a year and a few months after Admiral Yi died in war was to honor his distinguished military service for a long time by King Seonjo's order according to the public opinion of the royal court. The record can be classified into two categories. First, Admiral Yi sighed when he heard about Yu's dismissal from the office in Gogeumdo, and was always alert with clear water on the boat after the Battle of Waekyo Castle. Second, he was killed by bullets shot by the enemy while directly encouraging battle, not listening to his men who tried to dissuade him from leading the naval battle at Noryang. This only contained contents of devoting his life desperately, which is an important proof of the theory of his death in war. It also contains nine methods for making liquor and another method that wasn't known to the public, and seems to include popular alcohol brewing methods or newly devised ones. In addition, there is a detail that Heo Jun, the author of 『Donguibogam』, introduced medicine to Yu, along with being unable to attend ancestral rites and relieving the poor written in red. There are also stories about Kang Hang(姜沆) returning to Korea after being captured by Japan and Lee Deok-hong(李德弘)'s son, who introduced Gugapseondo(龜甲船圖, the first picture of the Turtle Ship in Korea) to King Seonjo. In the light of the above, 『Gyeongja(庚子)/Daetongryeok(大統曆)』is an important historical record to empirically research not only figures related to Yu but also the circumstances of those days since it contains new facts that are not in the existing literature. In particular, the big accomplishment of this study is to correct the mistakenly known theory of Admiral Yi's suicide and to find out the new fact that Heo Jun provided medical information. In this respect, this book is expected to serve as a testament to the future study of the history and characters related to Yu in the mid-Joseon period.

A Research on Effect of Corporate's Competitive Advantage to the R&D Investment in Small and Medium Enterprise (중소기업 유형별 연구개발투자의 영향요인에 관한 실증연구)

  • Choi, Su-Heyong;Choi, Chul-An
    • Management & Information Systems Review
    • /
    • v.33 no.1
    • /
    • pp.191-217
    • /
    • 2014
  • The Purpose of this study is to find the effect factors of R&D investment in SMEs which plays an important role in the national economy, and the differences of the effect factors by the type of SMEs. The subject of this study is about 3,400 SMEs mentioned in "The survey of technical statistics on SMEs in 2007" by Korea Federation of Small and Medium Business. The effect factors are related with the size of business, the infrastructure of R&D and the activities of R&D which have been studied by many researchers. The methods of analysis are regression analysis, moderating effect analysis and the software package used is SPSS 12.0. The results of the study are as fallow. First, it was found that unlike in previous studies which show the effect of the elements of business's size, research infrastructure, research activities on R&D investment, one element alone can't be considered for meaningful result but the various elements have effect on R&D investment at the same time. In other words, the number of employees and the sales as the elements of business's size, the ratio of researchers, the technical ability, the ratio of equipment possession and the intellectual properties as the elements of R&D infrastructure, the activity of ideas and joint research as the elements of R&D activities have positive(+) effect, whereas the participation of CEO in the activity of R&D as the elements of R&D activities activity has negative(-) one. The number of employees, the ratio of researchers, and the sales had relatively high influence whereas equipment possession, technical ability, intellectual properties, the participation of CEO in the research, the activity of idea, joint research had relatively low influence. Next, it was also found that there are differences of the effect factors over the types of SMEs. SMEs were classified into 19 types by eight criteria such as start-ups and existing business by business age; small business and medium business by size; manufacturing business and service business by product type;independent business and subcontractor business by dealing type; businesses in the entering, growing, maturing and restructuring stage by growth stage; businesses with low, medium and high technology by technological level; pioneering business and non-pioneering business by industrial type; and businesses with state-of-the-art technology and non-advanced business by the level of business activities. The meaning of this study lies in the fact that it found the various effect factors should be considered at the same time when conducting study on SMEs' R&D investment, and the differences by the type should be acknowledged. This study surpassed the limitations of the previous studies which focused on a couple of factors and types. This study result can also be considered for other studies on achievement, organization, marketing and others. Moreover, it shows that a differential policy by business type is needed when formulating SME policy.

  • PDF

A Hybrid Recommender System based on Collaborative Filtering with Selective Use of Overall and Multicriteria Ratings (종합 평점과 다기준 평점을 선택적으로 활용하는 협업필터링 기반 하이브리드 추천 시스템)

  • Ku, Min Jung;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.85-109
    • /
    • 2018
  • Recommender system recommends the items expected to be purchased by a customer in the future according to his or her previous purchase behaviors. It has been served as a tool for realizing one-to-one personalization for an e-commerce service company. Traditional recommender systems, especially the recommender systems based on collaborative filtering (CF), which is the most popular recommendation algorithm in both academy and industry, are designed to generate the items list for recommendation by using 'overall rating' - a single criterion. However, it has critical limitations in understanding the customers' preferences in detail. Recently, to mitigate these limitations, some leading e-commerce companies have begun to get feedback from their customers in a form of 'multicritera ratings'. Multicriteria ratings enable the companies to understand their customers' preferences from the multidimensional viewpoints. Moreover, it is easy to handle and analyze the multidimensional ratings because they are quantitative. But, the recommendation using multicritera ratings also has limitation that it may omit detail information on a user's preference because it only considers three-to-five predetermined criteria in most cases. Under this background, this study proposes a novel hybrid recommendation system, which selectively uses the results from 'traditional CF' and 'CF using multicriteria ratings'. Our proposed system is based on the premise that some people have holistic preference scheme, whereas others have composite preference scheme. Thus, our system is designed to use traditional CF using overall rating for the users with holistic preference, and to use CF using multicriteria ratings for the users with composite preference. To validate the usefulness of the proposed system, we applied it to a real-world dataset regarding the recommendation for POI (point-of-interests). Providing personalized POI recommendation is getting more attentions as the popularity of the location-based services such as Yelp and Foursquare increases. The dataset was collected from university students via a Web-based online survey system. Using the survey system, we collected the overall ratings as well as the ratings for each criterion for 48 POIs that are located near K university in Seoul, South Korea. The criteria include 'food or taste', 'price' and 'service or mood'. As a result, we obtain 2,878 valid ratings from 112 users. Among 48 items, 38 items (80%) are used as training dataset, and the remaining 10 items (20%) are used as validation dataset. To examine the effectiveness of the proposed system (i.e. hybrid selective model), we compared its performance to the performances of two comparison models - the traditional CF and the CF with multicriteria ratings. The performances of recommender systems were evaluated by using two metrics - average MAE(mean absolute error) and precision-in-top-N. Precision-in-top-N represents the percentage of truly high overall ratings among those that the model predicted would be the N most relevant items for each user. The experimental system was developed using Microsoft Visual Basic for Applications (VBA). The experimental results showed that our proposed system (avg. MAE = 0.584) outperformed traditional CF (avg. MAE = 0.591) as well as multicriteria CF (avg. AVE = 0.608). We also found that multicriteria CF showed worse performance compared to traditional CF in our data set, which is contradictory to the results in the most previous studies. This result supports the premise of our study that people have two different types of preference schemes - holistic and composite. Besides MAE, the proposed system outperformed all the comparison models in precision-in-top-3, precision-in-top-5, and precision-in-top-7. The results from the paired samples t-test presented that our proposed system outperformed traditional CF with 10% statistical significance level, and multicriteria CF with 1% statistical significance level from the perspective of average MAE. The proposed system sheds light on how to understand and utilize user's preference schemes in recommender systems domain.

Study of Patient Teaching in The Clinical Area (간호원의 환자교육 활동에 관한 연구)

  • 강규숙
    • Journal of Korean Academy of Nursing
    • /
    • v.2 no.1
    • /
    • pp.3-33
    • /
    • 1971
  • Nursing of today has as one of its objectives the solving of problems related to human needs arising from the demands of a rapidly changing society. This nursing objective, I believe, can he attained by the appropriate application of scientific principles in the giving of comprehensive nursing care. Comprehensive nursing care may be defined as nursing care which meets all of the patient's needs. the needs of patients are said to fall into five broad categories: physical needs, psychological needs, environmental needs, socio-economic needs, and teaching needs. Most people who become ill have adjustment problems related to their new situation. Because patient teaching is one of the most important functions of professional nursing, the success of this teaching may be used as a gauge for evaluating comprehensive nursing care. This represents a challenge foe the future. A questionnaire consisting of 67 items was distributed to 200 professional nurses working ill direct patient care at Yonsei University Medical Center in Seoul, Korea. 160 (80,0%) nurses of the total sample returned completed questionnaires 81 (50.6%) nurses were graduates of 3 fear diploma courser 79 (49.4%) nurses were graduates of 4 year collegiate nursing schools in Korea 141 (88,1%) nurses had under 5 years of clinical experience in a medical center, while 19 (11.9%) nurses had more than 5years of clinical experience. Three hypotheses were tested: 1. “Nurses had high levels of concept and knowledge toward patient teaching”-This was demonstrated by the use of a statistical method, the mean average. 2. “Nurses graduating from collegiate programs and diploma school programs of nursing show differences in concepts and knowledge toward patient teaching”-This was demonstrated by a statistical method, the mean average, although the results showed little difference between the two groups. 3. “Nurses having different amounts of clinical experience showed differences in concepts and knowledge toward patient teaching”-This was demonstrated by the use of a statistical method, the mean average. 2. “Nurses graduating from collegiate programs and diploma school programs of nursing show differences in concepts and knowledge toward patient teaching”-This was demonstrated by a statistical method, the mean average, although the results showed little difference between the two groups. 3. “Nurses having different amounts of clinical experience showed differences in concepts and knowledge toward patient teaching”-This was demonstrated by the use of the T-test. Conclusions of this study are as follow: Before attempting the explanation, of the results, the questionnaire will he explained. The questionnaire contained 67 questions divided into 9 sections. These sections were: concept, content, time, prior preparation, method, purpose, condition, evaluation, and recommendations for patient teaching. 1. The nurse's concept of patient teaching: Most of the nurses had high levels of concepts and knowledge toward patient teaching. Though nursing service was task-centered at the turn of the century, the emphasis today is put on patient-centered nursing. But we find some of the nurses (39.4%) still are task-centered. After, patient teaching, only a few of the nurses (14.4%) checked this as “normal teaching.”It seems therefore that patient teaching is often done unconsciously. Accordingly it would he desirable to have correct concepts and knowledge of teaching taught in schools of nursing. 2. Contents of patient teaching: Most nurses (97.5%) had good information about content of patient teaching. They teach their patients during admission about their diseases, tests, treatments, and before discharge give nurses instruction about simple nursing care, personal hygiene, special diets, rest and sleep, elimination etc. 3. Time of patient teaching: Teaching can be accomplished even if there is no time set aside specifically for it. -a large part of the nurse's teaching can be done while she is giving nursing care. If she believes she has to wait for time free from other activities, she may miss many teaching opportunities. But generally proper time for patient teaching is in the midmorning or midafternoon since one and a half or two hours required. Nurses meet their patients in all stages of health: often tile patient is in a condition in which learning is impossible-pain, mental confusion, debilitation, loss of sensory perception, fear and anxiety-any of these conditions may preclude the possibility of successful teaching. 4. Prior preparation for patient teaching: The teaching aids, nurses use are charts (53.1%), periodicals (23.8%), and books (7.0%) Some of the respondents (28.1%) reported that they had had good preparation for the teaching which they were doing, others (27.5%) reported adequate preparation, and others (43.8%) reported that their preparation for teaching was inadequate. If nurses have advance preparation for normal teaching and are aware of their objectives in teaching patients, they can do effective teaching. 5. Method of patient teaching: The methods of individual patient teaching, the nurses in this study used, were conversation (55.6%) and individual discussion (19.2%) . And the methods of group patient teaching they used were demonstration (42.3%) and lecture (26.2%) They should also he prepared to use pamphlet and simple audio-visual aids for their teaching. 6. Purposes of patient teaching: The purposes of patient teaching is to help the patient recover completely, but the majority of the respondents (40.6%) don't know this. So it is necessary for them to understand correctly the purpose of patient teaching and nursing care. 7. Condition of patient teaching: The majority of respondents (75.0%) reported there were some troubles in teaching uncooperative patients. It would seem that the nurse's leaching would be improved if, in her preparation, she was given a better understanding of the patient and communication skills. The majority of respondents in the total group, felt teaching is their responsibility and they should teach their patient's family as well as the patient. The place for teaching is most often at the patient's bedside (95.6%) but the conference room (3.1%) is also used. It is important that privacy be provided in learning situations with involve personal matters. 8. Evaluation of patient teaching: The majority of respondents (76.3%,) felt leaching is a highly systematic and organized function requiring special preparation in a college or university, they have the idea that teaching is a continuous and ever-present activity of all people throughout their lives. The suggestion mentioned the most frequently for improving preparation was a course in patient teaching included in the basic nursing program. 9. Recommendations: 1) It is recommended, that in clinical nursing, patient teaching be emphasized. 2) It is recommended, that insertive education the concepts and purposes of patient teaching he renewed for all nurses. In addition to this new knowledge, methods and materials which can be applied to patient teaching should be given also. 3) It is recommended, in group patient teaching, we try to embark on team teaching.

  • PDF

Business Application of Convolutional Neural Networks for Apparel Classification Using Runway Image (합성곱 신경망의 비지니스 응용: 런웨이 이미지를 사용한 의류 분류를 중심으로)

  • Seo, Yian;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.1-19
    • /
    • 2018
  • Large amount of data is now available for research and business sectors to extract knowledge from it. This data can be in the form of unstructured data such as audio, text, and image data and can be analyzed by deep learning methodology. Deep learning is now widely used for various estimation, classification, and prediction problems. Especially, fashion business adopts deep learning techniques for apparel recognition, apparel search and retrieval engine, and automatic product recommendation. The core model of these applications is the image classification using Convolutional Neural Networks (CNN). CNN is made up of neurons which learn parameters such as weights while inputs come through and reach outputs. CNN has layer structure which is best suited for image classification as it is comprised of convolutional layer for generating feature maps, pooling layer for reducing the dimensionality of feature maps, and fully-connected layer for classifying the extracted features. However, most of the classification models have been trained using online product image, which is taken under controlled situation such as apparel image itself or professional model wearing apparel. This image may not be an effective way to train the classification model considering the situation when one might want to classify street fashion image or walking image, which is taken in uncontrolled situation and involves people's movement and unexpected pose. Therefore, we propose to train the model with runway apparel image dataset which captures mobility. This will allow the classification model to be trained with far more variable data and enhance the adaptation with diverse query image. To achieve both convergence and generalization of the model, we apply Transfer Learning on our training network. As Transfer Learning in CNN is composed of pre-training and fine-tuning stages, we divide the training step into two. First, we pre-train our architecture with large-scale dataset, ImageNet dataset, which consists of 1.2 million images with 1000 categories including animals, plants, activities, materials, instrumentations, scenes, and foods. We use GoogLeNet for our main architecture as it has achieved great accuracy with efficiency in ImageNet Large Scale Visual Recognition Challenge (ILSVRC). Second, we fine-tune the network with our own runway image dataset. For the runway image dataset, we could not find any previously and publicly made dataset, so we collect the dataset from Google Image Search attaining 2426 images of 32 major fashion brands including Anna Molinari, Balenciaga, Balmain, Brioni, Burberry, Celine, Chanel, Chloe, Christian Dior, Cividini, Dolce and Gabbana, Emilio Pucci, Ermenegildo, Fendi, Giuliana Teso, Gucci, Issey Miyake, Kenzo, Leonard, Louis Vuitton, Marc Jacobs, Marni, Max Mara, Missoni, Moschino, Ralph Lauren, Roberto Cavalli, Sonia Rykiel, Stella McCartney, Valentino, Versace, and Yve Saint Laurent. We perform 10-folded experiments to consider the random generation of training data, and our proposed model has achieved accuracy of 67.2% on final test. Our research suggests several advantages over previous related studies as to our best knowledge, there haven't been any previous studies which trained the network for apparel image classification based on runway image dataset. We suggest the idea of training model with image capturing all the possible postures, which is denoted as mobility, by using our own runway apparel image dataset. Moreover, by applying Transfer Learning and using checkpoint and parameters provided by Tensorflow Slim, we could save time spent on training the classification model as taking 6 minutes per experiment to train the classifier. This model can be used in many business applications where the query image can be runway image, product image, or street fashion image. To be specific, runway query image can be used for mobile application service during fashion week to facilitate brand search, street style query image can be classified during fashion editorial task to classify and label the brand or style, and website query image can be processed by e-commerce multi-complex service providing item information or recommending similar item.

Automatic Quality Evaluation with Completeness and Succinctness for Text Summarization (완전성과 간결성을 고려한 텍스트 요약 품질의 자동 평가 기법)

  • Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.125-148
    • /
    • 2018
  • Recently, as the demand for big data analysis increases, cases of analyzing unstructured data and using the results are also increasing. Among the various types of unstructured data, text is used as a means of communicating information in almost all fields. In addition, many analysts are interested in the amount of data is very large and relatively easy to collect compared to other unstructured and structured data. Among the various text analysis applications, document classification which classifies documents into predetermined categories, topic modeling which extracts major topics from a large number of documents, sentimental analysis or opinion mining that identifies emotions or opinions contained in texts, and Text Summarization which summarize the main contents from one document or several documents have been actively studied. Especially, the text summarization technique is actively applied in the business through the news summary service, the privacy policy summary service, ect. In addition, much research has been done in academia in accordance with the extraction approach which provides the main elements of the document selectively and the abstraction approach which extracts the elements of the document and composes new sentences by combining them. However, the technique of evaluating the quality of automatically summarized documents has not made much progress compared to the technique of automatic text summarization. Most of existing studies dealing with the quality evaluation of summarization were carried out manual summarization of document, using them as reference documents, and measuring the similarity between the automatic summary and reference document. Specifically, automatic summarization is performed through various techniques from full text, and comparison with reference document, which is an ideal summary document, is performed for measuring the quality of automatic summarization. Reference documents are provided in two major ways, the most common way is manual summarization, in which a person creates an ideal summary by hand. Since this method requires human intervention in the process of preparing the summary, it takes a lot of time and cost to write the summary, and there is a limitation that the evaluation result may be different depending on the subject of the summarizer. Therefore, in order to overcome these limitations, attempts have been made to measure the quality of summary documents without human intervention. On the other hand, as a representative attempt to overcome these limitations, a method has been recently devised to reduce the size of the full text and to measure the similarity of the reduced full text and the automatic summary. In this method, the more frequent term in the full text appears in the summary, the better the quality of the summary. However, since summarization essentially means minimizing a lot of content while minimizing content omissions, it is unreasonable to say that a "good summary" based on only frequency always means a "good summary" in its essential meaning. In order to overcome the limitations of this previous study of summarization evaluation, this study proposes an automatic quality evaluation for text summarization method based on the essential meaning of summarization. Specifically, the concept of succinctness is defined as an element indicating how few duplicated contents among the sentences of the summary, and completeness is defined as an element that indicating how few of the contents are not included in the summary. In this paper, we propose a method for automatic quality evaluation of text summarization based on the concepts of succinctness and completeness. In order to evaluate the practical applicability of the proposed methodology, 29,671 sentences were extracted from TripAdvisor 's hotel reviews, summarized the reviews by each hotel and presented the results of the experiments conducted on evaluation of the quality of summaries in accordance to the proposed methodology. It also provides a way to integrate the completeness and succinctness in the trade-off relationship into the F-Score, and propose a method to perform the optimal summarization by changing the threshold of the sentence similarity.

Design and Implementation of MongoDB-based Unstructured Log Processing System over Cloud Computing Environment (클라우드 환경에서 MongoDB 기반의 비정형 로그 처리 시스템 설계 및 구현)

  • Kim, Myoungjin;Han, Seungho;Cui, Yun;Lee, Hanku
    • Journal of Internet Computing and Services
    • /
    • v.14 no.6
    • /
    • pp.71-84
    • /
    • 2013
  • Log data, which record the multitude of information created when operating computer systems, are utilized in many processes, from carrying out computer system inspection and process optimization to providing customized user optimization. In this paper, we propose a MongoDB-based unstructured log processing system in a cloud environment for processing the massive amount of log data of banks. Most of the log data generated during banking operations come from handling a client's business. Therefore, in order to gather, store, categorize, and analyze the log data generated while processing the client's business, a separate log data processing system needs to be established. However, the realization of flexible storage expansion functions for processing a massive amount of unstructured log data and executing a considerable number of functions to categorize and analyze the stored unstructured log data is difficult in existing computer environments. Thus, in this study, we use cloud computing technology to realize a cloud-based log data processing system for processing unstructured log data that are difficult to process using the existing computing infrastructure's analysis tools and management system. The proposed system uses the IaaS (Infrastructure as a Service) cloud environment to provide a flexible expansion of computing resources and includes the ability to flexibly expand resources such as storage space and memory under conditions such as extended storage or rapid increase in log data. Moreover, to overcome the processing limits of the existing analysis tool when a real-time analysis of the aggregated unstructured log data is required, the proposed system includes a Hadoop-based analysis module for quick and reliable parallel-distributed processing of the massive amount of log data. Furthermore, because the HDFS (Hadoop Distributed File System) stores data by generating copies of the block units of the aggregated log data, the proposed system offers automatic restore functions for the system to continually operate after it recovers from a malfunction. Finally, by establishing a distributed database using the NoSQL-based Mongo DB, the proposed system provides methods of effectively processing unstructured log data. Relational databases such as the MySQL databases have complex schemas that are inappropriate for processing unstructured log data. Further, strict schemas like those of relational databases cannot expand nodes in the case wherein the stored data are distributed to various nodes when the amount of data rapidly increases. NoSQL does not provide the complex computations that relational databases may provide but can easily expand the database through node dispersion when the amount of data increases rapidly; it is a non-relational database with an appropriate structure for processing unstructured data. The data models of the NoSQL are usually classified as Key-Value, column-oriented, and document-oriented types. Of these, the representative document-oriented data model, MongoDB, which has a free schema structure, is used in the proposed system. MongoDB is introduced to the proposed system because it makes it easy to process unstructured log data through a flexible schema structure, facilitates flexible node expansion when the amount of data is rapidly increasing, and provides an Auto-Sharding function that automatically expands storage. The proposed system is composed of a log collector module, a log graph generator module, a MongoDB module, a Hadoop-based analysis module, and a MySQL module. When the log data generated over the entire client business process of each bank are sent to the cloud server, the log collector module collects and classifies data according to the type of log data and distributes it to the MongoDB module and the MySQL module. The log graph generator module generates the results of the log analysis of the MongoDB module, Hadoop-based analysis module, and the MySQL module per analysis time and type of the aggregated log data, and provides them to the user through a web interface. Log data that require a real-time log data analysis are stored in the MySQL module and provided real-time by the log graph generator module. The aggregated log data per unit time are stored in the MongoDB module and plotted in a graph according to the user's various analysis conditions. The aggregated log data in the MongoDB module are parallel-distributed and processed by the Hadoop-based analysis module. A comparative evaluation is carried out against a log data processing system that uses only MySQL for inserting log data and estimating query performance; this evaluation proves the proposed system's superiority. Moreover, an optimal chunk size is confirmed through the log data insert performance evaluation of MongoDB for various chunk sizes.