• Title/Summary/Keyword: Sentence Frequency


Research on Designing Korean Emotional Dictionary using Intelligent Natural Language Crawling System in SNS (SNS대상의 지능형 자연어 수집, 처리 시스템 구현을 통한 한국형 감성사전 구축에 관한 연구)

  • Lee, Jong-Hwa
    • The Journal of Information Systems / v.29 no.3 / pp.237-251 / 2020
  • Purpose: This research studied a hierarchical Hangul emotion index by organizing the full range of emotions that SNS users express. As a preliminary study, the researcher reinterpreted the English-based emotional standard of Plutchik (1980) in Korean and studied hashtags that carry implicit meaning on SNS. To build a multidimensional emotion dictionary and classify three-dimensional emotions, an emotion seed was selected for each of seven emotion sets, and an emotion word dictionary was constructed by collecting the SNS hashtags derived from each emotion seed. The study also explores the priority of each Hangul emotion index. Design/methodology/approach: In converting the words that make up each sentence into vectors and then into a matrix, weights were extracted using TF-IDF (Term Frequency-Inverse Document Frequency), and the NMF (Nonnegative Matrix Factorization) algorithm was used to reduce the dimension of the matrix within each emotion set. The emotional dimension was resolved using the characteristic values of the emotion words. Cosine distance was used to measure the distance between vectors, and thus the similarity of emotion words within an emotion set. Findings: Analyzing customer needs requires the ability to read changes in emotion, and research on Korean emotion words serves that need. In addition, the ranking of emotion words within an emotion set will serve as a criterion for reading the depth of an emotion. The author believes that, by providing companies with effective information for emotional marketing, the sentiment index will help expand and add value to new business opportunities. In addition, if the emotion dictionary is eventually connected to products, it will be possible to define a product's "emotional DNA", the set of emotions that the product should have.
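
The weighting and similarity steps named in this abstract can be illustrated with a minimal sketch. It assumes documents are already tokenized, uses the plain `tf * log(N / df)` form of TF-IDF (the paper does not state which variant it used), and omits the NMF step. The hashtag lists are made-up placeholders, not data from the study.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, TF-IDF vectors) for a list of tokenized documents."""
    vocab = sorted({tok for doc in docs for tok in doc})
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(tok for doc in docs for tok in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        size = len(doc) or 1  # avoid division by zero for empty documents
        vectors.append(
            [(counts[t] / size) * math.log(n_docs / df[t]) for t in vocab]
        )
    return vocab, vectors


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical hashtag collections, one per emotion seed (illustrative only).
hashtag_docs = [
    ["happy", "joy", "smile"],
    ["joy", "smile", "love"],
    ["angry", "rage"],
]
vocab, vectors = tfidf_vectors(hashtag_docs)
sim_related = cosine_similarity(vectors[0], vectors[1])
sim_unrelated = cosine_similarity(vectors[0], vectors[2])
```

Cosine distance, the quantity the abstract mentions, is `1 - cosine_similarity(u, v)`.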

Co-occurrence Based Drug-disease Relationship Inference with Genes as Mediators (유전자를 중간 매개로 고려한 동시발생 기반의 약물-질병 관계 추론)

  • Shin, Sangwon;Sin, Yeeun;Jang, Giup;Yoo, Youngmi
    • The Journal of Korean Institute of Information Technology / v.16 no.11 / pp.1-9 / 2018
  • Drug repositioning is the discovery of new uses for existing drugs. Text mining derives knowledge from unstructured text. We propose a method that predicts new drug-disease relationships by taking into account the rate at which genes co-occur in both disease-gene and gene-drug pairs. Drug-gene and gene-disease co-occurrences in the biological literature are counted, and the rate of each gene is calculated for each drug and disease. The weight of each drug-disease relationship is calculated as the average of the rates of the shared genes, and these weights are used to measure accuracy for each disease. Measuring co-occurrence frequency at the sentence level and considering multiple relationships identified drug-disease relationships more accurately than the existing method.
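
A rough sketch of sentence-level co-occurrence counting with genes as mediators follows. The abstract does not give the exact rate or weight formulas, so two assumptions are made here: a gene's rate is its share of all gene co-occurrences for that drug (or disease), and the drug-disease weight is the mean, over shared genes, of the two rates averaged. All function names and sentences are hypothetical.

```python
from collections import Counter, defaultdict


def cooccurrence_counts(sentences, left_terms, right_terms):
    """Count sentences in which a left term and a right term both appear."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        tokens = set(sentence.lower().split())
        for left in left_terms:
            if left in tokens:
                for right in right_terms:
                    if right in tokens:
                        counts[left][right] += 1
    return counts


def gene_rates(counts):
    """Assumed rate: a gene's share of all gene co-occurrences for one entity."""
    rates = {}
    for entity, gene_counts in counts.items():
        total = sum(gene_counts.values())
        rates[entity] = {gene: n / total for gene, n in gene_counts.items()}
    return rates


def drug_disease_weight(drug, disease, drug_rates, disease_rates):
    """Assumed weight: mean over shared genes of the averaged drug and disease rates."""
    shared = set(drug_rates.get(drug, {})) & set(disease_rates.get(disease, {}))
    if not shared:
        return 0.0
    return sum(
        (drug_rates[drug][g] + disease_rates[disease][g]) / 2 for g in shared
    ) / len(shared)


# Illustrative sentences only; not taken from the paper's corpus.
sentences = [
    "aspirin inhibits ptgs2",
    "aspirin binds ptgs1",
    "ptgs2 is elevated in cancer",
    "tp53 is mutated in cancer",
]
genes = {"ptgs1", "ptgs2", "tp53"}
drug_rates = gene_rates(cooccurrence_counts(sentences, {"aspirin"}, genes))
disease_rates = gene_rates(cooccurrence_counts(sentences, {"cancer"}, genes))
weight = drug_disease_weight("aspirin", "cancer", drug_rates, disease_rates)
```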

Frequency and Social Network Analysis of the Bible Data using Big Data Analytics Tools R (빅데이터 분석도구 R을 이용한 성경 데이터의 빈도와 소셜 네트워크 분석)

  • Ban, ChaeHoon;Ha, JongSoo;Kim, Dong Hyun
    • Journal of the Korea Institute of Information and Communication Engineering / v.24 no.2 / pp.166-171 / 2020
  • Big data processing technology, which can store and analyze data and yield new knowledge, has gained importance in many fields of society. Big data is emerging as an important issue in information and communication technology, and interest in the underlying technology continues to rise. R, a tool for analyzing big data, is a language and environment for statistics-based information analysis. In this paper, we use R to analyze Bible data, specifically the four Gospels of the New Testament. We collect the Bible data and filter it for analysis. R is used to investigate the frequency distribution of the text and to analyze the Bible through social network analysis, in which the words of each sentence are paired and the relationships between words are analyzed.
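
The two analyses described here, word frequency and sentence-level word pairing for a network, can be sketched as below. The paper used R; Python is used here only for illustration, and the sentences, stopword list, and function names are assumptions rather than the authors' pipeline.

```python
from collections import Counter
from itertools import combinations


def word_frequencies(sentences, stopwords=frozenset()):
    """Count how often each non-stopword occurs across all sentences."""
    return Counter(
        word
        for sentence in sentences
        for word in sentence.lower().split()
        if word not in stopwords
    )


def word_pair_edges(sentences, stopwords=frozenset()):
    """Count, for each unordered word pair, the sentences containing both words."""
    edges = Counter()
    for sentence in sentences:
        words = sorted({w for w in sentence.lower().split() if w not in stopwords})
        edges.update(combinations(words, 2))
    return edges


# Illustrative sentences only.
sentences = [
    "jesus said to the disciples",
    "the disciples followed jesus",
]
stopwords = frozenset({"the", "to"})
frequencies = word_frequencies(sentences, stopwords)
edges = word_pair_edges(sentences, stopwords)
```

Each key in `edges` is an edge of the word network and its count is the edge weight.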

Long Term Average Spectrum Characteristics of Speaking Voice of Western Operatic Singers (Long Term Average Spectrum을 이용한 성악가들의 Speaking Voice 분석)

  • Lee, Kyung-Chul;Hong, Seok-Jin;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.15 no.2 / pp.122-127 / 2004
  • Background and Objectives: Many studies have described and analyzed the singer's formant, and the epilaryngeal tube in the human airway has been shown to be responsible for vocal ring, or the singer's formant. A similar phenomenon produced by trained singers in their speech led some authors to examine the speaker's ring. This study was designed to analyze the speaking voice of singers and the speaker's ring. Materials and Methods: Ten tenors, fifteen baritones, fifteen sopranos, and ten mezzo-sopranos attending the department of vocal music at a music college were chosen for this study. Fifteen male and fifteen female untrained normal speakers were chosen as the control group. Each subject was asked to produce a sustained spoken vowel /ah/ for at least five seconds and to read the sentence 'Kaeul'. The sound data were analyzed using the Fast Fourier Transform (FFT)-based power spectrum and the long term average (LTA) power spectrum, computed with the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test in the Statistical Package for the Social Sciences (SPSS). Results: In the LTA power spectrum of the /ah/ sound, a significant increase was seen in the 2,500-3,500 Hz region (p<0.01) in all four trained singer groups compared with the untrained speaker group, and a significant increase in the 9,000-10,000 Hz region (p<0.01) in the soprano group. Similarly, for the sentence 'Kaeul', energy increased significantly in the 2,500-3,500 Hz region (p<0.01) in the tenor, baritone, and mezzo-soprano groups compared with the untrained speaker group, and in all frequency regions (p<0.01) in the soprano group. Conclusions: The LTA power spectrum suggests that trained singers show greater energy concentration in the 'singer's formant' region in the speaking voice, and the authors believe this region to be the 'speaker's ring'. Further research is needed on the effect of singing training on the resonance of the speaking voice.
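
A long term average spectrum is the mean of the power spectra of successive frames of a recording. The sketch below shows the idea with a naive discrete Fourier transform on a synthetic tone. The frame size, sample rate, and lack of windowing are illustrative assumptions and do not reproduce the CSL settings used in the study.

```python
import cmath
import math


def power_spectrum(frame):
    """Power at each non-negative frequency bin, via a naive DFT."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame)
        )
        spectrum.append(abs(coeff) ** 2 / n)
    return spectrum


def long_term_average_spectrum(signal, frame_size):
    """Average the power spectra of consecutive non-overlapping frames."""
    frames = [
        signal[start:start + frame_size]
        for start in range(0, len(signal) - frame_size + 1, frame_size)
    ]
    if not frames:
        raise ValueError("signal is shorter than one frame")
    spectra = [power_spectrum(frame) for frame in frames]
    return [sum(bin_values) / len(spectra) for bin_values in zip(*spectra)]


# Synthetic 1,000 Hz tone sampled at 8,000 Hz (assumed values for illustration).
sample_rate = 8000
frame_size = 64
signal = [math.sin(2 * math.pi * 1000 * i / sample_rate) for i in range(512)]
ltas = long_term_average_spectrum(signal, frame_size)
peak_bin = max(range(len(ltas)), key=ltas.__getitem__)
peak_hz = peak_bin * sample_rate / frame_size
```

Comparing groups in a band such as 2,500-3,500 Hz would then amount to summing the `ltas` bins whose frequencies fall in that range.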


A Corpus-based Study of Translation Universals in English Translations of Korean Newspaper Texts (한국 신문의 영어 번역에 나타난 번역 보편소의 코퍼스 기반 분석)

  • Goh, Gwang-Yoon;Lee, Younghee (Cheri)
    • Cross-Cultural Studies / v.45 / pp.109-143 / 2016
  • This article examines distinctive linguistic shifts in translational English in an effort to verify the translation universals hypotheses that have been most heavily explored to date: simplification, explicitation, normalization, and leveling-out. A large-scale study of comparable corpora of translated and non-translated English newspaper texts was carried out to identify linguistic attributes inherent in translated texts. The main findings are as follows. First, using the parameters of STTR, top-to-bottom frequency words, and mean sentence length, instances of simplification were detected across the translated English newspaper corpora. In contrast, the proportion of function words produced contrary results, which suggests that this feature might not constitute an effective test of the hypothesis. Second, connectives were more salient in original English newspaper texts than in translated English texts, which is incompatible with the explicitation hypothesis. Third, as an indicator of translational normalization, lexical bundles were more pervasive in translated texts than in non-translated texts, which is expected under, and therefore supports, the normalization hypothesis. Finally, the standard deviations of both STTR and mean sentence length were higher in translated texts, indicating that the translated English newspaper texts were less leveled out within the same corpus group, contrary to what the leveling-out hypothesis postulates. Overall, the results suggest that not all four hypotheses may qualify for the label of translation universals, or at least that some translational predictors are not adequate for evaluating the translation universals hypotheses.
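
Two of the parameters named above are simple to compute. STTR (standardized type-token ratio) is the mean type-token ratio over consecutive fixed-size chunks of a text, which makes texts of different lengths comparable. The sketch below returns a ratio rather than a percentage. The chunk size of 1,000 tokens is a common default and an assumption here, since the abstract does not state the value used.

```python
from statistics import mean


def sttr(tokens, chunk_size=1000):
    """Mean type-token ratio over consecutive full chunks of chunk_size tokens."""
    chunks = [
        tokens[start:start + chunk_size]
        for start in range(0, len(tokens) - chunk_size + 1, chunk_size)
    ]
    if not chunks:
        raise ValueError("need at least chunk_size tokens")
    return mean(len(set(chunk)) / chunk_size for chunk in chunks)


def mean_sentence_length(sentences):
    """Mean number of whitespace-separated tokens per sentence."""
    return mean(len(sentence.split()) for sentence in sentences)


# Toy input with a tiny chunk size so the arithmetic is easy to follow:
# chunk 1 has 2 types in 4 tokens (0.5), chunk 2 has 4 types in 4 tokens (1.0).
toy_tokens = "a a a b c d e f".split()
toy_sttr = sttr(toy_tokens, chunk_size=4)
toy_length = mean_sentence_length(["a b c", "d e"])
```

A lower STTR or shorter mean sentence length in translated texts is what the simplification hypothesis predicts.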

Stress status classification based on EEG signals (뇌파 신호 기반 스트레스 상태 분류)

  • Kang, Jun-Su;Jang, Giljin;Lee, Minho
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.16 no.3 / pp.103-108 / 2016
  • In daily life, people experience stress very often. Stress is an important factor in a healthy life and is closely related to quality of life. Excessive stress is known to cause hormonal imbalance in the body, which can be observed through brain and other biosignals. Based on this, we explore the relationship between brain signals and stress and propose a brain-signal-based stress index. In this study, a 32-channel EEG measurement device is adopted. However, only two channels (FP1, FP2) are used, both to reflect the applicability of the proposed method in a real environment and to allow comparison with a commercial 2-channel EEG device. Frequency-domain features are the power of each frequency band and the differences, sums, and ratios of band powers. Time-domain features include the Hurst exponent, correlation dimension, and Lyapunov exponent. A total of six subjects participated in the experiment, which used an English sentence reading task. Among the candidate features, $\frac{\theta\ \text{power}}{\text{mid-}\beta\ \text{power}}$ shows the best test performance (70.8%). In future work, we will confirm whether the results are consistent on a low-priced EEG device.
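
The best-performing feature, the ratio of theta power to mid-beta power, can be sketched as follows. The band limits (theta 4-8 Hz, mid-beta 15-20 Hz), the sample rate, and the synthetic signal are assumptions for illustration. The abstract does not state the band definitions or preprocessing the authors used.

```python
import cmath
import math

# Assumed band edges in Hz; the paper's exact definitions are not given.
THETA_BAND = (4.0, 8.0)
MID_BETA_BAND = (15.0, 20.0)


def band_power(signal, sample_rate, low_hz, high_hz):
    """Sum of squared DFT magnitudes for bins with low_hz <= frequency < high_hz."""
    n = len(signal)
    total = 0.0
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if low_hz <= freq < high_hz:
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * i / n)
                for i, x in enumerate(signal)
            )
            total += abs(coeff) ** 2
    return total


def theta_mid_beta_ratio(signal, sample_rate):
    """Theta band power divided by mid-beta band power."""
    theta = band_power(signal, sample_rate, *THETA_BAND)
    mid_beta = band_power(signal, sample_rate, *MID_BETA_BAND)
    return theta / mid_beta


# Synthetic two-second "EEG": a 6 Hz component with amplitude 2 plus an
# 18 Hz component with amplitude 1, sampled at an assumed 128 Hz.
sample_rate = 128
signal = [
    2.0 * math.sin(2 * math.pi * 6 * i / sample_rate)
    + 1.0 * math.sin(2 * math.pi * 18 * i / sample_rate)
    for i in range(256)
]
ratio = theta_mid_beta_ratio(signal, sample_rate)
```

Because power scales with the square of amplitude, the 2:1 amplitude ratio in this synthetic signal gives a power ratio of about 4.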

Correlation Between Knowledge and Educational Needs Related to Recurrent in Coronary Artery bypass graft patients (관상동맥 우회술환자의 수술 후 재발 관련 지식과 교육요구도와의 상관관계)

  • 김희승;박민정
    • Journal of Korean Academy of Nursing / v.30 no.3 / pp.549-559 / 2000
  • The purpose of this study was to investigate the correlation between knowledge and educational needs related to recurrence in coronary artery bypass graft (CABG) patients, as a basis for providing individualized nursing education for this population. The subjects were 110 patients who had undergone CABG at Asan Medical Center in Seoul and Sechong hospital in Buchon. Data were obtained from a knowledge questionnaire and a learning needs questionnaire between November 1998 and February 1999. Data were analyzed with the SAS program using the Wilcoxon rank sum test and the Spearman correlation coefficient. The results were as follows: 1. Of the 18 items measuring knowledge, the mean (median) number of items answered 'don't know' was 4.9 (4), and the mean (median) number of items answered wrong was 3.2 (3). The number of items answered 'don't know' tended to be higher in those with less education, blue-collar jobs, and a history of myocardial infarction than in their counterparts. Items answered 'don't know' were also more frequent in those without hypertension. 2. With regard to the level of knowledge about CABG by questionnaire item, the item most often answered 'I don't know' (59.1%) was 'He has to be treated with anticoagulant drug to prevent revascularized vessel from obstructing.' The second most frequent (56.4%) was 'If you were hypotensive, the coronary attack would collapse.' The items most often answered wrong were 'During the hospitalized day, the patient has complete bedrest.', 'Cholesterol has not to intake.', and 'After surgery, the sexual life is need controlled for 1 year.' 3. The mean educational need score was 3.38. By item, the learning needs for 'Food that benefits heart disease', 'Recurrence possibility of heart disease', 'Management method of operation site', and 'Risk symptoms that require a hospital visit or immediate report' were higher than for other items. By factor, 'food (5 items)', 'disease (9 items)', and 'exercise (3 items)' showed higher learning needs than other factors. By patient characteristics, educational needs tended to be higher in males, those under the age of 49, those with a middle or high school education, and those with previous admission for coronary artery disease, a history of myocardial infarction, experience of PTCA, a history of cerebrovascular accident, or previous smoking than in their counterparts. 4. The numbers of items answered 'don't know', wrong, and correct were not correlated with the level of educational needs. In summary, items answered 'don't know' tended to be more numerous in those with less education, blue-collar jobs, and a history of myocardial infarction, and in those without hypertension. 'Don't know' answers were more frequent for items on anti-thrombotic therapy, hypotension, and pain relief, and wrong answers were more frequent for items on bed rest period, cholesterol intake, and sexual life. Educational needs were higher in the younger age group and in those with previous experience of the procedure or a history of other disease. When educating CABG patients, diet, the possibility of disease recurrence, management of the operation site, and risk symptoms should be emphasized.
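
The Spearman correlation coefficient used in this study is the Pearson correlation of the ranks of the two variables, with tied values given their average rank. A minimal sketch is shown below. The two short score lists are invented for illustration and are not the study's data.

```python
import math


def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical knowledge scores and educational-need scores for five patients.
knowledge = [1, 2, 3, 4, 5]
needs = [2, 4, 5, 4, 5]
rho = spearman(knowledge, needs)
```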


Analysis of Vision Statements in 6th Community Health Plan of Local Government in Korea (우리나라 시·군·구 지역보건의료계획의 비전(Vision) 문구 분석)

  • Ahn, Chi-Young;Kim, Hyun-Soo;Kim, Won-bin;Oh, Chang-hoon;Hong, Jee-Young;Kim, Eun-Young;Lee, Moo-Sik
    • Journal of agricultural medicine and community health / v.42 no.1 / pp.1-12 / 2017
  • Objectives: In this study, we analyzed the vision statements of the 6th community health plans of local governments in Korea. Methods: We examined the vision statements, missions, strategic plans, and long-term missions in the 6th community health plans of 229 local governments in Korea. We analyzed the number of letters in each vision statement, sentence form, and word frequency using frequency analysis, the chi-square test, and one-way ANOVA. Results: Among the 229 local governments, 172 (75.1%) had vision statements of fewer than 17 (Korean) letters, and there was a significant difference by type of community health center (p<0.05). Figurative expression (37.1%) was the most common form of expression in vision statement sentences, and special characters (43.2%) were the most commonly used non-Korean elements. The most commonly used words in vision statements, in order of frequency, were 'health', 'happiness', 'with', 'citizen', 'city', and '100 years old', among others. The Chungcheong provinces and Daejeon metropolitan city had the highest score for directionality in the phrase evaluation, with a significant difference by regional class of local government (p<0.01). The Gyeongsang provinces and the metropolitan cities of Ulsan, Daegu, and Busan had the highest scores for future orientation and sharing possibility in the phrase evaluation, with a significant difference by regional class of local government (p<0.01). Conclusions: Vision is one of the most important components of a community health plan. A more detailed 'vision statement guideline' is needed, and the community health centers of local governments should make an effort to make their visions clearer and more complete.
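
The chi-square test of independence mentioned in the Methods compares observed counts in a contingency table with the counts expected if the two classifications were unrelated. The sketch below computes the test statistic and degrees of freedom only. The 2x2 table is made up for illustration and is not the study's data.

```python
def chi_square_statistic(table):
    """Return (chi-square statistic, degrees of freedom) for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return statistic, dof


# Hypothetical counts: rows are two types of health center, columns are
# "short" and "long" vision statements.
table = [
    [10, 20],
    [20, 10],
]
statistic, dof = chi_square_statistic(table)
```

The p-value is then read from the chi-square distribution with `dof` degrees of freedom, which a statistics package would normally supply.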

The Analysis of Elementary Pre-service Teachers' Reflective Thinking and Experiment Performance Ability on Photosynthesis Experiment (광합성 실험에서 나타난 초등 예비교사들의 반성적 사고와 실험 수행 능력 분석)

  • Kim, Dong-Ryeul
    • Journal of Korean Elementary Science Education / v.34 no.4 / pp.502-518 / 2015
  • To examine elementary pre-service teachers' reflective thinking and experiment performance ability in relation to the photosynthesis experiment in the Korean elementary school science textbook, this study was conducted with elementary pre-service teachers. They were asked to carry out the experiment and to write their own reports on the difficulties encountered in the exploration process and their solutions. The study analyzes their reflective thinking and experiment performance ability based on the reports and presentation materials of 10 groups. For reflective thinking, 108 statements associated with four types, 'Knowledge, Procedure, Orientation, Attitude', were extracted from the 10 reports. Many sentences reflected thinking acquired through analysis of the photosynthesis experiment. Within the Knowledge type, reflective thinking about newly discovered or changed concepts through experimentation occurred with the highest frequency. For experiment performance ability, 56 sentences were extracted from the reports of the group with the highest and the group with the lowest combined frequency across the four types of reflective thinking among the 10 groups, and were analyzed by four steps: 'Experimental preparation and safety accident prevention', 'Experiments performance', 'Experimental results and generalization', and 'Experimental results and feedback'. The analysis showed that the biggest difference between the two groups was in the 'experiment results supplement and feedback' step. The lowest group's report contained no content related to 'Computer-assisted information processing' in the 'Experimental results summary and generalization' stage, or to 'Alternative reagents and materials research' and 'Devising alternative experiment methods'.

Content Analysis of Crisis Response Communication Strategies along Crisis Stages for Match-fixing Case in K-League (프로축구 승부조작 사건에 대한 프로축구연맹의 위기단계별 위기대응 커뮤니케이션 전략 분석)

  • Bang, Shinwoong;Hwang, Sunhwan
    • The Journal of the Korea Contents Association / v.14 no.5 / pp.390-402 / 2014
  • Based on Sturges' crisis stages, this study examines the crisis response communication strategies of the Korea Professional Football League (KPFL) in the K-League match-fixing case, as well as the frequency of related news articles and their sources of information. To explore the strategies the KPFL used, a total of 118 news articles were analyzed using content analysis and frequency analysis. The unit of analysis for crisis response communication strategies was the sentence. The frequency of news articles by crisis stage was highest at the acute crisis stage. The KPFL was one of the major sources of information for the news reports. Across all crisis stages, the corrective action strategy was the one the KPFL used most often. In particular, the crisis response communication strategies of the teams, the players, and the KPFL lacked consistency throughout all crisis stages. Implications and directions for future research are discussed.