• Title/Summary/Keyword: words frequency

Search Result 876, Processing Time 0.025 seconds

Text Mining of Wood Science Research Published in Korean and Japanese Journals

  • Eun-Suk JANG
    • Journal of the Korean Wood Science and Technology
    • /
    • v.51 no.6
    • /
    • pp.458-469
    • /
    • 2023
  • Text mining techniques provide valuable insights into research information across various fields. In this study, text mining was used to identify research trends in wood science from 2012 to 2022, with a focus on representative journals published in Korea and Japan. Abstracts from Journal of the Korean Wood Science and Technology (JKWST, 785 articles) and Journal of Wood Science (JWS, 812 articles) obtained from the SCOPUS database were analyzed in terms of the word frequency (specifically, term frequency-inverse document frequency) and co-occurrence network analysis. Both journals showed a significant occurrence of words related to the physical and mechanical properties of wood. Furthermore, words related to wood species native to each country and their respective timber industries frequently appeared in both journals. CLT was a common keyword in engineering wood materials in Korea and Japan. In addition, the keywords "MDF," "MUF," and "GFRP" were ranked in the top 50 in Korea. Research on wood anatomy was inferred to be more active in Japan than in Korea. Co-occurrence network analysis showed that words related to the physical and structural characteristics of wood were organically related to wood materials.

Comparative Analysis in Perception of Retro Fashion and New-tro Fashion Using Big Data (빅 데이터를 활용한 레트로 패션과 뉴트로 패션에 대한 인식 비교)

  • Kyung Ja Paek;Jeong-Mee Kim
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.25 no.1
    • /
    • pp.83-96
    • /
    • 2023
  • The purpose of this study is to compare and analyze the perception of retro fashion and new-tro fashion using big data. TEXTOM allowed the collection of big data on the words 'retro fashion' and 'new-tro fashion', which was refined afterwards. As for the data collection period, Jan. 1, 2019 to Nov. 30, 2022 was set. A top 50 list of words were extracted from this data based on appearance frequency. The extracted words were processed through Network centrality analysis and CONCOR analysis using Ucinet 6. The results are as follows. 1) In retro fashion, the appearance frequency of 'style' was the highest, followed by 'sensibility', 'color', 'trend', 'fashion', and 'brand'. These words came up with high TF-IDF values. Network centrality analysis discovered that 'color', 'style', 'trend', 'sensibility', and 'design' had high level of connectivity with other words. CONCOR analysis showed a total of four significant groups; trends, styles, looks, and photos. 2) In new-tro fashion, the appearance frequency of 'retro' was the highest, followed by 'trend', 'generation', 'style', 'brand', and 'fashion'. These words also came up with high TF-IDF values. Network centrality analysis found that 'retro', 'trend', 'generation', and 'brand' had high level of connectivity with other words. CONCOR analysis showed a total of four significant groups; style, brand, clothing, and trend. 3) New-tro fashion is included in retro fashion in that it reproduces the styles of the past. However, it is taken completely differently from generation to generation. Unlike the older generations, millennials actively accept newly created clothes and brands based on the past styles. They perceive it as a fashion that reveals their own unique tastes and tastes.

The neighborhood size and frequency effect in Korean words (한국어 단어재인에서 나타나는 이웃효과)

  • Kwon You-An;Cho Hye-Suk;Nam Ki-Chun
    • Proceedings of the KSPS conference
    • /
    • 2006.05a
    • /
    • pp.117-120
    • /
    • 2006
  • This paper examined two hypotheses. Firstly, if the first syllable of word play an important role in visual word recognition, it may be the unit of word neighbor. Secondly, if the first syllable is the unit of lexical access, the neighborhood size effect and the neighborhood frequency effect would appear in a lexical decision task and a form primed lexical decision task. We conducted two experiments. Experiment 1 showed that words had large neighbors made a inhibitory effect in the LDT(lexical decision task). Experiment 2 showed the interaction between the neighborhood frequency effectand the word form similarity in the form primed LDT. We concluded that the first syllable in Korean words might be the unit of word neighborhood and play a central role in a lexical access.

  • PDF

Intelligent Wordcloud Using Text Mining (텍스트 마이닝을 이용한 지능적 워드클라우드)

  • Kim, Yeongchang;Ji, Sangsu;Park, Dongseo;Lee, Choong Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.325-326
    • /
    • 2019
  • This paper proposes an intelligent word cloud by improving the existing method of representing word cloud by examining the frequency of nouns with text mining technique. In this paper, we propose a method to visually show word clouds focused on other parts, such as verbs, by effectively adding newly-coined words and the like to a dictionary that extracts noun words in text mining. In the experiment, the KoNLP package was used for extracting the frequency of existing nouns, and 80 new words that were not supported were added manually by examining frequency.

  • PDF

Effects of orthographic and morphological frequency of a syllable in Korean word recognition (한국어 음절의 표기빈도와 형태소빈도가 단어인지에 미치는 효과)

  • Yi, Kwang-Oh;Bae, Sung-Bong
    • Korean Journal of Cognitive Science
    • /
    • v.20 no.3
    • /
    • pp.309-333
    • /
    • 2009
  • Two experiments were conducted to examine the role of Kulja and morpheme in processing two-syllable Sino-Korean words. In Experiment 1, the effects of morphemic frequency were not significant at the initial and final positions of a word while Kulja frequency and Kulja-morpheme correspondence at both positions in a word had a significant impact on the processing of nonwords. Lexical decision times were longer for nonwords with high frequency Kulja and for nonwords with ambiguous Kulja-morpheme correspondence whose Kulja can go with many different morphemes. In Experiment 2 Kulja-morpheme correspondence was examined for words as well as nonwords. Lexical decisions were slower for stimuli with ambiguous Kulja-morpheme correspondence. The effect was more stable for nonwords, which replicated the result of Experiment 1. In sum, the results of this study suggest that words with ambiguous Kulja-morpheme correspondence activate many different morphemes and competition among these morphemic candidates slows down the lexical selection process. Kulja frequency, Kulja neighborhood, morphemic frequency, morphological neighborhood, and Kulja-morpheme correspondence in Korean word recognition were also discussed.

  • PDF

English Bible Text Visualization Using Word Clouds and Dynamic Graphics Technology (단어 구름과 동적 그래픽스 기법을 이용한 영어성경 텍스트 시각화)

  • Jang, Dae-Heung
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.3
    • /
    • pp.373-386
    • /
    • 2014
  • A word cloud is a visualization of word frequency in a given text. The importance of each word is shown in font size or color. This plot is useful for quickly perceiving the most prominent words and for locating a word alphabetically to determine its relative prominence. With dynamic graphics, we can find the changing pattern of prominent words and their frequencies according to the changing selection of chapters in a given text. We can define the word frequency matrix. In this matrix, rows are chapters in text and columns are ranks corresponding to word frequency about the words in the text. We can draw the word frequency matrix plot with this matrix. Dynamic graphic can indicate the changing pattern of the word frequency matrix according to the changing selection of the range of ranks of words. We execute an English Bible text visualization using word clouds and dynamic graphics technology.

Analysis of Mission, Vision and Core values in Korean Tertiary General Hospitals Through Text Mining (텍스트 마이닝을 통한 상급종합병원의 미션, 비전, 핵심가치 분석 연구)

  • Ji-Hoon Lee
    • Korea Journal of Hospital Management
    • /
    • v.28 no.2
    • /
    • pp.32-43
    • /
    • 2023
  • Purposes: This research is conducted to identify main features and trends of mission, vision and core values in Korean tertiary general hospitals by using text-mining. Methodology: For the study, 45 mission, 112 vision and 190 core values are collected from 45 tertiary general hospitals' homepages in 2022 and use word frequency analysis and Leyword co-occurrence analysis. Findings: In the tertiary general hospitals' mission, there are high frequency words such as 'health', 'humanity', 'medical treatment', 'education', 'research', 'happiness', 'love', 'best', 'spirit', and mission mainly includes the content of contributing humanity's health and happiness with these words. In case of vision, high frequency words are 'hospital', 'medical treatment', 'research', 'lead', 'trust', 'centered', 'patient', 'best', 'future'. By using these words in vision, it represents the definition and characteristics of vision such as ideal organizations in the future, goals and targets. As a result of the Leyword co-occurrence analysis, vision includes the content of 'high-tech medical treatment', 'special care for patients', 'leading education and research', 'the highest trust with customer', 'creative talents training'. -astly, the high frequency word-pairs in core values are 'social distribution', 'innovation pursuit', 'cooperation and harmony', and it defines standards of behavior for organizations. Practical Implication: To correct the problems of vision, mission and core values from findings, firstly, it needs for Korean tertiary general hospitals to use the words that can explain organization's identity and differentiate others in their mission. Secondly, considering strengthening the role of hospitals in their community and the importance of members in organizations, it is necessary to establish vision with considering community and members to activate vision effectively. Thirdly, because there are no specific guidelines of establishing mission, vision and core values for healthcare organizations, this research concepts and results could be utilized when other organizations establish mission, vision and core values.

  • PDF

Unstructured Data Analysis and Multi-pattern Storage Technique for Traffic Information Inference (교통정보 추론을 위한 비정형데이터 분석과 다중패턴저장 기법)

  • Kim, Yonghoon;Kim, Booil;Chung, Mokdong
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.2
    • /
    • pp.211-223
    • /
    • 2018
  • To understand the meaning of data is a common goal of research on unstructured data. Among these unstructured data, there are difficulties in analyzing the meaning of unstructured data related to corpus and sentences. In the existing researches, the researchers used LSA to select sentences with the most similar meaning to specific words of the sentences. However, it is problematic to examine many sentences continuously. In order to solve unstructured data classification problem, several search sites are available to classify the frequency of words and to serve to users. In this paper, we propose a method of classifying documents by using the frequency of similar words, and the frequency of non-relevant words to be applied as weights, and storing them in terms of a multi-pattern storage. We use Tensorflow's Softmax to the nearby sentences for machine learning, and utilize it for unstructured data analysis and the inference of traffic information.

A Study on the Perception of Metaverse Fashion Using Big Data Analysis

  • Hosun Lim
    • Fashion & Textile Research Journal
    • /
    • v.25 no.1
    • /
    • pp.72-81
    • /
    • 2023
  • As changes in social and economic paradigms are accelerating, and non-contact has become the new normal due to the COVID-19 pandemic, metaverse services that build societies in online activities and virtual reality are spreading rapidly. This study analyzes the perception and trend of metaverse fashion using big data. TEXTOM was used to extract metaverse and fashion-related words from Naver and Google and analyze their frequency and importance. Additionally, structural equivalence analysis based on the derived main words was conducted to identify the perception and trend of metaverse fashion. The following results were obtained: First, term frequency(TF) analysis revealed the most frequently appearing words were "metaverse," "fashion," "virtual," "brand," "platform," "digital," "world," "Zepeto," "company," and "game." After analyzing TF-inverse document frequency(TF-IDF), "virtual" was the most important, followed by "brand," "platform," "Zepeto," "digital," "world," "industry," "game," "fashion show," and "industry." "Metaverse" and "fashion" were found to have a high TF but low TF-IDF. Further, words such as "virtual," "brand," "platform," "Zepeto," and "digital" had a higher TF-IDF ranking than TF, indicating that they had high importance in the text. Second, convergence of iterated correlations analysis using UNICET revealed four clusters, classified as "virtual world," "metaverse distribution platform," "fashion contents technology investment," and "metaverse fashion week." Fashion brands are hosting virtual fashion shows and stores on metaverse platforms where the virtual and real worlds coexist, and investment in developing metaverse-related technologies is under way.

Appearance Frequency of 'Eco-Friendly' Emotion and Sensibility Words and their Changes (친환경 감성 어휘의 종류별 사용빈도 및 변화 양상)

  • Na, Young-Joo
    • Science of Emotion and Sensibility
    • /
    • v.14 no.2
    • /
    • pp.207-220
    • /
    • 2011
  • The purpose of this study is to investigate sensibility words related with eco-friendly in the two media fashion magazines and internet newspapers and to analysis their appearance frequency and changes by the year through 1999~2010. Most frequently used words are 'nature, eco, cotton, natural fiber, health, fresh, clear, preservation, harmony, com fiber, and Lohas'. The words are divided in 4 groups: 'Nature/Environment, Material/Fiber, Human, and Adjectives/Micell'. A point of appearing time is analyzed: 'ecology, memory-shape material, organic, spa' were used before 2000, 'nature environment, eco-friendly, stretch material, wellbeing, substitute, recycling' were in 2000-2001, 'smart material, eco material, green' in 2002-2003, 'coolbiz, Lohas, natural dye' in 2004-2005, 'herb medicine, sustainable, warmbiz' in 2006-2007, 'greensumer, greenlife, solar energy, forest bath' in 2008-2009. Looking into their changes, in early 2000, the words of eco-friendly emotion and sensibility had appeared frequently relatively, but later on they decreased, and again recently increased showing highest appearing frequency. 'Nature/Environment' words have appeared recently very much, while 'Human' sensibility words have not changed much or decreased a little. 'Adjective/Micell' words has increased little bit recently. 'Material/Fiber' words showed decrease at fashion magazine, while they increased at the pages of internet news.

  • PDF