• 제목/요약/키워드: words frequency

검색결과 876건 처리시간 0.024초

Research Trends Analysis on ESG Using Unsupervised Learning

  • Woo-Ryeong YANG;Hoe-Chang YANG
    • 융합경영연구
    • /
    • 제11권3호
    • /
    • pp.47-66
    • /
    • 2023
  • Purpose: The purpose of this study is to identify research trends related to ESG by domestic and overseas researchers so far, and to present research directions and clues for the possibility of applying ESG to Korean companies in the future and ESG practice through comparison of derived topics. Research design, data and methodology: In this study, as of October 20, 2022, after searching for the keyword 'ESG' in 'scienceON', 341 domestic papers with English abstracts and 1,173 overseas papers were extracted. For analysis, word frequency analysis, word co-occurrence frequency analysis, BERTopic, LDA, and OLS regression analysis were performed to confirm trends for each topic using Python 3.7. Results: As a result of word frequency analysis, It was found that words such as management, company, performance, and value were commonly used in both domestic and overseas papers. In domestic papers, words such as activity and responsibility, and in overseas papers, words such as sustainability, impact, and development were included in the top 20 words. As a result of analyzing the co-occurrence frequency of words, it was confirmed that domestic papers were related mainly to words such as company, management, and activity, and overseas papers were related to words such as investment, sustainability, and performance. As a result of topic modeling, 3 topics such as named ESG from the corporate perspective were derived for domestic papers, and a total of 7 topics such as named sustainable investment for overseas papers were derived. As a result of the annual trend analysis, each topic did not show a relatively increasing or decreasing tendency, confirming that all topics were neutral. Conclusions: The results of this study confirmed that although it is desirable that domestic papers have recently started research on consumers, the subject diversity is lower than that of overseas papers. Therefore, it is suggested that future research needs to approach various topics such as forecasting future risks related to ESG and corporate evaluation methods.

'슬기로운 생활'에 수록된 물리 영역 과학 용어 분석 (Analyzing the Science Words of Physics in 'Wise Life' Textbooks)

  • 윤은정;박윤배
    • 한국초등과학교육학회지:초등과학교육
    • /
    • 제32권2호
    • /
    • pp.127-138
    • /
    • 2013
  • The purpose of this study was to select the basic words of physics for science education which were learned through everyday life or school education and be foundation of learning science. For this, we collected all words in the 'Wise Life' textbooks by 7th and 2007 National Curriculum, and extract the science words. As a result, there were 8,970 words in 8 textbooks of 'Wise Life', and about 18% of them, 1,585 words, were science words. There were 266 kinds of science words and most of them were biology words. And the textbooks by 2007 National Curriculum had more science words than by 7th's. Finally we selected 24 basic words of science only in the physics area by comprehensively considering difficulty, need and frequency.

Intonational Pattern Frequency of Seoul Korean and Its Implication to Word Segmentation

  • Kim, Sa-Hyang
    • 음성과학
    • /
    • 제15권2호
    • /
    • pp.21-30
    • /
    • 2008
  • The current study investigated distributional properties of the Korean Accentual Phrase and their implication to word segmentation. The properties examined were the frequency of various AP tonal patterns, the types of tonal patterns that are imposed upon content words, and the average number and temporal location of content words within the AP. A total of 414 sentences from the Read speech corpus and the Radio corpus were used for the data analysis. The results showed that the 84% of the APs contained one content word, and that almost 90% of the content words are located in AP-initial position. When the AP-initial onset was not an aspirated or tense consonant, the most common AP patterns were LH, LHH, and LHLH (78%), and 88% of the multisyllabic content words start with a rising tone in AP-initial position. When the AP-initial onset was an aspirated or tense consonant, the most common AP patterns were HH, HHLH, and HHL (72%), and 74% of the multisyllabic content words start with a level H tone in AP-initial position. The data further showed that 84.1% of APs end with the final H tone. The findings provide valuable information about the prosodic pattern and structure of Korean APs, and account for the results of a previous study which showed that Korean listeners are sensitive to AP-initial rising and AP-final high tones (Kim, 2007). This is in line with other cross-linguistic research which has revealed the correlation between prosodic probability and speech processing strategy.

  • PDF

주택디자인에서 건축가들의 어휘 사용행태 및 기본어휘에 관한 연구 (A Study on the Lexicon-Use Behaviour of Architects & the Basic Lexicons in House Design)

  • 윤대한
    • 한국주거학회논문집
    • /
    • 제17권5호
    • /
    • pp.27-37
    • /
    • 2006
  • This paper analyzed statistically two corpora that were constructed from the texts about house designs written by Korean architects and PA Awards architects. The main results are as follows; (1) The numbers of words in Korean house-design corpus were 9,352 and those of words in PA Awards house design corpus were 2,379. The former were 18.7% and the latter 4.8% of about 50,000 words regarded as the rest using scale in actual life. (2) When the architects described their house designs, the lexicon-concentration phenomenon was pervasive in both groups. Therefore, we can infer that the high-frequency lexicons are very important in house design. (3) The architects' behaviour patterns of using the house-design lexicons, went by rules according to the word frequency order. The tendency formulas of them had the $R^{2}$ values which were more than 90%. (4) In Korean house design corpus, the high frequency lexicons were '공간', '층', '주택', '집', '대지', '거실', and '실'. In PA awards house design corpus, they were 'house','room','space','living','wall','level' and 'area'. From these results, We can tell that 'space' is the highest frequency word in house design of the two groups, and that '대지 ' and 'wall' are the words that reveal well the differences between the two groups.

Jsoup를 이용한 조선왕조실록의 빅 데이터 분석 (Big Data Analysis of the Annals of the Joseon Dynasty Using Jsoup)

  • 변영일;이충호
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2021년도 추계학술대회
    • /
    • pp.131-133
    • /
    • 2021
  • 조선왕조실록은 UNESCO에 등재된 중요한 기록물이다. 본 논문은 한글로 번역된 조선왕조 실록에서 단어의 빈도수를 조사하여 빅데이터를 분석하는 방법을 제안한다. 조선왕조 실록을 인터넷 사이트에서 액세스하여 단어의 빈도수를 조사하려 할 때, 그 페이지에 포함된 소스를 직접 액세스하면 HTML 문법에 필요한 키워드가 포함되어 있어 필요한 본문에서 단어 빈도수에 의한 빅데이터 분석을 하는 것이 어렵다. 본 논문에서는 Java의 Jsoup를 활용한 크롤링 기능을 사용하여 조선왕조 실록의 본문을 분석하는 방법을 제안한다. 실험에서는 조선왕조실록의 태조부분만을 추출하여 본 방법의 유효성을 검증하였다.

  • PDF

한국의 중남미 지역연구 네트워크와 중심성 및 무역과 경제에 대한 토픽 변동분석 (Network, Centrality, and Topic Analysis on Korea's Trade and Economy with Latin America and the Caribbean Area)

  • 이재득
    • 무역학회지
    • /
    • 제47권6호
    • /
    • pp.189-209
    • /
    • 2022
  • This study aims to analyze Latin America and the Caribbean papers published in Korea during the past 2000-2020 years. Through this study, it is possible to understand the main subject and direction of research in Korea's Latin America and the Caribbean area. As the research mythologies, this study uses the text mining and Social Network Analysis such as frequency analysis, several centrality analyses, and topic analysis. After analyzing the empirical results, there has been a tendency to change the key words and centrality coefficients between 2000-2010 and 2011-2020 years. During 2011-2020 years, the most frequent keywords were changed from Neoliberalism and culture to policy education, and economy related words. The degree and closeness centrality analyses appeared the higher frequency key words. However, the eigenvector centrality appeared very different from the order of frequency key words. The topic analysis shows that the culture, language, and Neoliberalism were the most important keywords during 2000-2010 years but economy, labor trade, industry, development became the most important keywords during 2011-2020 years in topics.

범주화 과제에서의 한글단어 빈도효과 (Hangul Word-Frequency in Semantic Categorization Task)

  • 조중열
    • 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
    • /
    • 한국정보과학회언어공학연구회 1999년도 제11회 한글 및 한국어 정보처리 학술대회
    • /
    • pp.351-358
    • /
    • 1999
  • 범주화과제를 사용한 두 실험에서 단어 빈도가 단어의 의미를 처리하는데 영향을 주는지를 알아보았다. 두 실험에서 사용된 자극은 두 글자의 한글이었는데, 실험 1에서는 사례와 목표자극은 두 번째 글자의 종성에서만 달랐고(예, 범주: 관직: 사례: 시장; 목표자극: 시작), 실험 2에서는 첫 번째 글자의 종성에서만 달랐다(예, 범주: 관직: 사례: 시장; 목표자극: 심장). 실험 1에서는 통제자극보다 저빈도 목표자극의 오반응이 더 많았고, 고빈도 사례의 반응시간이 더 길었다. 실험 2에서는 고빈도 사례-저빈도 목표자극 조건이 통제조건보다 반응시간이 더 길었다. 이 결과는 이중경로모형(Jared & Seidenberg, 1991)을 지지한다고 볼 수 있다. 이 결과들은 음운 정보와 시각 정보의 사용은 단어의 빈도에 의존하며, 특히 음운정보의 활성화는 필연적인 과정이 아니라 선택적인 것을 시사한다.

  • PDF

텍스트 마이닝 분석을 통한 수학교육 연구 동향 분석 (A Text Mining Analysis for Research Trend about the Mathematics Education)

  • 진미르;고호경
    • East Asian mathematical journal
    • /
    • 제35권4호
    • /
    • pp.489-508
    • /
    • 2019
  • In this paper we used text mining method to analyze journals of mathematics education posterior to the year of 2016. To figure out trends of mathematics education research. we analyzed the key words largely mentioned in the recent mathematics education journals by Term Frequency and Term Frequency-Inverse Document Frequency method. We also looked at how these keywords match up with the key words that appear of education to prepare for future society. This result can infer the characteristics of mathematics education research in the aspect upcoming research topics.

한글 두 글자 단어와 비단어의 어휘판단에 글자 빈도, 글자 유형, 받침이 미치는 영향: KLP 자료의 분석 (The Effect of Syllable Frequency, Syllable Type and Final Consonant on Hangeul Word and Pseudo-word Lexical Decision: An Analysis of the Korean Lexicon Project Database)

  • 신명석;박창호
    • 인지과학
    • /
    • 제34권4호
    • /
    • pp.277-297
    • /
    • 2023
  • 본 연구는 한국어 심성어휘 데이터베이스(KLP-DB)의 분석을 통해 글자 빈도, 글자의 모음 유형, 받침 유무 등 글자 수준 정보가 두 글자로 된 단어와 비단어의 어휘판단에 어떤 영향을 주는지를 알아보고자 하였다. 반응시간과 오반응률에 대한 위계적 회귀분석을 실시한 결과 단어의 어휘판단에는 단어빈도가 중대한 영향을 미치지만, 첫째 글자의 빈도, 첫째 글자와 둘째 글자의 모음 유형과 받침 유무와 같은 글자 속성이 영향을 미쳤고, 두 글자의 모음 유형의 조합 및 둘째 글자의 빈도와 받침 유무의 조합도 영향을 주었다. 비단어의 어휘판단에는 첫째 글자와 둘째 글자의 빈도, 첫째 글자의 모음 유형, 첫째 글자와 둘째 글자의 받침 유무와 같은 글자 속성이 영향을 미쳤고, 두 글자의 사용빈도의 조합, 모음 유형의 조합, 및 첫째 글자의 빈도와 받침의 조합도 영향을 주었다. 단어빈도는 단어의 어휘판단에서 강력한 영향을 미쳤으며, 글자속성은 단어보다 비단어의 판단에서 더 일관적인 영향을 미쳤다. 본 연구의 결과는 어휘판단과제에서 단어와 비단어 목록의 구성 및 반응시간의 해석에 글자 속성의 문제를 충분히 고려해야 함을 가리킨다. 글자 속성의 효과에 대한 이해는 단어 재인 과정의 이해에도 기여할 것이다.

한국어 시각단어재인에서 나타나는 이웃효과 (The Neighborhood Effect in Korean Visual Word Recognition)

  • 권유안;조혜숙;김충명;남기춘
    • 대한음성학회지:말소리
    • /
    • 제60호
    • /
    • pp.29-45
    • /
    • 2006
  • We investigated whether the first syllable plays an important role in lexical access in Korean visual word recognition. To do so, one lexical decision task (LDT) and two form primed LDT experiments examined the nature of the syllabic neighborhood effect. In Experiment 1, the syllabic neighborhood density and the syllabic neighborhood frequency was manipulated. The results showed that lexical decision latencies were only influenced by the syllabic neighborhood frequency. The purpose of experiment 2 was to confirm the results of experiment 1 with form-primed LDT task. The lexical decision latency was slower in form-related condition compared to form-unrelated condition. The effect of syllabic neighborhood density was significant only in form-related condition. This means that the first syllable plays an important role in the sub-lexical process. In Experiment 3, we conducted another form-primed LDT task manipulating the number of syllabic neighbors in words with higher frequency neighborhood. The interaction of syllabic neighborhood density and form relation was significant. This result confirmed that the words with higher frequency neighborhood are more inhibited by neighbors sharing the first syllable than words with no higher frequency neighborhood in the lexical level. These findings suggest that the first syllable is the unit of neighborhood and the unit of representation in sub-lexical representation is syllable in Korea.

  • PDF