• Title/Summary/Keyword: Python 3

Search Result 226, Processing Time 0.027 seconds

Changes in Consumer Perception of One Mile-Wear and Home Wear: The Impact of Covid-19 Outbreak (원마일웨어와 홈웨어에 대한 소비자 인식 변화: 코로나19 발생의 영향)

  • Choi, Yeong-Hyeon;Lee, Kyu-Hye
    • Journal of Fashion Business
    • /
    • v.25 no.2
    • /
    • pp.110-126
    • /
    • 2021
  • This study aims to explore consumers' perception regarding "one-mile wear" and "home wear" fashion, an emerging trend during the Coronavirus disease (COVID-19) pandemic, and to identify the changes in consumers' perception of this style before and after the pandemic. The data collection period was set as one year before and after the outbreak as of January 1, 2020, and blog posts with keywords "one-mile wear" and "home wear" were collected. Further, textual data crawled and refined using Python 3.7 libraries, and centralities were measured and visualized through NodeXL 1.0.1 and Ucinet 6. According to the results, first, consumers' perception regarding one-mile wear fashion was divided into the following eight categories: wearing situation, expected attribute, style, item, color, textile, shape, and target wearer. Second, before the pandemic, home wear was recognized as pajamas or indoor wear; after the pandemic, home wear was recognized as one-mile wear, outdoor wear, and daily wear. Moreover, keywords, such as "telecommuting", "social distancing", "untact", and "upper body", appeared after the pandemic. It was confirmed that consumers' perception of home wear was affected by the pandemic.

Comparative Analysis of Vectorization Techniques in Electronic Medical Records Classification (의무 기록 문서 분류를 위한 자연어 처리에서 최적의 벡터화 방법에 대한 비교 분석)

  • Yoo, Sung Lim
    • Journal of Biomedical Engineering Research
    • /
    • v.43 no.2
    • /
    • pp.109-115
    • /
    • 2022
  • Purpose: Medical records classification using vectorization techniques plays an important role in natural language processing. The purpose of this study was to investigate proper vectorization techniques for electronic medical records classification. Material and methods: 403 electronic medical documents were extracted retrospectively and classified using the cosine similarity calculated by Scikit-learn (Python module for machine learning) in Jupyter Notebook. Vectors for medical documents were produced by three different vectorization techniques (TF-IDF, latent sematic analysis and Word2Vec) and the classification precisions for three vectorization techniques were evaluated. The Kruskal-Wallis test was used to determine if there was a significant difference among three vectorization techniques. Results: 403 medical documents were relevant to 41 different diseases and the average number of documents per diagnosis was 9.83 (standard deviation=3.46). The classification precisions for three vectorization techniques were 0.78 (TF-IDF), 0.87 (LSA) and 0.79 (Word2Vec). There was a statistically significant difference among three vectorization techniques. Conclusions: The results suggest that removing irrelevant information (LSA) is more efficient vectorization technique than modifying weights of vectorization models (TF-IDF, Word2Vec) for medical documents classification.

A Study on Leadership Trends from the Perspective of Domestic Researcher's Using BERTopic and LDA

  • Sung-Su, SHIN;Hoe-Chang, Yang
    • East Asian Journal of Business Economics (EAJBE)
    • /
    • v.11 no.1
    • /
    • pp.53-71
    • /
    • 2023
  • Purpose - This study aims to find clues necessary for the direction of leadership development suitable for the current situation by exploring the direction in which leadership has been studied from the perspective of domestic researchers, along with the arrangement of leadership theories studied in various ways. Research design, data, and methodology - A total of 7,425 papers were obtained due to the search, and 5,810 papers with English abstracts were used for analysis. For analysis, word frequency analysis, word clouding, and co-occurrence were confirmed using Python 3.7. In addition, after classifying topics related to research trends through BERTopic and LDA, trends were identified through dynamic topic modeling and OLS regression analysis. Result - As a result of the BERTopic, 14 topics such as 'Leadership management and performance' and 'Sports leadership' were derived. As a result of conducting LDA on 1,976 outliers, five topics were derived. As a result of trend analysis on topics by year, it was confirmed that five topics, such as 'military police leadership' received relative attention. Conclusion - Through the results of this study, a study on the reinterpretation of past leadership studies, a study on LMX with an expanded perspective, and a study on integrated leadership sub-factors of modern leadership theory were proposed.

Online Shopping Research Trend Analysis Using BERTopic and LDA

  • Yoon-Hwang, JU;Woo-Ryeong, YANG;Hoe-Chang, YANG
    • The Journal of Economics, Marketing and Management
    • /
    • v.11 no.1
    • /
    • pp.21-30
    • /
    • 2023
  • Purpose: As one of the ongoing studies on the distribution industry, the purpose of this study is to identify the research trends on online shopping so far to propose not only the development of online shopping companies but also the possibility of coexistence between online and offline retailers and the development of the distribution industry. Research design, data and methodology: In this study, the English abstracts of 645 papers on online shopping registered in scienceON were obtained. For the analysis through BERTopic and LDA using Python 3.7 and identifying which topics were interesting to researchers. Results: As a result of word frequency analysis and co-occurrence analysis, it was found that studies related to online shopping were frequently conducted on factors such as products, services, and shopping malls. As a result of BERTopic, five topics such as 'service quality' and 'sales strategy' were derived, and as a result of LDA, three topics including 'purchase experience' were derived. It was confirmed that 'Customer Recommendation' and 'Fashion Mall' showed relatively high interest, and 'Sales Strategy' showed relatively low interest. Conclusions: It was suggested that more diverse studies related to the online shopping mall platform, sales content, and usage influencing factors are needed to develop the online shopping industry.

Exploring Depression Research Trends Using BERTopic and LDA

  • Woo-Ryeong, YANG;Hoe-Chang, YANG
    • The Korean Journal of Food & Health Convergence
    • /
    • v.9 no.1
    • /
    • pp.19-28
    • /
    • 2023
  • The purpose of this study is to explore which areas have been more interested in depression research in Korea through analysis of academic papers related to depression, and then to provide insights that can solve future depression problems. 1,032 papers searched with the keyword "depression" in scienceON were analyzed using Python 3.7 for word frequency analysis, word co-occurrence analysis, BERTopic, LDA, and OLS regression analysis. The results of word frequency and co-occurrence frequency analysis showed that related words were composed around words such as patient, disorder and symptom. As a result of topic modeling, a total of 13 topics including 'childhood depression' and 'eating anxiety' were derived. And it has been identified as a topic of interest that 'suicidal thoughts', 'treatment', 'occupational health', and 'health treatment program' were statistically significant topics, while 'child depression' and 'female treatment' were relatively less. As a result of the analysis of research trends, future research will not only study physiological and psychological factors but also social and environmental causes, as well as it was suggested that various collaborative studies of experts in academia were needed such as convergence and complex perspectives for depression relief and treatment.

Development of AI-Surrogate model for climate stress test (기후 스트레스 테스트를 위한 AI-Surrogate 모형 개발)

  • Tae Hyeong Kim;Boo Sik Kang
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.99-99
    • /
    • 2023
  • 기후변화는 물 관리의 가장 큰 리스크 요인이므로 물 관리 계획을 수립하는 과정에서 기후변화의 영향을 고려하는 것이 필수적이다. 기후변화에 대한 수자원 예측 관련 연구가 이루어지고 있으나, 대부분의 연구에는 수문학적 모델링이나 시뮬레이션이 동반되는데, 이 과정에는 시간과 비용이 많이 들어가며, 지역이나 연구목적에 따른 정밀한 매개변수의 보정은 전문지식이 필요하기 때문에 현업에서 연구결과를 의사결정에 활용하기에는 한계가 있다고 볼 수 있다. 이에 따라 수문학적 모델링의 입력 및 출력 결과를 딥러닝의 학습자료로 하여 수문모델을 사용하지 않아도 효율적으로 결과를 도출할 수 있는 딥러닝 기반 Surrogate 모형에 대한 연구가 이루어지고 있으나 수자원 분야에 접목된 사례는 부재한 실정이다. 따라서 이 연구를 통해 국내 유역을 대상으로 Surrogate 모형을 구축한 뒤, 그 성능을 평가하고자 한다. 이를 위한 Surrogate 모형 구축 과정은 다음과 같다. 충주댐 유역을 대상으로 과거 20년간의 강우 및 기온 자료를 수집한 뒤, 이 자료를 바탕으로 기후변화의 영향을 고려한 3,162개의 시나리오를 생성한다. 그 후 장기유출모형 IHACRES에 생성된 시나리오를 입력자료로 하여 유입량 결과를 도출하고, 이 결과를 Python코드 기반의 딥러닝 학습자료로 하여 최적 예측 결과를 도출해내는 Surrogate 모형을 생성한 뒤 기존 장기유출모형과의 성능을 비교하고자 한다. 이와 같은 Surrogate 모형은 추가적인 데이터와 매개변수의 보정 과정이 없어도 장기유출모형과 같은 결과를 짧은 시간내에 상당히 정확하게 모사할 수 있어 시간과 비용을 줄일 수 있으며, 비전문가도 쉽게 사용할 수 있다는 장점을 가진다.

  • PDF

Crop Yield and Crop Production Predictions using Machine Learning

  • Divya Goel;Payal Gulati
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.9
    • /
    • pp.17-28
    • /
    • 2023
  • Today Agriculture segment is a significant supporter of Indian economy as it represents 18% of India's Gross Domestic Product (GDP) and it gives work to half of the nation's work power. Farming segment are required to satisfy the expanding need of food because of increasing populace. Therefore, to cater the ever-increasing needs of people of nation yield prediction is done at prior. The farmers are also benefited from yield prediction as it will assist the farmers to predict the yield of crop prior to cultivating. There are various parameters that affect the yield of crop like rainfall, temperature, fertilizers, ph level and other atmospheric conditions. Thus, considering these factors the yield of crop is thus hard to predict and becomes a challenging task. Thus, motivated this work as in this work dataset of different states producing different crops in different seasons is prepared; which was further pre-processed and there after machine learning techniques Gradient Boosting Regressor, Random Forest Regressor, Decision Tree Regressor, Ridge Regression, Polynomial Regression, Linear Regression are applied and their results are compared using python programming.

Research on Ways to Revitalize Traditional Markets by Exploring Research Trends (연구동향 탐색을 통한 전통시장 활성화 방안 연구)

  • Choon-Ho LEE;Hoe-Chang YANG
    • The Journal of Economics, Marketing and Management
    • /
    • v.11 no.4
    • /
    • pp.53-63
    • /
    • 2023
  • Purpose: The purpose of this study is to examine the research trends in the papers published by Korean researchers related to traditional markets, to check what topics have been studied, and to make various suggestions for research directions and effective ways to revitalize traditional markets. Research design, data and methodology: To this end, this study conducted word frequency analysis, co-occurrence frequency analysis, BERTopic, LDA, dynamic topic modeling and OLS regression analysis using Python 3.7 on the English abstracts of a total of 502 papers extracted through ScienceON. Results: As a result of word frequency analysis and co-occurrence frequency analysis, it was found that studies related to traditional markets have been conducted not only on factors related to customers, but also on traditional market merchants and government policies, and the degree of service, quality, and satisfaction perceived by customers using traditional markets. Through BERTopic and LDA, three topics such as 'Traditional market safety management' were identified, and among them, it was found that 'Traditional market safety management' is relatively less attention by researchers. Conclusions: The results of this study suggest that future research on the revitalization of traditional markets should be conducted from a specific consulting perspective along with the establishment of various data, a causal model study from various perspectives such as the characteristics of merchants as well as consumers, and an integrated and convergent approach to policy formulation by the government and local governments.

MSRP Prediction System Utilizing KERAS and DNN (Keras와 DNN을 이용한 자동차 MSRP 예측 시스템)

  • Kang, Jiwon;Yun, Hyonbin;Lee, Sanghyun;Choi, Hyunho;Moon, Yoo-Jin
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.01a
    • /
    • pp.355-356
    • /
    • 2021
  • 본 논문에서는 Python 3의 Keras 모듈을 이용하여 특정 자동차에 대한 최적의 판매자권장소비자가격(MSRP)을 예측하는 시스템을 제안한다. 이 시스템은 2004년에 미국에서 시판된 428종류의 자동차에 대한 정보를 제조사, 차종, 생산지, 엔진 크기, 실린더 수, 시내 주행 시 연비, 고속도로 주행 시 연비, 마력, 차체 무게, 차체 길이의 독립변수를 사용하여 자체적으로 딥러닝한 회귀모델을 통해 특정 지표가 주어진 차량에 대해 종속변수인 판매자권장소비자가격을 예측한다. Optimizer를 adam으로, 학습률을 0.005으로 설정한 경우의 검증 MAE 값이 3842.98로 가장 낮게 산출되었고, 해당 모델의 결과는 예측값과 실제값의 오차율이 ±15% 정도 내외로 예측된 표본의 비율이 약 80.14%로 측정되었다. 위 모델은 향후 신차 가격 결정 및 중고차 시장에서 구매, 판매 결정을 돕는 등 특정 시장 내에서 다양한 자동차의 가치를 판단하기에 유용할 것으로 전망된다.

  • PDF

Prediction of the Mechanical Properties of Additively Manufactured Continuous Fiber-Reinforced Composites (적층제조 연속섬유강화 고분자 복합재료의 물성 예측)

  • P. Kahhal;H. Ghorbani-Menghari;H. T. Kim;J. H. Kim
    • Transactions of Materials Processing
    • /
    • v.32 no.1
    • /
    • pp.28-34
    • /
    • 2023
  • In this research, a representative volume element (RVE)-based FE Model is presented to estimate the mechanical properties of additively manufactured continuous fiber-reinforced composites with different fiber orientations. To construct the model, an ABAQUS Python script has been implemented to produce matrix and fiber in the desired orientations at the RVE. A script has also been developed to apply the periodic boundary conditions to the RVE. Experimental tests were conducted to validate the numerical models. Tensile specimens with the fiber directions aligned in the 0, 45, and 90 degrees to the loading direction were manufactured using a continuous fiber 3D printer and tensile tests were performed in the three directions. Tensile tests were also simulated using the RVE models. The predicted Young's moduli compared well with the measurements: the Young's modulus prediction accuracy values were 83.73, 97.70, and 92.92 percent for the specimens in the 0, 45, and 90 degrees, respectively. The proposed method with periodic boundary conditions precisely evaluated the elastic properties of additively manufactured continuous fiber-reinforced composites with complex microstructures.