• Title/Summary/Keyword: Natural language process

Search Result 252, Processing Time 0.029 seconds

Leveraging LLMs for Corporate Data Analysis: Employee Turnover Prediction with ChatGPT (대형 언어 모델을 활용한 기업데이터 분석: ChatGPT를 활용한 직원 이직 예측)

  • Sungmin Kim;Jee Yong Chung
    • Knowledge Management Research / v.25 no.2 / pp.19-47 / 2024
  • Organizational ability to analyze and utilize data plays an important role in knowledge management and decision-making. This study aims to investigate the potential application of large language models in corporate data analysis. Focusing on the field of human resources, the research examines the data analysis capabilities of these models. Using the widely studied IBM HR dataset, the study reproduces machine learning-based employee turnover prediction analyses from previous research through ChatGPT and compares its predictive performance. Unlike past research methods that required advanced programming skills, ChatGPT-based machine learning data analysis, conducted through the analyst's natural language requests, offers the advantages of being much easier and faster. Moreover, its prediction accuracy was found to be competitive compared to previous studies. This suggests that large language models could serve as effective and practical alternatives in the field of corporate data analysis, which has traditionally demanded advanced programming capabilities. Furthermore, this approach is expected to contribute to the popularization of data analysis and the spread of data-driven decision-making (DDDM). The prompts used during the data analysis process and the program code generated by ChatGPT are also included in the appendix for verification, providing a foundation for future data analysis research using large language models.

Metadata extraction using AI and advanced metadata research for web services (AI를 활용한 메타데이터 추출 및 웹서비스용 메타데이터 고도화 연구)

  • Sung Hwan Park
    • The Journal of the Convergence on Culture Technology / v.10 no.2 / pp.499-503 / 2024
  • Broadcast programs are delivered not only over the broadcaster's own channels but also through Internet replay, OTT, and IPTV services. For these services, it is very important to provide search keywords that represent the characteristics of the content well. Broadcasters mainly enter keywords manually during production and archiving. This method yields too few keywords to secure core metadata and limits content recommendation and reuse in other media services. This study supports securing a large amount of metadata by using closed caption data pre-archived through the DTV closed captioning server developed at EBS. First, core metadata was automatically extracted by applying Google's natural language AI technology. Next, as its core contribution, the study proposes a method of finding core metadata that reflects priorities and content characteristics. To obtain differentiated metadata weights, importance was classified by applying the TF-IDF calculation method. The experiment successfully produced weight data. Combined with future string similarity measurement studies, the string metadata obtained in this study can become the basis for securing sophisticated content recommendation metadata for content services provided to other media.
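
The TF-IDF weighting mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not give the study's exact formula or tokenizer, so the version here assumes the textbook form tf × log(N / df) and plain whitespace tokenization; the function names `tfidf_weights` and `top_keywords` and the sample captions are hypothetical.

```python
import math
from collections import Counter


def tfidf_weights(documents):
    """Return one {term: weight} dict per document using tf * log(N / df).

    Assumption: whitespace tokenization and the textbook TF-IDF form;
    the study's actual variant is not stated in the abstract.
    """
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


def top_keywords(weights, k=3):
    """Pick the k highest-weighted terms, breaking ties alphabetically."""
    ranked = sorted(weights.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


if __name__ == "__main__":
    captions = [
        "penguin colony in antarctica",
        "penguin documentary for children",
        "math lecture for children",
    ]
    for doc_weights in tfidf_weights(captions):
        print(top_keywords(doc_weights))
```

Terms that occur in every caption file receive a weight of zero under this formula, which is why TF-IDF tends to surface program-specific words rather than common ones.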

A Study on the Process Form Generation and Expressive Characteristic by Storytelling in BIG's Architecture (BIG의 건축에서 나타나는 스토리텔링에 의한 형태생성 프로세스와 표현 특성에 관한 연구)

  • Kim, Jong-Sung;Kim, Kai-Chun
    • Korean Institute of Interior Design Journal / v.24 no.6 / pp.79-86 / 2015
  • This study began from an interest in Bjarke Ingels, a creative and popular emerging architect. Recently, the architecture field has given architects a foundation for expressing the process of creating new forms through various new expressive languages, design concepts, and methods. The global Danish group BIG (Bjarke Ingels Group) develops stories through its distinctive architectural language. Storytelling is used in various fields, and the tool called 'story' is settling in as an important element of human life. Bjarke Ingels, who leads BIG, aims at form expression through scientific analysis and adaptation, influenced by his Danish regional background and OMA. The group creates forms that share stories with local community members by visually simplifying the region, culture, environment, social phenomena, economy, and politics, which are invisible and formless in modern society. The elements and expressive features of space storytelling include locality, culture, natural environment, and connectivity, which form the content structure (story) that lets the main agent intervene in the story and imagine a new space. The expressive elements include the viewing circulation story of successive, hierarchical, and organic structures, which are constructive elements that create various spaces through the mixture, transmutability, and relocation of the program and lead users into the space. The case analysis of BIG shows that space storytelling is composed of symbolism, community, and eco-friendliness, which appear in diverse ways. The significance of this study is that it derived methods and features observed in many contemporary architects from the storytelling viewpoint in the form-creating process, classified the form-creating process through a new storytelling type, and showed the possibility of developing various methodologies.

A Study on the Continuous Speech Recognition for the Automatic Creation of International Phonetics (국제 음소의 자동 생성을 활용한 연속음성인식에 관한 연구)

  • Kim, Suk-Dong;Hong, Seong-Soo;Shin, Chwa-Cheul;Woo, In-Sung;Kang, Heung-Soon
    • Journal of Korea Game Society / v.7 no.2 / pp.83-90 / 2007
  • One result of the trend towards globalization is an increased number of projects that focus on natural language processing. Automatic speech recognition (ASR) technologies, for example, hold great promise in facilitating global communications and collaborations. Unfortunately, to date, most research projects focus on single widely spoken languages. Therefore, the cost to adapt a particular ASR tool for use with other languages is often prohibitive. This work takes a more general approach. We propose an International Phoneticizing Engine (IPE) that interprets input files supplied in our Phonetic Language Identity (PLI) format to build a dictionary. IPE is language independent and rule based. It operates by decomposing the dictionary creation process into a set of well-defined steps. These steps reduce rule conflicts, allow for rule creation by people without linguistics training, and optimize run-time efficiency. Dictionaries created by the IPE can be used with a speech recognition system. IPE defines an easy-to-use systematic approach that achieved recognition rates of 92.55% for Korean speech and 89.93% for English.

A Study on the Dataset of the Korean Multi-class Emotion Analysis in Radio Listeners' Messages (라디오 청취자 문자 사연을 활용한 한국어 다중 감정 분석용 데이터셋연구)

  • Jaeah, Lee;Gooman, Park
    • Journal of Broadcast Engineering / v.27 no.6 / pp.940-943 / 2022
  • This study aims to analyze a Korean dataset by performing Emotion Analysis of Korean sentences on radio listeners' text messages that the authors collected themselves. Research on the Emotion Analysis of Korean sentences is currently being pursued in various ways in Korea. However, high accuracy is difficult to expect because of the linguistic characteristics of Korean. In addition, much research has been done on Binary Sentiment Analysis, which allows only positive/negative classification, but Multi-class Emotion Analysis, which classifies text into three or more emotions, requires more research. It is therefore necessary to examine and analyze Korean datasets in order to increase the accuracy of Multi-class Emotion Analysis for Korean. In this paper, we analyze why Korean Emotion Analysis is difficult, based on surveys and experiments conducted during the Emotion Analysis process, and propose a method for creating a dataset that can improve accuracy and serve as a basis for Emotion Analysis of Korean sentences.

Development of Rule for Quality Checking Items to Raise Quality of BIM Model -Focusing on the Domestic BIM Guidelines- (BIM 모델의 완성도를 높이기 위한 품질검토항목의 룰 개발 - 국내 BIM 지침을 중심으로 -)

  • Song, Jong-Kwan;Ju, Ki-Beom
    • Korean Journal of Construction Engineering and Management / v.14 no.5 / pp.131-143 / 2013
  • Although BIM guidelines provide basic modeling criteria, the criteria for applying the guidelines differ among project participants and depend on the purpose for which the BIM model is used. It is therefore important to check compliance with the guidelines in order to raise the quality of the BIM model. However, Quality Checking (QC) items and methods for BIM models built in accordance with the guidelines have not been provided. This study proposes QC items and Rule Specifications (RS) for automatic QC. First, QC items were derived by analyzing domestic BIM guidelines, and the natural-language items were structured using flowcharts and pseudocode. RS was then proposed by combining the two. Finally, an RS-based case study was conducted by implementing an automatic QC process with Solibri Model Checker™. This study will contribute to improving the design quality and completeness of BIM models, which contain large amounts of three-dimensional data. Furthermore, BIM guidelines need to be developed according to use case, and detailed processes and standards for QC of BIM models need to be provided.

Hate Speech Detection Using Modified Principal Component Analysis and Enhanced Convolution Neural Network on Twitter Dataset

  • Majed, Alowaidi
    • International Journal of Computer Science & Network Security / v.23 no.1 / pp.112-119 / 2023
  • Traditionally used for networking computers and communications, the Internet has been evolving from the beginning. The Internet is the backbone for many things on the web, including social media. The concept of social networking, which started in the early 1990s, has also been growing with the Internet. Social Networking Sites (SNSs) sprang up and have remained an important element of Internet usage, mainly because of the services or provisions they offer on the web. Twitter and Facebook have become the primary means by which most individuals keep in touch with others and carry on substantive conversations. These sites allow users to post photos and videos and support audio and video storage, which can be shared among users. Although attractive, these provisions have also led to problems for the sites, such as the posting of offensive material. Though not always, users of SNSs play a part in promoting hate through their words or speech, which is difficult to curtail once uploaded. Hence, this article outlines a process for extracting user reviews from the Twitter corpus in order to identify instances of hate speech. Through the use of MPCA (Modified Principal Component Analysis) and ECNN (Enhanced Convolutional Neural Network), we are able to identify instances of hate speech in the text. With the use of NLP, a fully autonomous system for assessing syntax and meaning can be established. There is a strong emphasis on pre-processing, feature extraction, and classification. Normalization cleanses the text by removing extra spaces, punctuation, and stop words. The processed text is then used for feature extraction, which is performed with the MPCA algorithm. It takes a set of related features and pulls out the ones that tell us the most about the dataset we give it. The proposed categorization method is then put forth as a means of detecting instances of hate speech or abusive language. It is argued that ECNN is superior to other methods for identifying hateful content online. It can take in massive amounts of data and quickly return accurate results, especially for larger datasets. As a result, the proposed MPCA+ECNN algorithm improves not only the F-measure values, but also the accuracy, precision, and recall.
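
The normalization step this abstract describes (removing extra spaces, punctuation, and stop words) might look roughly like the sketch below. It is an illustration only, not the paper's implementation: the stop-word list is a small hypothetical sample, lowercasing is an added assumption, and the function name `normalize` is made up for this example.

```python
import string

# Hypothetical, deliberately tiny stop-word list; a real system would use a
# fuller list chosen for the corpus.
STOP_WORDS = frozenset({"a", "an", "the", "is", "are", "to", "of", "and", "in", "on"})

# Translation table that deletes every ASCII punctuation character.
_PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def normalize(text, stop_words=STOP_WORDS):
    """Return the cleaned tokens of one tweet-like string.

    Steps: lowercase (assumption, not stated in the abstract), strip
    punctuation, collapse whitespace, and drop stop words.
    """
    lowered = text.lower()
    no_punct = lowered.translate(_PUNCT_TABLE)
    tokens = no_punct.split()  # split() with no argument collapses extra spaces
    return [token for token in tokens if token not in stop_words]


if __name__ == "__main__":
    print(normalize("The  weather is   GREAT, today!!"))
```

The resulting token lists would then be turned into numeric features before any dimensionality reduction or classification; those later stages (MPCA and ECNN) are specific to the paper and are not reproduced here.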

Design and Implementation of a Subjective-type Evaluation System Using Natural Language Processing Technique (유의어 사전을 이용한 주관식 문제 채점 시스템 설계 및 구현)

  • Park, HeeJung;Kang, WonSeog
    • The Journal of Korean Association of Computer Education / v.6 no.3 / pp.207-216 / 2003
  • Instructors generally rely on objective-type evaluation for grading. Subjective-type evaluation has the merit that it can assess higher-level cognitive ability, but it raises problems of objectivity and reliability. This paper proposes a model for grading subjective-type questions, and designs and implements an evaluation system using a synonym thesaurus. This system can process diverse and wide-ranging subjective-type questions and is easy for beginners to use. It can also reduce the time and effort required for evaluation and improve its objectivity. The system achieves a 73% success rate. We expect that this system will become a basis for research on subjective-type evaluation.
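
One way to picture thesaurus-based grading of a free-text answer is the sketch below. It is not the paper's model, which the abstract does not detail; it assumes a simple scheme in which each required keyword counts as matched if the answer contains the keyword itself or any listed synonym. The function name `grade_answer` and the sample thesaurus are hypothetical.

```python
def grade_answer(answer, required_keywords, thesaurus):
    """Return the fraction of required keywords covered by the answer.

    Assumption: a keyword is covered when the answer contains it or one of
    its synonyms from `thesaurus` (a dict mapping keyword -> iterable of
    synonyms). Tokenization is plain whitespace splitting.
    """
    if not required_keywords:
        return 0.0

    tokens = set(answer.lower().split())
    matched = 0
    for keyword in required_keywords:
        accepted = {keyword} | set(thesaurus.get(keyword, ()))
        if tokens & accepted:
            matched += 1
    return matched / len(required_keywords)


if __name__ == "__main__":
    # Hypothetical thesaurus entries for illustration only.
    sample_thesaurus = {"fast": {"quick", "rapid"}, "memory": {"ram"}}
    print(grade_answer("cache is quick ram", ["fast", "memory"], sample_thesaurus))
```

A score threshold, or per-keyword point values, could then map this fraction to a grade; how the original system does this is not stated in the abstract.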

Relational Logic Definition of Articles and Sentences in Korean Building Code for the Automated Building Permit System (인허가관련 설계품질검토 자동화를 위한 건축법규 문장 관계논리에 관한 연구)

  • Kim, Hyunjung;Lee, Jin-Kook
    • Korean Journal of Computational Design and Engineering / v.21 no.4 / pp.433-442 / 2016
  • This paper aims to define the relational logic between code articles as well as within atomic sentences in the Korean Building Code, as an intermediate research and development step toward the automated building permit system of Korea. The approach described in this paper enables software developers to identify the logical relations needed to compose KBimCode and its databases. KBimCode is a computer-readable form of Korean Building Code sentences based on a logic rule-based mechanism. Two types of relational logic definition are described in this paper. The first type is a logic definition of the relations between code sentences. Because of the complexity of the Korean Building Code structure, which consists of decrees, regulations, or ordinances, an intensive analysis of sentence relations was performed. Code sentences are related to each other by delegation or reference. The other type is a relational logic definition within a code sentence, based on the translated atomic sentence (TAS), which is an explicit form of the atomic sentence (AS). This analysis was performed because natural language has intrinsic ambiguity, which hinders interpretation of the embedded meaning of the Building Code. Thus, both analyses were conducted to capture the accurate meaning of building permit-related requirements as part of the logic rule-based mechanism.

A Study on Metonymy of the Image-Space and the Symbol - Focused on the image of the Gothic Cathedral in the Middle Age - (상징과 이미지공간의 환유 - 고딕 성당 건축의 이미지 -)

  • 김미옥;심희정
    • Korean Institute of Interior Design Journal / v.13 no.3 / pp.43-51 / 2004
  • This study examines image and symbol in Gothic architecture. First, the concept of the symbol and the effect of the image are essentially based on the idea that the activity of consciousness takes part in constituting the symbol. The image therefore represents the deepest content of human nature, spanning imagination and reality. Second, the concept of the symbol shown in Gothic architecture was associated with the medieval passion for order. The world, the cosmos, and history were revealed in Gothic architecture. In Gothic Image, Emile Male described the symbolism of the Catholic Church in this way. The Catholic Church was a mirror of nature, the institution, morality, and history, and this passion for order was a Christian order. In particular, E. Panofsky said that the scholastic method is visible in Gothic architecture. He asserted that it was specifically the dialectic method used in theology. That is, he explained the transformation of Gothic architecture as a process of thesis, antithesis, and synthesis. These symbolize light, magnitude, and clearness. Finally, the symbolic image is important in present-day Post-Modern architecture. Post-Modern architecture is associated with the public and with pluralism. It attempts to communicate as a symbolic language amid the diverse aspects of culture.