• Title/Summary/Keyword: Language Models

Search Result 872, Processing Time 0.027 seconds

A Fuzzing Seed Generation Technique Using Natural Language Processing Model (자연어 처리 모델을 활용한 퍼징 시드 생성 기법)

  • Kim, DongYonug;Jeon, SangHoon;Ryu, MinSoo;Kim, Huy Kang
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.32 no.2
    • /
    • pp.417-437
    • /
    • 2022
  • The quality of the fuzzing seed file is one of the important factors to discover vulnerabilities faster. Although the prior seed generation paradigm, using dynamic taint analysis and symbolic execution techniques, enhanced fuzzing efficiency, the yare not extensively applied owing to their high complexity and need for expertise. This study proposed the DDRFuzz system, which creates seed files based on sequence-to-sequence models. We evaluated DDRFuzz on five open-source applications that used multimedia input files. Following experimental results, DDRFuzz showed the best performance compared with the state-of-the-art studies in terms of fuzzing efficiency.

Wave propagation analysis of the ball in the handball's game

  • Yongyong Wang;Qixia Jia;Tingting Deng;Mostafa Habibi;Sanaa Al-Kikani;H. Elhosiny Ali
    • Structural Engineering and Mechanics
    • /
    • v.85 no.6
    • /
    • pp.729-742
    • /
    • 2023
  • It is a recent attraction to the mechanical scientists to investigate state of wave propagation, buckling and vibration in the sport balls to observe the importance of different parameters on the performance of the players and the quality of game. Therefore, in the present study, we aim to investigate the wave propagation in handball game ball in term of mass of the ball and geometrical parameters wit incorporation of the viscoelastic effects of the ball material into account. In this regard, the ball is modeled using thick shell structure and classical elasticity models is utilized to obtain the equation of motion via Hamilton's principle. The displacement field of the ball model is obtained using first order shear deformation theory. The resultant equations are solved with the aid of generalized differential quadrature method. The results show that mass of the ball and viscoelastic coefficient have considerable influence on the state of wave propagation in the ball shell structure.

AI-BASED Monitoring Of New Plant Growth Management System Design

  • Seung-Ho Lee;Seung-Jung Shin
    • International journal of advanced smart convergence
    • /
    • v.12 no.3
    • /
    • pp.104-108
    • /
    • 2023
  • This paper deals with research on innovative systems using Python-based artificial intelligence technology in the field of plant growth monitoring. The importance of monitoring and analyzing the health status and growth environment of plants in real time contributes to improving the efficiency and quality of crop production. This paper proposes a method of processing and analyzing plant image data using computer vision and deep learning technologies. The system was implemented using Python language and the main deep learning framework, TensorFlow, PyTorch. A camera system that monitors plants in real time acquires image data and provides it as input to a deep neural network model. This model was used to determine the growth state of plants, the presence of pests, and nutritional status. The proposed system provides users with information on plant state changes in real time by providing monitoring results in the form of visual or notification. In addition, it is also used to predict future growth conditions or anomalies by building data analysis and prediction models based on the collected data. This paper is about the design and implementation of Python-based plant growth monitoring systems, data processing and analysis methods, and is expected to contribute to important research areas for improving plant production efficiency and reducing resource consumption.

Language Matters: A Systemic Functional Linguistics-Enhanced Machine Learning Framework for Cyberbullying Detection

  • Raghad Altowairgi;Ala Eshamwi;Lobna Hsairi
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.9
    • /
    • pp.192-198
    • /
    • 2023
  • Cyberbullying is a growing problem among adolescents and can have serious psychological and emotional consequences for the victims. In recent years, machine learning techniques have emerged as promising approach for detecting instances of cyberbullying in online communication. This research paper focuses on developing a machine learning models that are able to detect cyberbullying including support vector machines, naïve bayes, and random forests. The study uses a dataset of real-world examples of cyberbullying collected from Twitter and extracts features that represents the ideational metafunction, then evaluates the performance of each algorithm before and after considering the theory of systemic functional linguistics in terms of precision, recall, and F1-score. The result indicates that all three algorithms are effective at detecting cyberbullying with 92% for naïve bayes and an accuracy of 93% for both SVM and random forests. However, the study also highlights the challenges of accurately detecting cyberbullying, particularly given the nuanced and context-dependent nature of online communication. This paper concludes by discussing the implications of these findings for future research and the development of practical tool for cyberbullying prevention and intervention.

Development and Evaluation of Information Extraction Module for Postal Address Information (우편주소정보 추출모듈 개발 및 평가)

  • Shin, Hyunkyung;Kim, Hyunseok
    • Journal of Creative Information Culture
    • /
    • v.5 no.2
    • /
    • pp.145-156
    • /
    • 2019
  • In this study, we have developed and evaluated an information extracting module based on the named entity recognition technique. For the given purpose in this paper, the module was designed to apply to the problem dealing with extraction of postal address information from arbitrary documents without any prior knowledge on the document layout. From the perspective of information technique practice, our approach can be said as a probabilistic n-gram (bi- or tri-gram) method which is a generalized technique compared with a uni-gram based keyword matching. It is the main difference between our approach and the conventional methods adopted in natural language processing that applying sentence detection, tokenization, and POS tagging recursively rather than applying the models sequentially. The test results with approximately two thousands documents are presented at this paper.

Predicting Urban Tourism Flow with Tourism Digital Footprints Based on Deep Learning

  • Fangfang Gu;Keshen Jiang;Yu Ding;Xuexiu Fan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.4
    • /
    • pp.1162-1181
    • /
    • 2023
  • Tourism flow is not only the manifestation of tourists' special displacement change, but also an important driving mode of regional connection. It has been considered as one of significantly topics in many applications. The existing research on tourism flow prediction based on tourist number or statistical model is not in-depth enough or ignores the nonlinearity and complexity of tourism flow. In this paper, taking Nanjing as an example, we propose a prediction method of urban tourism flow based on deep learning methods using travel diaries of domestic tourists. Our proposed method can extract the spatio-temporal dependence relationship of tourism flow and further forecast the tourism flow to attractions for every day of the year or for every time period of the day. Experimental results show that our proposed method is slightly better than other benchmark models in terms of prediction accuracy, especially in predicting seasonal trends. The proposed method has practical significance in preventing tourists unnecessary crowding and saving a lot of queuing time.

An Efficient Study of Emotion Inference in USN Computing (USN 컴퓨팅에서 효율적인 감성 추론 연구)

  • Yang, Dong-Il;Kim, Young-Gyu;Jeong, Yeon-Man
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.1
    • /
    • pp.127-134
    • /
    • 2009
  • Recently, much research have been done on ubiquitous computing models in advanced countries as well as in Korea. Ubiquitous computing is defined as a computing environment that isn't bounded by time and space. Different kinds of computers are embedded in artifacts, devices, and environment, thus people can be connected everywhere and every time. To recognize user's emotion, facial expression, temperature, humidity, weather, and lightning factors are used for building ontology. Ontology Web Language (OWL) is adopted to implement ontology and Jena is used as an emotional inference engine. The context-awareness service infrastructure suggested in this research can be divided into several modules by their functions.

Deep Learning Based Semantic Similarity for Korean Legal Field (딥러닝을 이용한 법률 분야 한국어 의미 유사판단에 관한 연구)

  • Kim, Sung Won;Park, Gwang Ryeol
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.2
    • /
    • pp.93-100
    • /
    • 2022
  • Keyword-oriented search methods are mainly used as data search methods, but this is not suitable as a search method in the legal field where professional terms are widely used. In response, this paper proposes an effective data search method in the legal field. We describe embedding methods optimized for determining similarities between sentences in the field of natural language processing of legal domains. After embedding legal sentences based on keywords using TF-IDF or semantic embedding using Universal Sentence Encoder, we propose an optimal way to search for data by combining BERT models to check similarities between sentences in the legal field.

Effects of Jakyakkamchobuja-tang on Rheumatoid Arthritis in Rat Model: Systemic Review and Meta-Analysis (류마티스 관절염 백서 모델에서 작약감초부자탕의 효과: 체계적 문헌고찰 및 메타분석)

  • Che-Yeon Kim;Sang-Hyun Lee;Man-Suk Hwang
    • Journal of Korean Medicine Rehabilitation
    • /
    • v.33 no.3
    • /
    • pp.79-96
    • /
    • 2023
  • Objectives This study was designed to review the effect of Jakyakkamchobuja-tang on rat model with rheumatoid arthritis. Methods We used seven databases (PubMed, EMBASE, Cochrane CENTRAL, China National Knowledge Infrastructure, Oriental Medicine Advanced Searching Integrated System, Korean studies Information Service System, National Digital Science Library) from their inception to May 2023 without language restrictions. Systematic Review Centre for Laboratory Animal Experimentation's tool was used to evaluate the risk of bias. RevMan software (V5.4) was used for the meta-analysis. Results Five studies were selected following our inclusion criteria. The arthritis index decreased significantly (standardized mean difference=-2.06; 95% confidence interval=-3.07 to -1.04; p<0.0001) in Jakyakkamchobuja-tang group. Also, serum cytokines in serum and paw swelling degree decreased in Jakyakkamchobuja-tang group. Conclusions Jakyakkamchobuja-tang may be effective in treating rheumatoid arthritis. Although there is a limitation that the design of drug dosage varies between papers, it can be expected to be applied as an alternative to Western medicine, and it is believed to contribute to the standardization of herbal treatment for rheumatoid arthritis.

A Transformer-Based Emotion Classification Model Using Transfer Learning and SHAP Analysis (전이 학습 및 SHAP 분석을 활용한 트랜스포머 기반 감정 분류 모델)

  • Subeen Leem;Byeongcheon Lee;Insu Jeon;Jihoon Moon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.05a
    • /
    • pp.706-708
    • /
    • 2023
  • In this study, we embark on a journey to uncover the essence of emotions by exploring the depths of transfer learning on three pre-trained transformer models. Our quest to classify five emotions culminates in discovering the KLUE (Korean Language Understanding Evaluation)-BERT (Bidirectional Encoder Representations from Transformers) model, which is the most exceptional among its peers. Our analysis of F1 scores attests to its superior learning and generalization abilities on the experimental data. To delve deeper into the mystery behind its success, we employ the powerful SHAP (Shapley Additive Explanations) method to unravel the intricacies of the KLUE-BERT model. The findings of our investigation are presented with a mesmerizing text plot visualization, which serves as a window into the model's soul. This approach enables us to grasp the impact of individual tokens on emotion classification and provides irrefutable, visually appealing evidence to support the predictions of the KLUE-BERT model.