• Title/Summary/Keyword: deep similarity

Search Result 227, Processing Time 0.023 seconds

Learning Source Code Context with Feature-Wise Linear Modulation to Support Online Judge System (온라인 저지 시스템 지원을 위한 Feature-Wise Linear Modulation 기반 소스코드 문맥 학습 모델 설계)

  • Hyun, Kyeong-Seok;Choi, Woosung;Chung, Jaehwa
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.11
    • /
    • pp.473-478
    • /
    • 2022
  • Evaluation learning based on code testing is becoming a popular solution in programming education via Online judge(OJ). In the recent past, many papers have been published on how to detect plagiarism through source code similarity analysis to support OJ. However, deep learning-based research to support automated tutoring is insufficient. In this paper, we propose Input & Output side FiLM models to predict whether the input code will pass or fail. By applying Feature-wise Linear Modulation(FiLM) technique to GRU, our model can learn combined information of Java byte codes and problem information that it tries to solve. On experimental design, a balanced sampling technique was applied to evenly distribute the data due to the occurrence of asymmetry in data collected by OJ. Among the proposed models, the Input Side FiLM model showed the highest performance of 73.63%. Based on result, it has been shown that students can check whether their codes will pass or fail before receiving the OJ evaluation which could provide basic feedback for improvements.

Automatic Classification of Academic Articles Using BERT Model Based on Deep Learning (딥러닝 기반의 BERT 모델을 활용한 학술 문헌 자동분류)

  • Kim, In hu;Kim, Seong hee
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.3
    • /
    • pp.293-310
    • /
    • 2022
  • In this study, we analyzed the performance of the BERT-based document classification model by automatically classifying documents in the field of library and information science based on the KoBERT. For this purpose, abstract data of 5,357 papers in 7 journals in the field of library and information science were analyzed and evaluated for any difference in the performance of automatic classification according to the size of the learned data. As performance evaluation scales, precision, recall, and F scale were used. As a result of the evaluation, subject areas with large amounts of data and high quality showed a high level of performance with an F scale of 90% or more. On the other hand, if the data quality was low, the similarity with other subject areas was high, and there were few features that were clearly distinguished thematically, a meaningful high-level performance evaluation could not be derived. This study is expected to be used as basic data to suggest the possibility of using a pre-trained learning model to automatically classify the academic documents.

Prediction Model of Real Estate Transaction Price with the LSTM Model based on AI and Bigdata

  • Lee, Jeong-hyun;Kim, Hoo-bin;Shim, Gyo-eon
    • International Journal of Advanced Culture Technology
    • /
    • v.10 no.1
    • /
    • pp.274-283
    • /
    • 2022
  • Korea is facing a number difficulties arising from rising housing prices. As 'housing' takes the lion's share in personal assets, many difficulties are expected to arise from fluctuating housing prices. The purpose of this study is creating housing price prediction model to prevent such risks and induce reasonable real estate purchases. This study made many attempts for understanding real estate instability and creating appropriate housing price prediction model. This study predicted and validated housing prices by using the LSTM technique - a type of Artificial Intelligence deep learning technology. LSTM is a network in which cell state and hidden state are recursively calculated in a structure which added cell state, which is conveyor belt role, to the existing RNN's hidden state. The real sale prices of apartments in autonomous districts ranging from January 2006 to December 2019 were collected through the Ministry of Land, Infrastructure, and Transport's real sale price open system and basic apartment and commercial district information were collected through the Public Data Portal and the Seoul Metropolitan City Data. The collected real sale price data were scaled based on monthly average sale price and a total of 168 data were organized by preprocessing respective data based on address. In order to predict prices, the LSTM implementation process was conducted by setting training period as 29 months (April 2015 to August 2017), validation period as 13 months (September 2017 to September 2018), and test period as 13 months (December 2018 to December 2019) according to time series data set. As a result of this study for predicting 'prices', there have been the following results. Firstly, this study obtained 76 percent of prediction similarity. We tried to design a prediction model of real estate transaction price with the LSTM Model based on AI and Bigdata. The final prediction model was created by collecting time series data, which identified the fact that 76 percent model can be made. This validated that predicting rate of return through the LSTM method can gain reliability.

Development of a Deep Learning-based Long-term PredictionGenerative Model of Wind and Sea Conditions for Offshore Wind Farm Maintenance Optimization (해상풍력단지 유지보수 최적화 활용을 위한 풍황 및 해황 장기예측 딥러닝 생성모델 개발)

  • Sang-Hoon Lee;Dae-Ho Kim;Hyuk-Jin Choi;Young-Jin Oh;Seong-Bin Mun
    • Journal of Wind Energy
    • /
    • v.13 no.2
    • /
    • pp.42-52
    • /
    • 2022
  • In this paper, we propose a time-series generation methodology using a generative adversarial network (GAN) for long-term prediction of wind and sea conditions, which are information necessary for operations and maintenance (O&M) planning and optimal plans for offshore wind farms. It is a "Conditional TimeGAN" that is able to control time-series data with monthly conditions while maintaining a time dependency between time-series. For the generated time-series data, the similarity of the statistical distribution by direction was confirmed through wave and wind rose diagram visualization. It was also found that the statistical distribution and feature correlation between the real data and the generated time-series data was similar through PCA, t-SNE, and heat map visualization algorithms. The proposed time-series generation methodology can be applied to monthly or annual marine weather prediction including probabilistic correlations between various features (wind speed, wind direction, wave height, wave direction, wave period and their time-series characteristics). It is expected that it will be able to provide an optimal plan for the maintenance and optimization of offshore wind farms based on more accurate long-term predictions of sea and wind conditions by using the proposed model.

Content-based Korean journal recommendation system using Sentence BERT (Sentence BERT를 이용한 내용 기반 국문 저널추천 시스템)

  • Yongwoo Kim;Daeyoung Kim;Hyunhee Seo;Young-Min Kim
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.37-55
    • /
    • 2023
  • With the development of electronic journals and the emergence of various interdisciplinary studies, the selection of journals for publication has become a new challenge for researchers. Even if a paper is of high quality, it may face rejection due to a mismatch between the paper's topic and the scope of the journal. While research on assisting researchers in journal selection has been actively conducted in English, the same cannot be said for Korean journals. In this study, we propose a system that recommends Korean journals for submission. Firstly, we utilize SBERT (Sentence BERT) to embed abstracts of previously published papers at the document level, compare the similarity between new documents and published papers, and recommend journals accordingly. Next, the order of recommended journals is determined by considering the similarity of abstracts, keywords, and title. Subsequently, journals that are similar to the top recommended journal from previous stage are added by using a dictionary of words constructed for each journal, thereby enhancing recommendation diversity. The recommendation system, built using this approach, achieved a Top-10 accuracy level of 76.6%, and the validity of the recommendation results was confirmed through user feedback. Furthermore, it was found that each step of the proposed framework contributes to improving recommendation accuracy. This study provides a new approach to recommending academic journals in the Korean language, which has not been actively studied before, and it has also practical implications as the proposed framework can be easily applied to services.

Efficient Use of Unused Spare Columns for Reducing Memory Miscorrections

  • Jung, Ji-Hun;Ishaq, Umair;Song, Jae-Hoon;Park, Sung-Ju
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.12 no.3
    • /
    • pp.331-340
    • /
    • 2012
  • In the deep sub-micron ICs, growing amounts of on-die memory and scaling effects make embedded memories increasingly vulnerable to reliability and yield problems. Spare columns are often included in memories to repair defective cells or bit lines during production test. In many cases, the repair process will not use all spare columns. Schemes have been proposed to exploit these unused spare columns to store additional check bits which can be used to reduce the miscorrection probability for triple errors in single error correction-double error detection (SEC-DED). These additional check bits increase the dimensions of the parity check matrix (H-matrix) requiring extra area overhead. A method is proposed in this paper to efficiently fill the extra rows of the H-matrix on the basis of similarity of logic between the other rows. Optimization of the whole H-matrix is accomplished through logic sharing within a feasible operating time resulting in reduced area overhead. A detailed implementation using fuse technology is also proposed in this paper.

Adult-onset kaposiform hemangioendothelioma with neurofibromatosis type 1: A case report and literature review

  • Kim, Dong Hwi;Lee, Ji Sung;Shin, Jong Weon;Kim, Jean A;Jun, Young Joon
    • Archives of Plastic Surgery
    • /
    • v.45 no.6
    • /
    • pp.583-587
    • /
    • 2018
  • Kaposiform hemangioendothelioma (KHE) is a very rare, locally aggressive vascular neoplasm. It occurs mostly in children and is rarely observed in adults. It typically originates on the skin, later affecting the deep soft tissue of the extremities, head or neck, and retroperitoneum by infiltrative growth. It is locally aggressive, does not regress spontaneously, and tends to metastasize locally as well as to the regional lymph nodes. In this article, we report a case of adult-onset KHE with neurofibromatosis type 1. The patient presented to our department with a 2-month history of a painful ulceration in her left popliteal area. Since KHE had not previously been reported in patients with neurofibromatosis, the diagnosis was difficult due to the similarity of the skin manifestation to neurofibromatosis-associated lesions. We share our experience of diagnosing and treating this rare case of adult-onset KHE.

Dancing with Partners at SKT: RMM Model with SKT case (SKT 에서 파트너와 함께 춤을 : RMM 모형과 SKT 사례 적용)

  • Gwon, Tae-Hyeong;Bang, Seok-Beom;Lee, Gang-Eop;Choe, Jae-Ung
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2006.11a
    • /
    • pp.3-16
    • /
    • 2006
  • In the age where the Internet changes everything, even the earth has become flat. The boarders between nations, locations, times, and industries are not maningful, and no single company can do the whole process well. Therefore, various types of 'Value network' and 'Relation web' emerge for moving first and fast learning. Bort the relationship maturity model (RMM) proposed and the partnership management initiatives at SKT demonstrate that the concept is important, and that the final goal can be reached only through a series of critical outcome at each phase. In particular, recognizing as core infrastructures various online/offline channels, deep trust, and rich communications is an important finding for a successful relationship management. Also, related literatures suggest the following key factors to be influential in more that two phases: professionalism including expertise, similarity, channel capacity. Based on these findings, future efforts need to be put on the research & development of related measurement and management tools. We do hope more people dance with their partners through these efforts.

  • PDF

Resolving Multi-Translatable Verbs Japanese-TO-Korean Machine Translation

  • Kim Jung-In;Lee Kang-Hyuk
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.6
    • /
    • pp.790-797
    • /
    • 2005
  • It is well-known that there are many similarities between Japanese and Korean language. For example, the order of words and the nature of the grammatical conjugation of both languages are almost the same. Another similarity is the frequent omission of the subject from a sentence. Moreover, both languages have honorific expressions and the identical concept for expressing nouns in terms of Chinese characters. Using these similarities, we have developed a word-to-word translation system which does away with any deep level analysis of syntactic and semantic structures of the two languages. If we use these similarities, the direct translation method is superior to the internal language translation method or transfer-based translation method. Although the MT system based on the direct translation method is more easily developed than the ones based on other methods, it may have a lot of difficulties when it tries to select the appropriate target word from ambiguous source verbs. In this paper, we propose a new algorithm to extract the meaning of substantives and to make use of the order of the extracted meaning. We could select $86.5\%$ appropriate verbs in the sample sentences from IPAL-verb-dictionary. $13.5\%$ indicates the cases in which we could not distinguish the meaning of substantives. We are convinced, however, that the succeeding rate can be increased by getting rid of the meaning of verbs thatare not used so often.

  • PDF

Speaker Adaptation Using i-Vector Based Clustering

  • Kim, Minsoo;Jang, Gil-Jin;Kim, Ji-Hwan;Lee, Minho
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.7
    • /
    • pp.2785-2799
    • /
    • 2020
  • We propose a novel speaker adaptation method using acoustic model clustering. The similarity of different speakers is defined by the cosine distance between their i-vectors (intermediate vectors), and various efficient clustering algorithms are applied to obtain a number of speaker subsets with different characteristics. The speaker-independent model is then retrained with the training data of the individual speaker subsets grouped by the clustering results, and an unknown speech is recognized by the retrained model of the closest cluster. The proposed method is applied to a large-scale speech recognition system implemented by a hybrid hidden Markov model and deep neural network framework. An experiment was conducted to evaluate the word error rates using Resource Management database. When the proposed speaker adaptation method using i-vector based clustering was applied, the performance, as compared to that of the conventional speaker-independent speech recognition model, was improved relatively by as much as 12.2% for the conventional fully neural network, and by as much as 10.5% for the bidirectional long short-term memory.