• Title/Summary/Keyword: vocabulary data

Search Result 285, Processing Time 0.024 seconds

Burmese Sentiment Analysis Based on Transfer Learning

  • Mao, Cunli;Man, Zhibo;Yu, Zhengtao;Wu, Xia;Liang, Haoyuan
    • Journal of Information Processing Systems
    • /
    • v.18 no.4
    • /
    • pp.535-548
    • /
    • 2022
  • Using a rich resource language to classify sentiments in a language with few resources is a popular subject of research in natural language processing. Burmese is a low-resource language. In light of the scarcity of labeled training data for sentiment classification in Burmese, in this study, we propose a method of transfer learning for sentiment analysis of a language that uses the feature transfer technique on sentiments in English. This method generates a cross-language word-embedding representation of Burmese vocabulary to map Burmese text to the semantic space of English text. A model to classify sentiments in English is then pre-trained using a convolutional neural network and an attention mechanism, where the network shares the model for sentiment analysis of English. The parameters of the network layer are used to learn the cross-language features of the sentiments, which are then transferred to the model to classify sentiments in Burmese. Finally, the model was tuned using the labeled Burmese data. The results of the experiments show that the proposed method can significantly improve the classification of sentiments in Burmese compared to a model trained using only a Burmese corpus.

The Effects of Corpus Use on Learning L2 Collocations of Light Verbs and Nouns

  • Yoshiho Satake
    • Asia Pacific Journal of Corpus Research
    • /
    • v.4 no.2
    • /
    • pp.41-55
    • /
    • 2023
  • In data-driven learning (DDL), learners explore a corpus to understand vocabulary and grammar. Although many studies have emphasized the role of DDL in second language (L2) acquisition, L2 light verbs have been largely under-explored. To bridge this gap, this study focused on the learning outcomes of L2 light verbs among 29 intermediate-level Japanese university students. The research zeroed in on six prevalent light verbs in English: "make," "do," "take," "have," "give," and "get." Over nine weeks, the participants engaged with verb-noun collocations using worksheets that juxtaposed Japanese translations of the target collocations with their English equivalents, with the verbs omitted. With the aid of Wordbanks Online, they filled in the blanks and constructed accurate sentences. Before this activity, a 20-minute tutorial was given to the participants on how to interpret the concordance lines. The effectiveness of the DDL method was evaluated using pre-tests, immediate post-tests, and delayed post-tests. The results showed that DDL significantly improved the participants' knowledge of the target collocations of light verbs and nouns; the post-test and delayed post-test scores were significantly higher than the pre-test scores. The results showed that, overall, DDL contributed to memorizing the collocations of light verbs and nouns; however, DDL had different effects on the memorization of collocations across different light verbs. The extent of work on the worksheet is not the only factor in its retention, and observing concordance lines may promote learners' memorization of light-verb collocations.

The Effect of Korean Wave (Hallyu) on the Music Industry

  • Woo-Jun JANG;Min-Ho, CHANG
    • The Journal of Industrial Distribution & Business
    • /
    • v.14 no.11
    • /
    • pp.11-18
    • /
    • 2023
  • Purpose: This study aims to respond to essential queries regarding significant impacts the Korean Wave had on the music business especially in light of K-pop's explosive growth on the international scene and how to transform Hallyu into the global dynamics of the music business. Also, the study investigates what degree of cultural bridging through Music's universal language has Kpop achieved beyond its status as a purely musical genre. Research design, data and methodology: For the process of data collecting, the current investigators used a combination of keywords and controlled vocabulary terms to conduct in-depth searches across reputable academic databases, including PubMed, Scopus, Web of Science, and Google Scholar. Keywords are significant in searching databases such that the desired articles can be sought out wiith the keywords "Korean Wave," "Hallyu," and "music industry,". Results: The investigators found the globalization of K-pop, diverse audience engagement, digital transformation, and cultural exchange through Music as four critical effects of the Korean Wave on the music business. Conclusions: Lastly, this study concludes that As we end our investigation into Hallyu's effects on the music business, it is clear that Korean Music's cultural impact and international appeal have created new opportunities and particular difficulties for both professionals and artists.

Domain Adaptive Fruit Detection Method based on a Vision-Language Model for Harvest Automation (작물 수확 자동화를 위한 시각 언어 모델 기반의 환경적응형 과수 검출 기술)

  • Changwoo Nam;Jimin Song;Yongsik Jin;Sang Jun Lee
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.19 no.2
    • /
    • pp.73-81
    • /
    • 2024
  • Recently, mobile manipulators have been utilized in agriculture industry for weed removal and harvest automation. This paper proposes a domain adaptive fruit detection method for harvest automation, by utilizing OWL-ViT model which is an open-vocabulary object detection model. The vision-language model can detect objects based on text prompt, and therefore, it can be extended to detect objects of undefined categories. In the development of deep learning models for real-world problems, constructing a large-scale labeled dataset is a time-consuming task and heavily relies on human effort. To reduce the labor-intensive workload, we utilized a large-scale public dataset as a source domain data and employed a domain adaptation method. Adversarial learning was conducted between a domain discriminator and feature extractor to reduce the gap between the distribution of feature vectors from the source domain and our target domain data. We collected a target domain dataset in a real-like environment and conducted experiments to demonstrate the effectiveness of the proposed method. In experiments, the domain adaptation method improved the AP50 metric from 38.88% to 78.59% for detecting objects within the range of 2m, and we achieved 81.7% of manipulation success rate.

A Study on 『Korean Translation of ·』 -Focused on declared characteristics and characteristics in different versions- (『국역본 <>·<>』 고찰 -표기적 특징과 이본적 성격을 중심으로-)

  • Kan, Ho-yun
    • Journal of Korean Classical Literature and Education
    • /
    • no.15
    • /
    • pp.355-387
    • /
    • 2008
  • The purpose of the study was to decide Korean translation and the copying period of "Korean Translation of " and to look all around their characteristics in different versions carefully until now. The "Korean Translation" is a collection of Korean-translated romance and love stories excavated by a professor Kim,Il Geun, and there is not a little meaning in the context of novel history in the point of view of 'Korean translation of a court possession'. Arranging conclusion of the study generally, it is as follows. (1) Considering phonological phenomena, grammar and vocabulary in the study of Korean language, it is presumed that they would be translated into Korean and copied between the regime period of the King Sukjong and the regime period of the King Yungjo in the Joseon Dynasty. For, they were composed of a middle declaration of copied 'Myeoknambon "Korean Translation of Taepyeonggwanggi(태평광기)"' and 'NakseonJaebon(낙선재본)' between the middle of the 17th century and the middle of the 18th century and the regime period of the King Jeongjo in the Joseon Dynasty appointed as the background period of the novels should be excepted. Consequently, through the Korean Translation, we can confirm that the novel scope between the 17th century and the 18th century in Korean novel history was widened until 'The Royal Court' and 'Women'. (2) In the side of vocabulary, the "Korean Translation" also has not a little meaning in the side of a collection translated in the Royal Court. It doesn't have new vocabularies, but partial vocabularies as '(Traces:痕)' '(Clean eyes:明眸)', ' (Sail:帆)', '(Get up:起)', '글이플(Weak grass:弱草)', '쇼록(Owl:? 梟 or 鴉?)', '이 사라심(This life:此生)', and '노혀오매(Look for:訪)' are good data in the study of Korean language. (3) The "Korean Translation" is a valuable data about translation and copying of a court novel and we can discover intentionally changed parts and partially omitted sentences rather in the than in the . There are differences between a translation book and a copying book and we can catch sight of intention of translation and unsettledness of copying in the second work. Therefore, we can know that the "Korean Translation" has a double context which one work is translated and a work in different version is derived, compared to a simple copy. (4) The "Korean Translation" has a close relation with "Hangoldong(閒汨董)", but it doesn't regard the same copy as a foundation. The basic copy of translation of the "Korean Translation" is a different version of the same line as "Hangoldong" and "Jeochobon(저초본:정명기 소장본)" and is more similar line to "Hangoldong", but it is also not the same basic copy. (5) Considering that the "Korean Translation" doesn't has a distinct relation with the "Hangoldong", there is no correlation between the "Korean Translation" and and the "Hangoldong" and . In addition, we could not discover a writer's identity between the two.

A Study on the New Trends of EDI based Internet (인터넷을 기반으로 하는 EDI 신조류)

  • 조원길
    • The Journal of Information Technology
    • /
    • v.4 no.1
    • /
    • pp.125-139
    • /
    • 2001
  • EDI(Electronic Data Interchange) works by providing a collection of standard message formats and element dictionary in a simple way for businesses to exchange data via any electronic messaging service. Open-edi is electronic data interchange among autonomous parties using public standards and aiming towards interoperability over time, business sectors, information technology and data types. The number of Internet services using XML/EDI has grown rapidly since it is easily expansible and exchangeable. To use this service, the client does not have to install EDI S/W but only needs internet browser. Consequently, it became much easier and faster to handle the trading process in an office. eBusiness SML (extensible markup language) electronic data interchange. eXedi is the service that realizes B2B of XML/EDI. eXedi can be used easily in small and medium sized companies. Companies in any place can access to eXedi using the existing Internet connection. XML/EDI provides a standard framework to exchange different types of data -- for example, an invoice, healthcare claim, project status -- so that the information be it in a transaction, exchanged via an Application Program Interface (API), web automation, database portal, catalog, a workflow document or message can be searched, decoded, manipulated, and displayed consistently and correctly by first implementing EDI dictionaries and extending our vocabulary via on-line repositories to include our business language, rules and objects.

  • PDF

A Study on the Design of LADM-based Cadastral Data Model for Mongolia (LADM 기반의 몽골 지적 데이터 모델 설계에 관한 연구)

  • Munkhbaatar, Buuveibaatar;Kim, Moon-Gie;Lee, Young-ho;Koh, June-Hwan
    • Journal of Cadastre & Land InformatiX
    • /
    • v.48 no.2
    • /
    • pp.51-64
    • /
    • 2018
  • The paper reviews the adoption of ISO 19152, Land Administration Domain Model (LADM) for the enhancement of the current Mongolian cadastral system. It can be said that the current cadastral system is developed for the pure purpose of land registration. There is a need for a comprehensive data model for not only this reason but also reflecting the current problems in the Mongolian cadastral system. The LADM was published by the International Organization for Standardization later in 2012 as an International Standard for modeling cadastral and land administration information for the purpose of providing a common vocabulary(ontology) and efficient system development. This study examined possibilities of adopting the LADM to the cadastral system for Mongolia focused on Land Manager system. Data model of the Land Manager was examined against the corresponding LADM classes and as a result, gaps between each data model have been drawn. Lastly we proposed the LADM-based new data model for Mongolian cadastral system ensuring that the current problems be reflected.

A study of Big-data analysis for relationship between students (학생들의 관계성 파악을 위한 빅-데이터 분석에 관한 연구)

  • Hwang, Deuk-Young;Kim, Jin-Mook
    • Convergence Security Journal
    • /
    • v.15 no.4
    • /
    • pp.113-119
    • /
    • 2015
  • Recent, cyber violence is increasing in a school and the severity of the problems encountered day by day. In particular, the severity of the cyber force using the smart phone is recognized as a very high and great problems socially. Cyberbullying have long damage degree and a wide range time duration against of existed physical cyber violence. Then student's affects is very seriously. Therefore, we analyzes the relationship and languages in the classroom for students to use to identify signs of cyber violence that may occur between friends in the class. And we support this information to identified parent, classroom teachers and school sheriff for prevent cyberbullying accidents in the school. For this research, we will design and implement a messenger in the cyber classroom. It have many components that are Big-data vocabulary, analyzer, and communication interface. Our proposed messenger can analyze lingual sign and friendship between students using Big-data analysis method such as text mining. It can analysis relationship by per-student, per-classroom.

A Study on The Usability Evaluation Based on Text Analysis for The Development of Comfort-Shoes for Middle-Aged

  • KIM, Ji Ho;YOON, Sang Hoon;KWON, Ki Hyun;SEO, Jeong Kwon;HAN, Seung Jin
    • Journal of Sport and Applied Science
    • /
    • v.3 no.2
    • /
    • pp.17-27
    • /
    • 2019
  • Purpose: This study is to conduct usability evaluations from the perspective of developing comfort-shoes for the middle-aged and elderly to identify key factors and derive implications for optimal comfort-shoes production. Research design, data, and methodology: A total of 10 middle-aged and elderly women in their 50s and 60s were selected as eligible for the rescue. For data collection, the study was conducted in a Gang Survey, where pre-explanations, shoes test, and interviews were conducted. The collected data were analyzed in a total of four stages. In step 1, the contents obtained through interviews with the subjects were recorded in text, organized and analyzed systematically, and in step 2, unnecessary vocabulary, sentences, and overlapping opinions were eliminated. In step 3, we classified areas around key functions and carried out categorization tasks. Finally, in Step 4, the results and implications of the study were derived by classifying each usability evaluation shoe as positive and negative text around categorized data. Results: There are a total of seven factors for comfort-shoes usability evaluation, which are categorized as cushion, fitting, stability, flexibility, lightweight, comfort, and pressure. Positive/negative factors for the derived usability evaluation factors were shown in the form of a positive-centered, negative-centered, and positive-mixed mix for each of the four products. Positive-focused products are VA products, which are seven times more positive than negative factors. Negative-centered products are CL and SA products, which are five times more negative than positive factors. Positive mixing was a CA product with a ratio of 1:1. Text-based usability evaluations allow us to proceed with analysis based on more scientific data rather than simply listening to opinions and judging by comments. Conclusions: The study discussed implications of developing comfort-shoes for middle-aged consumers and future directions were discussed.

Summarization of Korean Dialogues through Dialogue Restructuring (대화문 재구조화를 통한 한국어 대화문 요약)

  • Eun Hee Kim;Myung Jin Lim;Ju Hyun Shin
    • Smart Media Journal
    • /
    • v.12 no.11
    • /
    • pp.77-85
    • /
    • 2023
  • After COVID-19, communication through online platforms has increased, leading to an accumulation of massive amounts of conversational text data. With the growing importance of summarizing this text data to extract meaningful information, there has been active research on deep learning-based abstractive summarization. However, conversational data, compared to structured texts like news articles, often contains missing or transformed information, necessitating consideration from multiple perspectives due to its unique characteristics. In particular, vocabulary omissions and unrelated expressions in the conversation can hinder effective summarization. Therefore, in this study, we restructured by considering the characteristics of Korean conversational data, fine-tuning a pre-trained text summarization model based on KoBART, and improved conversation data summary perfomance through a refining operation to remove redundant elements from the summary. By restructuring the sentences based on the order of utterances and extracting a central speaker, we combined methods to restructure the conversation around them. As a result, there was about a 4 point improvement in the Rouge-1 score. This study has demonstrated the significance of our conversation restructuring approach, which considers the characteristics of dialogue, in enhancing Korean conversation summarization performance.