• Title/Summary/Keyword: text search

Multi-view learning review: understanding methods and their application (멀티 뷰 기법 리뷰: 이해와 응용)

  • Bae, Kang Il;Lee, Yung Seop;Lim, Changwon
    • The Korean Journal of Applied Statistics / v.32 no.1 / pp.41-68 / 2019
  • Multi-view learning considers data from multiple viewpoints and attempts to integrate the information each view provides. It has been studied actively in recent years and has shown superior performance to models learned from a single view. With the introduction of deep learning techniques, multi-view learning has shown good results in fields such as image, text, voice, and video. In this study, we introduce how multi-view learning methods solve problems faced in human behavior recognition, medical areas, information retrieval, and facial expression recognition. In addition, we review the data integration principles of multi-view learning by classifying traditional methods into data integration, classifier integration, and representation integration. Finally, we examine how CNN, RNN, RBM, Autoencoder, and GAN, which are commonly used deep learning methods, are applied to multi-view learning algorithms. We categorize CNN- and RNN-based methods as supervised learning, and RBM-, Autoencoder-, and GAN-based methods as unsupervised learning.

Development of Artificial Intelligence-based Legal Counseling Chatbot System

  • Park, Koo-Rack
    • Journal of the Korea Society of Computer and Information / v.26 no.3 / pp.29-34 / 2021
  • With the advent of the 4th industrial revolution, information technology is converging with existing industries and fields to create services that did not previously exist. In artificial intelligence in particular, chatbots have advanced dramatically alongside natural language processing technology, and various business processes are now handled through chatbots. This study presents a system that uses slot-filling-based chatbot technology to give legal inquiries a structured form, so that a user who enters a question of a predetermined type receives an answer close to what they are looking for. With the proposed system, legal information, which is unstructured text data, can be organized into more structured question-and-answer data. In addition, by managing the accumulated Q&A data in a big data storage system such as Apache Hive and reusing it for learning, the reliability of the responses can be expected to improve continuously.

A study on the User Experience at Unmanned Checkout Counter Using Big Data Analysis (빅데이터를 활용한 편의점 간편식에 대한 의미 분석)

  • Kim, Ae-sook;Ryu, Gi-hwan;Jung, Ju-hee;Kim, Hee-young
    • The Journal of the Convergence on Culture Technology / v.8 no.4 / pp.375-380 / 2022
  • The purpose of this study is to identify consumers' perception of convenience store convenience food, and the meaning they attach to it, using big data. News, Q&A posts (Knowledge iN and Tips), blogs, cafes, and web documents on NAVER and Daum were analyzed, with 'convenience store convenience food' as the search keyword. The data analysis period was the 3 years from January 1, 2019 to December 31, 2021. For data collection and analysis, frequency and matrix data were extracted using TEXTOM, and network analysis and visualization were conducted using the NetDraw function of the UCINET 6 program. As a result, convenience store convenience foods were clustered into health, diversity, convenience, and economy according to consumers' selection attributes. The findings are expected to serve as a basis for developing new convenience menus that reflect what consumers value in convenience store convenience food, such as appropriate prices, discount coupons, and events.

Analysis Method of User Review using Open Data (오픈 데이터를 이용한 사용자 리뷰 분석 방법)

  • Choi, Taeho;Hwang, Mansoo;Kim, Neunghoe
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.22 no.6 / pp.185-190 / 2022
  • Open data has considerable economic value. Korea and many other countries are pursuing policies and other efforts to expand and utilize open data. However, although Korea has a large amount of data, it is not utilized effectively, so attempts to use it should be made across industries. In the fashion industry in particular, exchange and refund problems are the most common because consumers are unpredictable, and service providers need better feedback to solve them. We aim to address this by presenting images that illustrate improvements for points of dissatisfaction, together with user reviews that express consumer needs. In this paper, user reviews on online shopping mall websites are analyzed to identify consumer needs, and product attributes are defined using the attributes of K-fashion data. Each user request is defined as a dissatisfaction attribute, and labeled data with the corresponding attribute is retrieved. The request is then provided to the service provider as text data or attributes, along with an image, to help improve the product.

Critical Discourse Analysis on Drug Addiction (마약 중독에 대한 비판적 담론 분석)

  • Shin, Seon-Hee
    • The Journal of the Korea Contents Association / v.22 no.9 / pp.712-726 / 2022
  • The purpose of this study is to identify what discourse newspaper articles produce and distribute about 'drug addiction' and to reveal the topography and meaning of that discourse. Data were collected by searching news articles in four Korean daily newspapers with the keywords 'drug' and 'drug addiction'. Analysis using Norman Fairclough's critical discourse analysis showed, first, that the 'crime-punishment' discourse was dominant at the level of textual analysis: drug addiction is framed as a social evil and a serious crime like sex crimes, child crimes, and violence, and therefore as something to be strictly punished. Second, at the level of discourse practice, drug addiction is framed as a mental disease that needs treatment and thus requires systematic management by the state. Third, at the level of socio-cultural practice, drug addiction is framed as a means of making money, as linked to the corruption of political power, and as an object to be strongly controlled so that drug crimes do not threaten the foundation of the state. Culturally, it is framed as stemming from pleasure seeking and as the result of moral degradation. Based on this analysis, a shift to a 'disease-treatment' discourse and drug policies centered on treatment and rehabilitation are suggested as alternatives.

The Development of the Model of Information Structure for Photo Archives in University Archives (대학기록관 사진 아카이브를 위한 정보구조 모형 제안)

  • Hyewon Lee;Seunghee Han
    • Journal of Korean Society of Archives and Records Management / v.23 no.1 / pp.101-126 / 2023
  • Photographic archives of universities are among the most valuable types of records: they establish the university's identity and provide historical evidence. Unlike text records, however, they are weak at conveying meaning. It is therefore difficult to support users' search and use unless the information in photo records is comprehensively described. In this study, we structured a classification system for university photo archives and developed a metadata set that reflects the characteristics of each category in the classification. To this end, the photo archive classification systems and metadata elements of domestic and American university archives were analyzed, and an information structure model was proposed on that basis. The model presented in this study can help university archives improve the data quality of their photo archives and support richer discovery of photo archives by users.

A Study on the Characteristics of Re-Organized Shortform Contents (재가공형 숏폼 콘텐츠의 특성 연구)

  • Lee, Jin;Yun, Hyunjung;Yun, Hye-Young
    • The Journal of the Korea Contents Association / v.22 no.5 / pp.67-80 / 2022
  • The purpose of this study is to clarify the meaning and characteristics of re-organized shortform content, in which media companies edit and distribute existing broadcast content. To this end, all drama videos and representative entertainment programs from KBS, MBC, SBS, JTBC, and tvN released on the Naver TV platform from 2014 to 2021 were selected and analyzed both synchronically and diachronically. From the synchronic approach, the analysis showed quantitative and qualitative expansion, with the number and form of videos diversifying for both dramas and entertainment programs. In the case of special videos in particular, their meaning as independent content was strengthened, for example through sequencing centered on characters, themes, and materials. Thumbnails and titles were also found to be formalized as tags, serving as paratexts that act as curation for searches. From the diachronic point of view, comparison with real-time views and the original videos showed that re-organized shortform content is treated as character-oriented content with its own independent viewing context. This study is significant as an attempt to capture the meaning and changing status of shortform content, which had been considered incidental.

A Study on Utilization Method of Information Visualization in the Humanities and Area Studies (인문·지역연구에서의 정보시각화 활용 방안 연구)

  • Kang, Ji-Hoon;Lee, Dong-Yul;Moon, Sang-Ho
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology / v.5 no.5 / pp.59-68 / 2015
  • Because interdisciplinary convergence can go beyond the borders of individual disciplines, it can create new and meaningful knowledge through collaborative research between different fields of study. In recent years, the Digital Humanities has attracted particular attention as a convergence of the Humanities and ICT. From a research methodology perspective, the Digital Humanities is a tool that can serve as a convergence system for using information in various ways, such as storage, retrieval, sharing, and dissemination. From an information systems perspective, the Digital Humanities has been built and used in a variety of systems. Among these, studies on information visualization for the Digital Humanities have been actively conducted. Various forms, such as images, multimedia, and interfaces, can be applied to visualize data or information. In this paper, we analyze cases of information visualization in Digital Humanities systems and propose a method for using them in the Humanities and Area Studies.

A Study on Speech Synthesizer Using Distributed System (분산형 시스템을 적용한 음성합성에 관한 연구)

  • Kim, Jin-Woo;Min, So-Yeon;Na, Deok-Su;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea / v.29 no.3 / pp.209-215 / 2010
  • Portable terminals have recently received attention owing to wireless networks and large-capacity ROM. As a result, TTS (Text-to-Speech) systems are being embedded in portable terminals. Although high-quality synthesis is difficult on a portable terminal, users still demand it. In this paper, we propose a Distributed TTS (DTTS) composed of a server and a terminal. Because the DTTS uses corpus-based speech synthesis, it can produce high-quality speech. The synthesis system on the server searches the database, generates optimized speech concatenation information, and transmits it to the terminal. The synthesis system on the terminal then produces high-quality speech with low computation using the concatenation information received from the server. The proposed method reduces complexity, lowers power consumption, and makes maintenance more efficient.

Analyzing Different Contexts for Energy Terms through Text Mining of Online Science News Articles (온라인 과학 기사 텍스트 마이닝을 통해 분석한 에너지 용어 사용의 맥락)

  • Oh, Chi Yeong;Kang, Nam-Hwa
    • Journal of Science Education / v.45 no.3 / pp.292-303 / 2021
  • This study identifies the terms frequently used together with energy in online science news articles, and the topics of those reports, to find out how the term energy is used in everyday life and to draw implications for science curriculum and instruction about energy. A total of 2,171 online news articles in the science category, published by 11 major newspaper companies in Korea over one year from March 1, 2018, were selected using energy as the search term. Natural language processing yielded a total of 51,224 sentences consisting of 507,901 words for analysis. Term frequency analysis, semantic network analysis, and structural topic modeling were performed using the R program. The results show that the terms with exceptionally high frequencies were technology, research, and development, reflecting the nature of news articles that report new findings. Terms used more than once per two articles were industry-related terms (industry, product, system, production, market) and terms that would be expected to relate to energy, such as 'electricity' and 'environment.' Meanwhile, 'sun', 'heat', 'temperature', and 'power generation', which are frequently used in energy-related science classes, were also among the highest-frequency terms. The network analysis found two clusters: terms related to industry and technology, and terms related to basic science and research. The analysis of terms paired with energy also found that terms related to the use of energy, such as 'energy efficiency,' 'energy saving,' and 'energy consumption,' were the most frequently used. From the 16 topics found, four contexts of energy were drawn: 'high-tech industry,' 'industry,' 'basic science,' and 'environment and health.' The results suggest that introducing the concept of energy degradation as a starting point for energy classes can be effective. They also show the need to introduce high-tech industries or the context of environment and health into energy learning.