• Title/Summary/Keyword: 텍스트 연구

Search Result 3,494, Processing Time 0.029 seconds

Development and Validation of the Letter-unit based Korean Sentimental Analysis Model Using Convolution Neural Network (회선 신경망을 활용한 자모 단위 한국형 감성 분석 모델 개발 및 검증)

  • Sung, Wonkyung;An, Jaeyoung;Lee, Choong C.
    • The Journal of Society for e-Business Studies
    • /
    • v.25 no.1
    • /
    • pp.13-33
    • /
    • 2020
  • This study proposes a Korean sentimental analysis algorithm that utilizes a letter-unit embedding and convolutional neural networks. Sentimental analysis is a natural language processing technique for subjective data analysis, such as a person's attitude, opinion, and propensity, as shown in the text. Recently, Korean sentimental analysis research has been steadily increased. However, it has failed to use a general-purpose sentimental dictionary and has built-up and used its own sentimental dictionary in each field. The problem with this phenomenon is that it does not conform to the characteristics of Korean. In this study, we have developed a model for analyzing emotions by producing syllable vectors based on the onset, peak, and coda, excluding morphology analysis during the emotional analysis procedure. As a result, we were able to minimize the problem of word learning and the problem of unregistered words, and the accuracy of the model was 88%. The model is less influenced by the unstructured nature of the input data and allows for polarized classification according to the context of the text. We hope that through this developed model will be easier for non-experts who wish to perform Korean sentimental analysis.

Discovery of Genre Information on the Web (웹 상에서의 특정 장르 문서 발견)

  • Joo, Won-Kyun;Myaeng, Sung-Hyon
    • Annual Conference on Human and Language Technology
    • /
    • 1999.10e
    • /
    • pp.28-35
    • /
    • 1999
  • 정보공유를 목적으로 제안된 웹의 활성화와 함께 유용한 정보들이 웹상에 기하급수적으로 등장함에 따라 정보공간의 확장으로 인한 검색 신뢰도의 저하 문제에 직면하게 되었다. 본 연구에서는 대용량 웹 환경하에서 사용자의 정보발견을 돕기 위해 텍스트이외의 새로운 요소들을 사용하여 특정장르문서를 발견하는 개념을 도입하였다. 먼저 사용자가 발견하고자 하는 장르의 모습을 텍스트, URL정보, 링크 정보. 문서구조 정보 등의 장르 식별요소 값을 이용해 표현한 후, 후보 문서들의 장르관련도를 측정함으로써 특정장르 문서를 검색한다. 각 장르식별요소값은 나름대로의 방법에 의해 계산되는데 $0{\sim}1$사이의 값을 가지며, 종합적인 장르관련도는 각 장르식별요소값의 증거통합 방법에 의해 구한다. 본 논문에서는 각 장르식별요소들의 역할과 장르식별요소가 장르발견에 미치는 영향을 알아보며, 최종적으로 특정 장르 문서발견에 있어서의 검색 신뢰도 향상을 보이기 위해 실험모델을 설계/구현하였다. 본 실험은 웹 문서를 대상으로 하는데, 아직까지 URL, 링크 정보를 모두 갖춘 테스트컬렉션이 없기 때문에 실험을 위해 일반적인 웹 문서로 직접 구성한 컬렉션을 사용하였다. 발견하고자 하는 장르는 "컴퓨터 분야의 컨퍼런스 홈페이지"로 정하였으며 30개의 컴퓨터 분야를 선정하였다. 비교대상으로는 일반 웹 검색 엔진인 알타비스타와 메타검색 엔진인 메타크롤러를 선택하였고. 각 질의에 대해 상위 30개의 결과를 대상으로 정확도를 평가하였다. 결과로서 각 장르식별요소들은 모두 검색 신뢰도의 향상에 기여를 하며, 제안하는 방법은 알타비스타와 메타크롤러에 비해 각각 평균적으로 67.34%, 71.78%의 검색 신뢰도 향상을 보임을 입증하였다.적응에 문제점을 가지기도 하였다. 본 연구에서는 그 동안 계속되어 온 한글과 한잔의 사용에 관한 논쟁을 언어심리학적인 연구 방법을 통해 조사하였다. 즉, 글을 읽는 속도, 글의 의미를 얼마나 정확하게 이해했는지, 어느 것이 더 기억에 오래 남는지를 측정하여 어느 쪽의 입장이 옮은 지를 판단하는 것이다. 실험 결과는 문장을 읽는 시간에서는 한글 전용문인 경우에 월등히 빨랐다. 그러나. 내용에 대한 기억 검사에서는 국한 혼용 조건에서 더 우수하였다. 반면에, 이해력 검사에서는 천장 효과(Ceiling effect)로 두 조건간에 차이가 없었다. 따라서, 본 실험 결과에 따르면, 글의 읽기 속도가 중요한 문서에서는 한글 전용이 좋은 반면에 글의 내용 기억이 강조되는 경우에는 한자를 혼용하는 것이 더 효율적이다.이 높은 활성을 보였다. 7. 이상을 종합하여 볼 때 고구마 끝순에는 페놀화합물이 다량 함유되어 있어 높은 항산화 활성을 가지며, 아질산염소거능 및 ACE저해활성과 같은 생리적 효과도 높아 기능성 채소로 이용하기에 충분한 가치가 있다고 판단된다.등의 관련 질환의 예방, 치료용 의약품 개발과 기능성 식품에 효과적으로 이용될 수 있음을 시사한다.tall fescue 23%, Kentucky bluegrass 6%, perennial ryegrass 8%) 및 white clover 23%를 유지하였다. 이상의 결과를 종합할 때, 초종과 파종비율에 따른 혼파초지의 건물수량과 사료가치의 차이를 확인할 수 있었으며, 레드 클로버 + 혼파 초지가 건물수량과 사료가치를 높이는데 효과적이었다.\ell}$ 이었으며 , yeast extract 첨가(添加)하여 배양시(培養時)는 yeast extract 농도(濃度)가 증가(增加)함

  • PDF

Developmental disability Diagnosis Assessment Systems Implementation using Multimedia Authorizing Tool (멀티미디어 저작도구를 이용한 발달장애 진단.평가 시스템 구현연구)

  • Byun, Sang-Hea;Lee, Jae-Hyun
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.3 no.1
    • /
    • pp.57-72
    • /
    • 2008
  • Serve and do so that graft together specialists' view application field of computer and developmental disability diagnosis estimation data to construct developmental disability diagnosis estimation system in this Paper and constructed developmental disability diagnosis estimation system. Developmental disability diagnosis estimation must supply information of specification area that specialists are having continuously. Developmental disability diagnosis estimation specialist system need multimedia data processing that is specialized little more for developmental disability classification diagnosis and decision-making and is atomized for this. Characteristic of developmental disability diagnosis estimation system that study in this paper can supply quick feedback about result, and can reduce mistake on recording and calculation as well as can shorten examination's enforcement time, and background of training is efficient system fairly in terms of nonprofessional who is not many can use easily. But, as well as when multimedia information that is essential data of system construction for developmental disability diagnosis estimation is having various kinds attribute and a person must achieve description about all developmental disability diagnosis estimation informations, great amount of work done is accompanied, technology about equal data can become different according to management. Because of these problems, applied search technology of contents base (Content-based) that search connection information by contents of edit target data for developmental disability diagnosis estimation data processing multimedia data processing technical development. In the meantime, typical access way for conversation style data processing to support fast image search, after draw special quality of data by N-dimension vector, store to database regarding this as value of N dimension and used data structure of Tree techniques to use index structure that search relevant data based on this costs. But, these are not coincided correctly in purpose of developmental disability diagnosis estimation because is developed focusing in application field that use data of low dimension such as original space DataBase or geography information system. Therefore, studied save structure and index mechanism of new way that support fast search to search bulky good physician data.

  • PDF

Design and Implementation of Visual Environment for Parallel Object-Oriented Programming (병렬 객체지향 프로그래밍을 위한 시각 환경의 설계 및 구현)

  • Choe, Suk-Yeong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.2
    • /
    • pp.485-496
    • /
    • 1999
  • Comparing with sequential programming, parallel programming has additional complexity due to the consideration of parallelism, communication and synchronization of processes. A synergism between users and compliers should be established, each assisting the other to produce high quality parallel programs. On the above underlying philosophy, we developed a parallel Object-Oriented specification language, POOSL, as preliminary works. However, it is still likely to hard for users to write parallel program because users have to consider grammar of POOSL and to write text-based parallel program. It would be more desirable to provide users wit visual environment for effective parallel programming. Therefore, we propose a visual programming environment. VEPO(Visual environment for Parallel Object-Oriented Programming), based on POOSL in order that users can develop parallel programs more easily and conveniently. It aims at supporting a programming environment in which users can represent their programs more naturally and visually I parallel manner with object-oriented concept and essential steps during parallel program development such as program specification, compilation, execution and animation of execution are integrated. VEPO has useful features for parallel processing. Especially, complicated parallel codes for synchronization and communication of processes are automatically generated in the translation phase, so users can be relieved of writing error-prone parallel codes. The system is targeted to the transputer-based parallel system, MC-3. The graphic user interface of VEPO was implemented using Visual C++. Visual programs descirbed on VEPO are translated into Inmos C and executed on MC-3.

  • PDF

Health Consciousness and Health Information Orientation on Health Information Searching Behaviors of Middle-Aged Adults (중년층의 건강관심도와 건강정보추구도가 인터넷 건강정보 검색행동에 미치는 영향)

  • Lee, Hawyoung;Oh, Sanghee
    • Journal of the Korean Society for information Management
    • /
    • v.38 no.3
    • /
    • pp.73-99
    • /
    • 2021
  • The purpose of this study is to analyze the health information use experience of middle-aged people in their 40s and 50s and to observe and analyze their health information search behaviors according to health consciousness and health information orientation. This study uses Information Foraging Theory with the concept of information scents which leads users to detect and collect cues in information searching. Types and contents of information cues that middle-aged people use when searching for health information were investigated. Also, how their health consciousness and health information orientation affected using information cues were analyzed. Three methods of research were used; (1) pre-interviews, (2) search experiments, and (3) post-interviews. Thirty-two middle-aged people participated in the study. Their performance on health information searching was recorded and referred to in the post-interviews using a think-aloud protocol. Findings presented that middle-aged people's health consciousness and health information orientation affected the perception of information scents in health information search; those with high health consciousness and health information orientation consider the text made by the government office the most critical information cues. We believe findings from this study could be used for public libraries or non-profit institutions to understand middle-aged people's health information behaviors to design education programs for information retrieval considering users' health consciousness and health information orientation. Findings could also contribute to Internet portal site or health-related web site designers developing strategies for middle-aged users to access health information effectively.

Comparative Analysis of Low Fertility Policy and the Public Perceptions using Text-Mining Methodology (텍스트 마이닝을 활용한 저출산 정책과 대중인식 비교)

  • Bae, Giryeon;Moon, HyunJeong;Lee, Jaeil;Park, Mina;Park, Arum
    • Journal of Digital Convergence
    • /
    • v.19 no.12
    • /
    • pp.29-42
    • /
    • 2021
  • As the low fertility intensifies in Korea, this study investigated fundamental differences between the government's low fertility policy and public perception of it. To this end, we selected four times 'Aging Society and Population Policy' documents and news comments for two weeks immediately after announcement of the third and fourth Policy as analysis targets. Then we conducted word frequency analysis, co-occurrence analysis and CONCOR analysis. As a result of analyses, first, direct childcare support during the first and second periods, and a social structural approach during third and fourth periods were noticeable. Second, it was revealed that both policies and comments aim for the work-family compatibility in 'parenting'. Lastly it was showed public interest in environment of raising children and the critical mind to effectiveness of the policy. This study is meaningful in that it confirmed the public perception using big data analysis, and it will help improve the direction for the future low fertility policy.

Transcultural Practice of the History of Modern Korean Literature Written in China (중국에서 저술된 한국근현대문학사의 문화횡단적 실천 - 남한문학사·북한문학사·자국문학사라는 세 겹의 프리즘 -)

  • Lee, Sun-yi
    • Cross-Cultural Studies
    • /
    • v.48
    • /
    • pp.107-133
    • /
    • 2017
  • This study compares the history of modern Korean literature written in China with the history of South Korean literature, the history of North Korean literature and the history of national literature, explores aspects of narrative and therefore examines transcultural practice presented in such texts. There have hitherto been approximately 25 works on the history of Korean literature written in China, and 16 of 25 works are on the history of modern Korean literature. Regarding their purpose, the number of pedagogical works outstandingly exceeds the number of research works. In terms of perspective and contents, it can be divided into three categories; one that only embraces the history of South Korean literature, another embracing the history of North Korean literature only and the other embracing the history of South Korean and North Korean literature. This study has selected representative texts from each category and compared recognition and narrative aspects to that of the history of South Korean literature, the history of North Korean literature and the history of Chinese literature. It further examines loci of definitions' transfer and formation as well. As a result, this study reveals valuable understanding of recognition and narration of the history of Korean literature. First, this study offers an introspective attitude, as the history of modern Korean literature accentuates influence of only Western literature, overlooking influence of Chinese literature. Second, this study proposes a new narrative perspective on the history of Unified Korean literature through independent and objective identification of the history of North Korean literature. Last, it emphasizes popularization of literature - aside from pure literary-centrism - and expands possibilities of embracing distinct works relevant to multimedia.

Formulating Strategies from Consumer Opinion Analysis on AI Kids Phone using Text Mining (AI 키즈폰의 소비자리뷰 분석을 통한 제품개선 전략에 대한 연구)

  • Kim, Dohun;Cha, Kyungjin
    • The Journal of Society for e-Business Studies
    • /
    • v.24 no.2
    • /
    • pp.71-89
    • /
    • 2019
  • In order to come up with satisfying product and improvement, firms use traditional marketing research methods to obtain consumers' opinions and further try to reflect them. Recently, gathering data from consumer communication platforms like internet and SNS has become popular methods. Meanwhile, with the development of information technology, mobile companies are launching new digital products for children to protect them from harmful content and provide them with necessary functions and information. Among these digital products, Kids Phone, which is a wearable device with safe functions that enable parents to learn childern's location. Kids phone is relatively cheaper and simpler than smartphone but it is noted that there are several problems such as some useless functions and frequent breakdowns. This study analyzes the reviews of Kids phones from domestic mobile companies, identifies the characteristics, strengths and weaknesses of the products, proposes improvement methods strategies for devices and services through SNS consumer analysis. In order to do that customer review data from online shopping malls was gathered and was further analyzed through text mining methods such as TF/IDF, Sentiment Analysis, and network analysis. Customer review data was gathered through crawling Online shopping Mall and Naver Blog/$Caf\acute{e}$. Data analysis and visualization was done using 'R', 'Textom', and 'Python'. Such analysis allowed us to figure out main issues and recent trends regarding kids phones and to suggest possible service improvement strategies based on sentiment analysis.

A Curriculum Study to Strengthen AI and Data Science Job Competency (AI·데이터 사이언스 분야 직무 역량 강화를 위한 커리큘럼 연구)

  • Kim, Hyo-Jung;Kim, Hee-Woong
    • Informatization Policy
    • /
    • v.28 no.2
    • /
    • pp.34-56
    • /
    • 2021
  • According to the Fourth Industrial Revolution, demand for and interest in jobs in the field of AI and data science - such as artificial intelligence/data analysts - are increasing. In order to keep pace with this trend, and to supply human resources that can effectively perform such jobs in the relevant fields in a timely manner, job seekers must develop the competencies required by the companies, and universities must be in charge of training. However, it is difficult to devise appropriate response strategies at the level of job seekers, companies and universities, which are stakeholders in terms of supplying suitably competent personnel. Therefore, the purpose of this study is to determine which competencies are required in practice in order to cultivate and supply human talents equipped with the necessary job competencies, and to propose plans for the development of the required competencies at the university level. In order to identify the required competencies in the field of AI and data science, data on job postings on the LinkedIn site, the recruitment platform, were analyzed using text mining techniques. Then, research was conducted with the aim of devising and proposing concrete plans for competency development at the university level by comparing and verifying the results of the international graduate school curriculum in the field of AI and data science, and the interview results with the hiring managers, respectively, with the results of the topic model.

Analysis of Major COVID-19 Issues Using Unstructured Big Data (비정형 빅데이터를 이용한 COVID-19 주요 이슈 분석)

  • Kim, Jinsol;Shin, Donghoon;Kim, Heewoong
    • Knowledge Management Research
    • /
    • v.22 no.2
    • /
    • pp.145-165
    • /
    • 2021
  • As of late December 2019, the spread of COVID-19 pandemic began which put the entire world in panic. In order to overcome the crisis and minimize any subsequent damage, the government as well as its affiliated institutions must maximize effects of pre-existing policy support and introduce a holistic response plan that can reflect this changing situation- which is why it is crucial to analyze social topics and people's interests. This study investigates people's major thoughts, attitudes and topics surrounding COVID-19 pandemic through the use of social media and big data. In order to collect public opinion, this study segmented time period according to government countermeasures. All data were collected through NAVER blog from 31 December 2019 to 12 December 2020. This research applied TF-IDF keyword extraction and LDA topic modeling as text-mining techniques. As a result, eight major issues related to COVID-19 have been derived, and based on these keywords, this research presented policy strategies. The significance of this study is that it provides a baseline data for Korean government authorities in providing appropriate countermeasures that can satisfy needs of people in the midst of COVID-19 pandemic.