• Title/Summary/Keyword: Word Detection

Search Result 220, Processing Time 0.024 seconds

A Method for Spelling Error Correction in Korean Using a Hangul Edit Distance Algorithm (한글 편집거리 알고리즘을 이용한 한국어 철자오류 교정방법)

  • Bak, Seung Hyeon;Lee, Eun Ji;Kim, Pan Koo
    • Smart Media Journal
    • /
    • v.6 no.1
    • /
    • pp.16-21
    • /
    • 2017
  • Long time has passed since computers which used to be a means of research were commercialized and available for the general public. People used writing instruments to write before computer was commercialized. However, today a growing number of them are using computers to write instead. Computerized word processing helps write faster and reduces fatigue of hands than writing instruments, making it better fit to making long texts. However, word processing programs are more likely to cause spelling errors by the mistake of users. Spelling errors distort the shape of words, making it easy for the writer to find and correct directly, but those caused due to users' lack of knowledge or those hard to find may make it almost impossible to produce a document free of spelling errors. However, spelling errors in important documents such as theses or business proposals may lead to falling reliability. Consequently, it is necessary to conduct research on high-level spelling error correction programs for the general public. This study was designed to produce a system to correct sentence-level spelling errors to normal words with Korean alphabet similarity algorithm. On the basis of findings reported in related literatures that corrected words are significantly similar to misspelled words in form, spelling errors were extracted from a corpus. Extracted corrected words were replaced with misspelled ones to correct spelling errors with spelling error detection algorithm.

A Deep Learning-based Depression Trend Analysis of Korean on Social Media (딥러닝 기반 소셜미디어 한글 텍스트 우울 경향 분석)

  • Park, Seojeong;Lee, Soobin;Kim, Woo Jung;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.1
    • /
    • pp.91-117
    • /
    • 2022
  • The number of depressed patients in Korea and around the world is rapidly increasing every year. However, most of the mentally ill patients are not aware that they are suffering from the disease, so adequate treatment is not being performed. If depressive symptoms are neglected, it can lead to suicide, anxiety, and other psychological problems. Therefore, early detection and treatment of depression are very important in improving mental health. To improve this problem, this study presented a deep learning-based depression tendency model using Korean social media text. After collecting data from Naver KonwledgeiN, Naver Blog, Hidoc, and Twitter, DSM-5 major depressive disorder diagnosis criteria were used to classify and annotate classes according to the number of depressive symptoms. Afterwards, TF-IDF analysis and simultaneous word analysis were performed to examine the characteristics of each class of the corpus constructed. In addition, word embedding, dictionary-based sentiment analysis, and LDA topic modeling were performed to generate a depression tendency classification model using various text features. Through this, the embedded text, sentiment score, and topic number for each document were calculated and used as text features. As a result, it was confirmed that the highest accuracy rate of 83.28% was achieved when the depression tendency was classified based on the KorBERT algorithm by combining both the emotional score and the topic of the document with the embedded text. This study establishes a classification model for Korean depression trends with improved performance using various text features, and detects potential depressive patients early among Korean online community users, enabling rapid treatment and prevention, thereby enabling the mental health of Korean society. It is significant in that it can help in promotion.

A Name Recognition Based Call-and-Come Service for Home Robots (가정용 로봇의 호출음 등록 및 인식 시스템)

  • Oh, Yoo-Rhee;Yoon, Jae-Sam;Park, Ji-Hun;Kim, Min-A;Kim, Hong-Kook;Kong, Dong-Geon;Myung, Hyun;Bang, Seok-Won
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.360-365
    • /
    • 2008
  • We propose an efficient robot name registration and recognition method in order to enable a Call-and-Come service for home robots. In the proposed method for the name registration, the search space is first restricted by using monophone-based acoustic models. Second, the registration of robot names is completed by using triphone-based acoustic models in the restricted search space. Next, the parameter for the utterance verification is calculated to reduce the acceptance rate of false calls. In addition, acoustic models are adapted by using a distance speech database to improve the performance of distance speech recognition, Moreover, the location of a user is estimated by using a microphone array. The experimental result on the registration and recognition of robot names shows that the word accuracy of speech recognition is 98.3%.

  • PDF

A Community-Based Influence Measuring Scheme in Delay-Tolerant Networks (지연 감내 네트워크에서 커뮤니티 기반 영향력 측정 기법)

  • Kim, Chan-Myung;Kim, Yong-Hwan;Han, Youn-Hee
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.38B no.1
    • /
    • pp.87-96
    • /
    • 2013
  • Influence propagation is an important research issue in social networks. Influence propagation means that the status or the disposition of nodes get changed by new idea, information and gossip propagated by other nodes. Influenced nodes also make other nodes influenced across the network. The influence propagation problem based on 'word of mouth' referral is to find most influential nodes set in networks to maximize influence. In this paper, we study the influence measuring and finding most influential nodes set in Delay-Tolerant Networks. It is difficult to measure exact influential power in Delay-Tolerant networks where network topology is not stable due to the nodal mobility. In this paper, we propose a distributed scheme that each node constructs $k$-clique community structure and estimates local influential power in Delay-Tolerant Networks. Simulation results show that the influential nodes information estimated by proposed scheme is in agreement with a global view of influential nodes information.

An Information Diffusion Model Considering Non-explicit Relationships in the Blog World (블로그 월드에서 비명시적 관계를 고려한 정보 파급 모델)

  • Kwon, Yong-Suk;Kim, Sang-Wook;Park, Sun-Ju;Lim, Seung-Hwan;Lee, Jae-Bum
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.15 no.5
    • /
    • pp.360-364
    • /
    • 2009
  • Analyzing information diffusion in a blog world is a very useful research issue, which can be used for predicting information diffusion, abnormally detection, marketing, and revitalizing the blog world. Existing studies on information diffusion in blog networks establish explicit relationship between blogs, and analyze only the word-of-mouth effect through such explicit relationships. However, we observed that more than 85% of all information diffusion in a blog world occurs through non-explicit relationships. In this paper, we propose a new model that considers both explicit and non-explicit relationships between blogs in order to explain all information diffusion phenomena in a blog world. We verify the superiority of our proposed models through extensive experiments of information diffusions at a real blog net-work.

The Assessment of Risk of Bias on Clinical Trials of Korean Medicine for Alopecia (탈모증의 한약제제 임상연구에 대한 비뚤림 위험 평가)

  • Ryu, Deok-hyun;Roh, Seok-sun
    • Journal of Haehwa Medicine
    • /
    • v.24 no.1
    • /
    • pp.25-36
    • /
    • 2015
  • Objective : This study aims to evaluate a risk of bias by Risk of Bias tool and RoBANS(Risk of Bias Assessment tool for Non-randomized Study) tool for clinical trial papers proving treatment effect of herbs to alopecia and provides the newest reason of effectiveness of herbs to alopecia. Methos : Data were collected through electronic database including NDSL, KISS, KMBASE, Koreantk, OASIS, KoreaMed, KISTI, Pubmd, Cochrane CENTRAL and CINAHL. Two experts in Oriental Medince assessed risk of bias of randomized controlled trials by Cochrane group's Risk of Bias tool and non-randomized controlled trials by RoBANS tool after searching, reviewing and selecting papers. Results : Total number of selected trials is 20 including 4 randomized controlled trials, 13 non-randomized controlled trials and 3 case reports. This study evaluates the risk of bias of 17 papers including 4 randomized controlled trials and 13 non-randomized controlled trials except 3 case reports by risk of bias tool and RoBANS tool. All papers of randomized controlled trials are evaluated unclear for random sequence generation and allocation concealment as there are no word on them. And all papers of non-randomized controlled trials are evaluated unclear for blinding of outcome assessments and relatively low for others. Conclusion : We must try to specify concretely methods of allocation concealment after planning and practicing it for reducing a selection bias in randomized controlled trials. Also report a reason of missing value and blinding outcome assessments. And we have to agonize and mention methods of blinding of researchers for reducing a detection bias in non-randomized controlled trials.

  • PDF

A High-performance Lane Recognition Algorithm Using Word Descriptors and A Selective Hough Transform Algorithm with Four-channel ROI (다중 ROI에서 영상 화질 표준화 및 선택적 허프 변환 알고리즘을 통한 고성능의 차선 인식 알고리즘)

  • Cho, Jae-Hyun;Jang, Young-Min;Cho, Sang-Bok
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.52 no.2
    • /
    • pp.148-161
    • /
    • 2015
  • The examples that used camera in the vehicle is increasing with the growth of the automotive market, and the importance of the image processing technique is expanding. In particular, the Lane Departure Warning System (LDWS) and related technologies are under development in various fields. In this paper, in order to improve the lane recognition rate more than the conventional method, we extract a Normalized Luminance Descriptor value and a Normalized Contrast Descriptor value, and adjust image gamma values to modulate Normalized Image Quality by using the correlation between the extracted two values. Then, we apply the Hough transform using the optimized accumulator cells to the four-channel ROI. The proposed algorithm was verified in 27 frame/sec and $640{\times}480$ resolution. As a result, Lane recognition rate was higher than the average 97% in day, night, and late-night road environments. The proposed method also shows successful lane recognition in sections with curves or many lane boundary.

Unsupervised Motion Learning for Abnormal Behavior Detection in Visual Surveillance (영상감시시스템에서 움직임의 비교사학습을 통한 비정상행동탐지)

  • Jeong, Ha-Wook;Chang, Hyung-Jin;Choi, Jin-Young
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.48 no.5
    • /
    • pp.45-51
    • /
    • 2011
  • In this paper, we propose an unsupervised learning method for modeling motion trajectory patterns effectively. In our approach, observations of an object on a trajectory are treated as words in a document for latent dirichlet allocation algorithm which is used for clustering words on the topic in natural language process. This allows clustering topics (e.g. go straight, turn left, turn right) effectively in complex scenes, such as crossroads. After this procedure, we learn patterns of word sequences in each cluster using Baum-Welch algorithm used to find the unknown parameters in a hidden markov model. Evaluation of abnormality can be done using forward algorithm by comparing learned sequence and input sequence. Results of experiments show that modeling of semantic region is robust against noise in various scene.

A Study on the Improvement of DTW with Speech Silence Detection (음성의 묵음구간 검출을 통한 DTW의 성능개선에 관한 연구)

  • Kim, Jong-Kuk;Jo, Wang-Rae;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.117-124
    • /
    • 2003
  • Speaker recognition is the technology that confirms the identification of speaker by using the characteristic of speech. Such technique is classified into speaker identification and speaker verification: The first method discriminates the speaker from the preregistered group and recognize the word, the second verifies the speaker who claims the identification. This method that extracts the information of speaker from the speech and confirms the individual identification becomes one of the most efficient technology as the service via telephone network is popularized. Some problems, however, must be solved for the real application as follows; The first thing is concerning that the safe method is necessary to reject the imposter because the recognition is not performed for the only preregistered customer. The second thing is about the fact that the characteristic of speech is changed as time goes by, So this fact causes the severe degradation of recognition rate and the inconvenience of users as the number of times to utter the text increases. The last thing is relating to the fact that the common characteristic among speakers causes the wrong recognition result. The silence parts being included the center of speech cause that identification rate is decreased. In this paper, to make improvement, We proposed identification rate can be improved by removing silence part before processing identification algorithm. The methods detecting speech area are zero crossing rate, energy of signal detect end point and starting point of the speech and process DTW algorithm by using two methods in this paper. As a result, the proposed method is obtained about 3% of improved recognition rate compare with the conventional methods.

  • PDF

Face Recognition on complex backgrounds using Neural Network (복잡한 배경에서 신경망을 이용한 얼굴인식)

  • Han, Jun-Hee;Nam, Kee-Hwan;Park, Ho-Sik;Lee, Young-Sik;Jung, Yeon-Gil;Ra, Sang-Dong;Bae, Cheol-Soo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.1
    • /
    • pp.1149-1152
    • /
    • 2005
  • Detecting faces in images with complex backgrounds is a difficult task. Our approach, which obtains state of the art results, is based on a generative neural network model: the Constrained Generative Model (CGM). To detect side view faces and to decrease the number of false alarms, a conditional mixture of networks is used. To decrease the computational time cost, a fast search algorithm is proposed. The level of performance reached, in terms of detection accuracy and processing time, allows to apply this detector to a real word application: the indexation of face images on the Web.

  • PDF