Search | Korea Science

The Binarization of Text Regions in Natural Scene Images, based on Stroke Width Estimation (자연 영상에서 획 너비 추정 기반 텍스트 영역 이진화)

Zhang, Chengdong; Kim, Jung Hwan; Lee, Guee Sang
Smart Media Journal / v.1 no.4 / pp.27-34 / 2012
In this paper, a novel text binarization method is presented that can handle complex conditions such as shadows, non-uniform illumination due to highlights or object projection, and cluttered backgrounds. To locate the target text region, a focus line is assumed to pass through the text region. Next, connected component analysis and stroke width estimation, based on the location of the focus line, are used to locate the bounding box of the text region and the box of each connected component. A series of classifications is applied to identify whether each connected component (CC) is text or non-text. In addition, a modified K-means clustering method based on the HCL color space is applied to reduce the color dimension. A text binarization procedure based on the location of the text components and the seed color pixels is then used to generate the final result.
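The connected component step in this abstract can be illustrated with a minimal sketch. It assumes a binary mask stored as a list of lists, 4-connectivity, and a horizontal focus line given as a row index. The function names `connected_components` and `components_on_focus_line` are hypothetical; the abstract does not describe the authors' implementation.

```python
from collections import deque


def connected_components(mask):
    """Return one bounding box (top, left, bottom, right) per
    4-connected foreground region of a binary grid."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited foreground pixel.
            top, left, bottom, right = r, c, r, c
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def components_on_focus_line(mask, focus_row):
    """Keep only the components whose bounding box crosses the focus row
    (an assumed simplification of the focus-line idea)."""
    return [b for b in connected_components(mask) if b[0] <= focus_row <= b[2]]
```

Filtering by the focus row is only one plausible way to use the focus line; the stroke width estimation and the text/non-text classification are not covered here.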

A Study on architectural historic of Hotel DIABUTSU (대불호텔의 건축사적 고찰)

Sohn, Jang-Won; Cho, Hee-Ra
Journal of The Korean Digital Architecture Interior Association / v.11 no.3 / pp.27-34 / 2011
The DIABUTSU hotel was the first hotel built in Korea, and it is generally believed to have been built in 1888. However, many questions remain, and this study was conducted to uncover the truth. Non-text media are a useful source for such research, but they are rarely used in Korea; this study therefore relies on non-text media. The findings show that the DIABUTSU hotel was built in 1884 as a Japanese-style two-story wooden building. HORI hosted guests there, and many foreigners stayed. Underwood, Appenzeller, and Carles stayed at this hotel and wrote about it in 1885. The three-story building is commonly regarded as the first hotel, but this is incorrect. The first hotel was the Japanese-style wooden building built in 1884.

Multi-Emotion Recognition Model with Text and Speech Ensemble (텍스트와 음성의 앙상블을 통한 다중 감정인식 모델)

Yi, Moung Ho; Lim, Myoung Jin; Shin, Ju Hyun
Smart Media Journal / v.11 no.8 / pp.65-72 / 2022
Due to COVID-19, counseling has shifted from face-to-face to non-face-to-face methods, and the importance of non-face-to-face counseling is increasing. Its advantage is that clients can consult online anytime, anywhere, and without risk of COVID-19 infection. However, it is difficult to understand the client's mind because non-verbal expressions are hard to convey. It is therefore important to recognize emotions by accurately analyzing text and voice during non-face-to-face counseling. In this paper, text data is vectorized using FastText after separating consonants, and voice data is vectorized by extracting features using Log Mel Spectrogram and MFCC, respectively. We propose a multi-emotion recognition model that recognizes five emotions from the vectorized data using an LSTM model. Multi-emotion recognition performance is measured using RMSE. In the experiment, the RMSE of the proposed model was 0.2174, the lowest error compared with the models using text data and voice data individually.
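For reference, the RMSE metric mentioned in this abstract can be sketched as follows. The function name `rmse` and the example intensities are illustrative assumptions; the abstract does not say how the five emotion scores are arranged or averaged.

```python
import math


def rmse(predicted, actual):
    """Root mean squared error between two equal-length score vectors."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical intensities for five emotions (not data from the paper).
example_error = rmse([0.1, 0.7, 0.0, 0.2, 0.0], [0.0, 1.0, 0.0, 0.0, 0.0])
```

A lower value means the predicted emotion intensities are closer to the reference labels.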

Text Detection based on Edge Enhanced Contrast Extremal Region and Tensor Voting in Natural Scene Images

Pham, Van Khien; Kim, Soo-Hyung; Yang, Hyung-Jeong; Lee, Guee-Sang
Smart Media Journal / v.6 no.4 / pp.32-40 / 2017
In this paper, a robust text detection method based on the edge-enhanced contrasting extremal region (CER) is proposed, using the stroke width transform (SWT) and tensor voting. First, the edge-enhanced CER extracts a number of covariant regions, which are stable connected components, from input images. Next, the SWT is computed from the distance map and used to eliminate non-text regions. Then, the candidate text regions are verified by tensor voting, which takes the center points obtained in the previous step as input to compute curve salience values. Finally, connected component grouping is applied to cluster components close to characters. The proposed method is evaluated on the ICDAR2003 and ICDAR2013 text detection competition datasets, and the experimental results show high accuracy compared to previous methods.

SMS Text Messages Filtering using Word Embedding and Deep Learning Techniques (워드 임베딩과 딥러닝 기법을 이용한 SMS 문자 메시지 필터링)

Lee, Hyun Young; Kang, Seung Shik
Smart Media Journal / v.7 no.4 / pp.24-29 / 2018
Text analysis techniques for natural language processing in deep learning represent words in vector form through word embedding. In this paper, we propose a method of constructing a document vector and classifying it as a spam or normal text message, using word embedding and deep learning. Automatic spacing applied during preprocessing ensures that words with similar contexts are represented adjacently in the vector space. In addition, intentional word formation errors with non-alphabetic or unusual characters, which are designed to avoid being blocked by spam message filters, are handled. Two embedding algorithms, CBOW and skip-gram, are used to produce the sentence vector, and the performance and accuracy of the deep-learning-based spam filter model are measured by comparison with those of SVM Light.
https://doi.org/10.30693/SMJ.2018.7.4.24
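The difference between the two embedding algorithms named in this abstract lies in how training examples are formed from a token sequence. The sketch below shows only that step, with a hypothetical window size and hypothetical function names; it does not reproduce the paper's preprocessing, network, or sentence-vector construction.

```python
def skipgram_pairs(tokens, window=2):
    """Skip-gram: each center word predicts each word in its context window."""
    pairs = []
    for i, center in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs


def cbow_examples(tokens, window=2):
    """CBOW: the surrounding context words jointly predict the center word."""
    examples = []
    for i, center in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        context = [tokens[j] for j in range(lo, hi) if j != i]
        if context:
            examples.append((context, center))
    return examples


# Illustrative message tokens (not taken from the paper's data).
message = ["free", "prize", "call", "now"]
```

With `window=1`, `skipgram_pairs(message)` yields one pair per neighboring word, while `cbow_examples(message)` yields one example per position.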

FMM: Fusion media middleware for actual feeling service (실감 서비스 제공을 위한 융합 미디어 미들웨어)

Lee, Ji-Hye; Yoon, Yong-Ik
Journal of Korea Multimedia Society / v.13 no.2 / pp.308-315 / 2010
User-generated content (UGC) is actively exchanged among internet users in the Web 2.0 environment. With the growth of content-sharing sites, the amount of content produced by non-experts has increased. However, non-expert content usually consists of simple media that has merely been recorded. To add an actual feeling, such as effects and actions, to non-expert content, we propose Fusion Media Middleware (FMM). The FMM can increase user satisfaction by providing an actual feeling. Furthermore, the content is transformed into advanced media that leaves an emotional impression. To provide an actual feeling, the FMM classifies the input media into scenes based on MPEG-7. It then adds an actual feeling to the simple media by inserting effects such as sound, image, and text into the classified media. Using the BSD code of MPEG-21, the FMM links the input media with the effects. Through the mapping of the BSD code, the FMM controls synchronization between media and effects. In this paper, using the Fusion Media Middleware, non-expert content gains value as multimedia that conveys an actual feeling. Furthermore, the FMM creates a new flow of media circulation.

New Text Steganography Technique Based on Part-of-Speech Tagging and Format-Preserving Encryption

Mohammed Abdul Majeed; Rossilawati Sulaiman; Zarina Shukur
KSII Transactions on Internet and Information Systems (TIIS) / v.18 no.1 / pp.170-191 / 2024
The transmission of confidential data using cover media is called steganography. The three requirements of any effective steganography system are high embedding capacity, security, and imperceptibility. The text file's structure, which makes syntax and grammar more visually obvious than in other media, contributes to its poor imperceptibility. Text is regarded as the most challenging carrier for hiding secret data because it contains little redundant data compared to other digital objects. Unicode characters, especially non-printing or invisible ones, are employed for hiding data by mapping a specific number of secret data bits to each character and inserting the character into the spaces of the cover text. These characters are known to offer limited space for embedding secret data. Current studies that use Unicode characters in text steganography have focused on increasing the data hiding capacity despite the insufficient redundant data in a text file. A sequential embedding pattern is often selected and applied to all available positions in the cover text. This embedding pattern negatively affects the imperceptibility and security of the text steganography system. Thus, this study attempts to overcome these limitations using the part-of-speech (POS) tagging technique combined with the randomization concept in data hiding. Combining these two techniques allows the Unicode characters to be inserted in randomized patterns at specific positions in the cover text, increasing data hiding capacity with minimal effect on imperceptibility and security. Format-preserving encryption (FPE) is also used to encrypt the secret message without changing its size before the embedding process. A comparison of the proposed technique with existing ones demonstrates that it fulfils the cover file's capacity, imperceptibility, and security requirements.
https://doi.org/10.3837/tiis.2024.01.010
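The sequential embedding pattern that this abstract identifies as a weakness can be sketched in a few lines. This is the baseline only, not the proposed POS-tagging and randomization scheme and without FPE. It assumes one bit per space, with U+200B standing for 0 and U+200C for 1; that mapping and the function names are illustrative choices.

```python
ZERO = "\u200b"  # ZERO WIDTH SPACE, used here to encode bit 0 (assumed mapping)
ONE = "\u200c"   # ZERO WIDTH NON-JOINER, used here to encode bit 1 (assumed mapping)


def embed(cover, bits):
    """Insert one invisible character after each space, in order,
    until every payload bit has been written."""
    if set(bits) - {"0", "1"}:
        raise ValueError("payload must be a string of 0s and 1s")
    out, i = [], 0
    for ch in cover:
        out.append(ch)
        if ch == " " and i < len(bits):
            out.append(ONE if bits[i] == "1" else ZERO)
            i += 1
    if i < len(bits):
        raise ValueError("cover text has too few spaces for the payload")
    return "".join(out)


def extract(stego):
    """Read the payload back by collecting the invisible characters."""
    return "".join("1" if ch == ONE else "0" for ch in stego if ch in (ZERO, ONE))
```

Because every payload bit lands in the first available space, the hidden characters cluster at the start of the text in a predictable order, which is the pattern the study aims to avoid.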

Sell-sumer: The New Typology of Influencers and Sales Strategy in Social Media (셀슈머(Sell-sumer)로 진화한 인플루언서의 새로운 유형과 소셜미디어에서의 세일즈 전략)

Shin, Hajin; Kim, Sulim; Hong, Manny; Hwang, Bom Nym; Yang, Hee-Dong
Knowledge Management Research / v.22 no.4 / pp.217-235 / 2021
As 49% of the world's population uses social media platforms, communication and content sharing within social media are more active than ever. In this environment, the one-person media market has grown rapidly and shaped public opinion, creating a new trend called the sell-sumer. This study defines new types of influencers by product category by analyzing the topic concentration of influencers' commercial and non-commercial keywords and the impact of the ratio of commercial postings on sales. It is hoped that the results will help influencers on social media who are becoming sell-sumers develop new sales strategies. The study classifies influencers' commercial and non-commercial posts using Python, performs text mining using KoNLPy, and calculates the similarity between FastText-based word vectors. The results confirm that the higher the keyword topic concentration of an influencer's commercial postings, the higher the sales. In addition, cluster analysis confirmed that influencers in each product category fall into four types and that sales differ significantly between groups. The implications of this study may therefore suggest empirical social media sales strategies for influencers working on social media and for marketers who want to use them as marketing tools.
https://doi.org/10.15813/kmr.2021.22.4.012
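The abstract does not name the similarity measure applied to the FastText word vectors. Cosine similarity is a common choice for embeddings, so the sketch below assumes it; the function name and the toy vectors are illustrative only.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    dot = sum(x * y for x, y in zip(u, v))
    return dot / (norm_u * norm_v)
```

Values near 1 indicate words used in similar contexts, while values near 0 indicate unrelated words.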

An Enhanced Text Mining Approach using Ensemble Algorithm for Detecting Cyber Bullying

Z. Sunitha Bai; Sreelatha Malempati
International Journal of Computer Science & Network Security / v.23 no.5 / pp.1-6 / 2023
Text mining (TM) is widely used to process unstructured text documents and data from various domains. Text mining is also referred to as text classification. It is popular in many areas, such as movie reviews, product reviews on e-commerce websites, sentiment analysis, topic modeling, and cyber-bullying detection in social media messages. Cyber-bullying is the abuse of someone with insulting language; personal abuse, sexual harassment, and other types of abuse fall under cyber-bullying. Several existing systems have been developed to detect bullying words based on their context in social networking sites (SNS), which have become a platform for bullying. In this paper, an enhanced text mining approach using an ensemble algorithm (ETMA) is developed to solve several problems of traditional algorithms and to improve accuracy, processing time, and the quality of the result. ETMA is used to analyze bullying text on social networking sites such as Facebook and Twitter. ETMA is applied to a synthetic dataset, collected from various data sources, that consists of 5k messages labeled as bullying or non-bullying. Performance is analyzed in terms of precision, recall, F1-score, and accuracy.
https://doi.org/10.22937/IJCSNS.2023.23.5.1
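The four evaluation measures listed in this abstract follow from the counts of true and false positives and negatives. The sketch below assumes binary labels with 1 meaning bullying; the function name and the sample labels are hypothetical and are not results from the paper.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Precision, recall, F1-score, and accuracy for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    # Guard against empty denominators by reporting 0.0 in those cases.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}


# Illustrative labels: 1 = bullying, 0 = non-bullying.
scores = binary_metrics([1, 1, 1, 0, 0, 0, 0, 1], [1, 1, 0, 0, 0, 1, 0, 1])
```

Precision penalizes false alarms, recall penalizes missed bullying messages, and F1 is their harmonic mean.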

Korean Text to Gloss: Self-Supervised Learning approach

Thanh-Vu Dang; Gwang-hyun Yu; Ji-yong Kim; Young-hwan Park; Chil-woo Lee; Jin-Young Kim
Smart Media Journal / v.12 no.1 / pp.32-46 / 2023
Natural Language Processing (NLP) has grown tremendously in recent years. In particular, bilingual and multilingual translation models have been deployed widely in machine translation and have gained vast attention from the research community. In contrast, few studies have focused on translating between spoken and sign languages, especially for non-English languages. Prior works on Sign Language Translation (SLT) have shown that a mid-level sign gloss representation enhances translation performance. Therefore, this study presents a new large-scale Korean sign language dataset, the Museum-Commentary Korean Sign Gloss (MCKSG) dataset, which includes 3828 pairs of Korean sentences and their corresponding sign glosses used in museum-commentary contexts. In addition, we propose a translation framework based on self-supervised learning, in which the pretext task is text-to-text translation from a Korean sentence to its back-translated versions; the pre-trained network is then fine-tuned on the MCKSG dataset. Using self-supervised learning helps to overcome the shortage of sign language data. Experimental results show that our proposed model outperforms a baseline BERT model by 6.22%.
https://doi.org/10.30693/SMJ.2023.12.1.32

Search Result 56, Processing Time 0.021 seconds
