Search | Korea Science

A New Fine-grain SMS Corpus and Its Corresponding Classifier Using Probabilistic Topic Model

Ma, Jialin;Zhang, Yongjun;Wang, Zhijian;Chen, Bolun
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.2
- /
- pp.604-625
- /
- 2018
Nowadays, SMS spam has been overflowing in many countries. In fact, the standards of filtering SMS spam are different from country to country. However, the current technologies and researches about SMS spam filtering all focus on dividing SMS message into two classes: legitimate and illegitimate. It does not conform to the actual situation and need. Furthermore, they are facing several difficulties, such as: (1) High quality and large-scale SMS spam corpus is very scarce, fine categorized SMS spam corpus is even none at all. This seriously handicaps the researchers' studies. (2) The limited length of SMS messages lead to lack of enough features. These factors seriously degrade the performance of the traditional classifiers (such as SVM, K-NN, and Bayes). In this paper, we present a new fine categorized SMS spam corpus which is unique and the largest one as far as we know. In addition, we propose a classifier, which is based on the probability topic model. The classifier can alleviate feature sparse problem in the task of SMS spam filtering. Moreover, we compare the approach with three typical classifiers on the new SMS spam corpus. The experimental results show that the proposed approach is more effective for the task of SMS spam filtering.
https://doi.org/10.3837/tiis.2018.02.004 인용 PDF KSCI

A Re-configuration Scheme for Social Network Based Large-scale SMS Spam (소셜 네트워크 기반 대량의 SMS 스팸 데이터 재구성 기법)

Jeong, Sihyun;Noh, Giseop;Oh, Hayoung;Kim, Chong-Kwon
- Journal of KIISE
- /
- v.42 no.6
- /
- pp.801-806
- /
- 2015
The Short Message Service (SMS) is one of the most popular communication tools in the world. As the cost of SMS decreases, SMS spam has been growing largely. Even though there are many existing studies on SMS spam detection, researchers commonly have limitation collecting users' private SMS contents. They need to gather the information related to social network as well as personal SMS due to the intelligent spammers being aware of the social networks. Therefore, this paper proposes the Social network Building Scheme for SMS spam detection (SBSS) algorithm that builds synthetic social network dataset realistically, without the collection of private information. Also, we analyze and categorize the attack types of SMS spam to build more complete and realistic social network dataset including SMS spam.
https://doi.org/10.5626/JOK.2015.42.6.801 인용 KSCI

Coward Analysis based Spam SMS Detection Scheme (동시출현 단어분석 기반 스팸 문자 탐지 기법)

Oh, Hayoung
- Journal of the Korea Institute of Information Security & Cryptology
- /
- v.26 no.3
- /
- pp.693-700
- /
- 2016
Analyzing characteristics of spam text messages had limitations since spam datasets are typically difficult to obtain publicly and previous studies focused on spam email. Although existing studies, such as through the use of spam e-mail characterization and utilization of data mining techniques, there are limitations that influence is limited to high spam detection techniques using a single word character. In this paper, we reveal the characteristics of the spam SMS based on experiment and analysis from different perspectives and propose coward analysis based spam SMS detection scheme with a publicly disclosed spam SMS from the University of Singapore. With the extensive performance evaluations, we show false positive and false negative of the proposed method is less than 2%.
https://doi.org/10.13089/JKIISC.2016.26.3.693 인용 PDF KSCI HTML

Sender Authentication Mechanism based on DomainKey with SMS for Spam Mail Sending Protection (대량 스팸메일 발송 방지를 위한 SMS 기반 DomainKey 방식의 송신자 인증 기법)

Lee, Hyung-Woo
- The Journal of the Korea Contents Association
- /
- v.7 no.4
- /
- pp.20-29
- /
- 2007
Although E-mail system is considered as a most important communication media, 'Spam' is flooding the Internet with many copies of the same message, in an attempt to force the message on people who would not otherwise choose to receive it. Most spam is commercial advertising, often for dubious products, get-rich-quick schemes, or quasi-legal services. Therefore advanced anti-spam techniques are required to basically reduce its transmission volume on sender mail server or MTA, etc. In this study, we propose a new sender authentication model with encryption function based on modified DomainKey with SMS for Spam mail protection. From the SMS message, we can get secret information used for verification of its real sender on e-mail message. And by distributing this secret information with SMS like out-of-band channel, we can also combine proposed modules with existing PGP scheme for secure e-mail generation and authentication steps. Proposed scheme provide enhanced authentication function and security on Spam mail protection function because it is a 'dual mode' authentication mechanism.
https://doi.org/10.5392/JKCA.2007.7.4.020 인용 PDF

A Normalization Method of Distorted Korean SMS Sentences for Spam Message Filtering (스팸 문자 필터링을 위한 변형된 한글 SMS 문장의 정규화 기법)

Kang, Seung-Shik
- KIPS Transactions on Software and Data Engineering
- /
- v.3 no.7
- /
- pp.271-276
- /
- 2014
Short message service(SMS) in a mobile communication environment is a very convenient method. However, it caused a serious side effect of generating spam messages for advertisement. Those who send spam messages distort or deform SMS sentences to avoid the messages being filtered by automatic filtering system. In order to increase the performance of spam filtering system, we need to recover the distorted sentences into normal sentences. This paper proposes a method of normalizing the various types of distorted sentence and extracting keywords through automatic word spacing and compound noun decomposition.
https://doi.org/10.3745/KTSDE.2014.3.7.271 인용 PDF KSCI

Discrimination of SPAM and prevention of smishing by sending personally identified SMS(For financial sector) (개인식별화된 SMS 발송을 통한 스팸식별 및 스미싱 예방(금융권중심))

Joo, Choon Kyong;Yoon, Ji Won
- Journal of the Korea Institute of Information Security & Cryptology
- /
- v.24 no.4
- /
- pp.645-653
- /
- 2014
The purpose of this study is to provide low-cost and highly effective methods for customers to pick out SMS(Short Message Service) messages sent by financial institutions among SPAM messages and Smishing, which have rapidly spread recently and have caused critical issues. Above all, the study aims to list problems and limitations of the past efforts and measures to block SPAM messages and provide one method to overcome those limitations. Also, the study aims to verify the effectiveness of the method by implementation of them and conducting surveys of a broad range of customers.
https://doi.org/10.13089/JKIISC.2014.24.4.645 인용 PDF KSCI HTML

A Study on Spam Regulation (스팸규제에 관한 연구)

Baek, Yun-Chul
- Journal of Information Management
- /
- v.38 no.4
- /
- pp.48-67
- /
- 2007
The economic burden which our society has to take exceeds the benefit that it becomes by the free circulation of information. Problems such as inconvenience or inequality between people can also occur since the regulation task of spam e-mail or SMS is imposed on two organs; the Department of Information and Communication and Free Trade Commission. The dualization of regulation separates related laws, which makes exception according to the $\ulcorner$Law on Information Communication Usage and Information Protection$\lrcorner$ or poses double regulation toward the same case. The spam prevention activity at free hands of information communication network provider such as portal site or mobile communication has many limitations along with comparison and analysis of spam regulations abroad. Therefore, examinations on legal obligation such as service restriction, identification and technical measure to spam prevention is needed. This study focuses on making the scope of spam regulation clear by considering the domestic related laws and the general environment of industry, on enacting law which regulates spam including advertisement and on deducting essential facts in enacting or modifying related laws and thus, deducting the form and contents of spam regulation law which is most decent in our domestic environment.
https://doi.org/10.1633/JIM.2007.38.4.048 인용 PDF

SMS Text Messages Filtering using Word Embedding and Deep Learning Techniques (워드 임베딩과 딥러닝 기법을 이용한 SMS 문자 메시지 필터링)

Lee, Hyun Young;Kang, Seung Shik
- Smart Media Journal
- /
- v.7 no.4
- /
- pp.24-29
- /
- 2018
Text analysis technique for natural language processing in deep learning represents words in vector form through word embedding. In this paper, we propose a method of constructing a document vector and classifying it into spam and normal text message, using word embedding and deep learning method. Automatic spacing applied in the preprocessing process ensures that words with similar context are adjacently represented in vector space. Additionally, the intentional word formation errors with non-alphabetic or extraordinary characters are designed to avoid being blocked by spam message filter. Two embedding algorithms, CBOW and skip grams, are used to produce the sentence vector and the performance and the accuracy of deep learning based spam filter model are measured by comparing to those of SVM Light.
https://doi.org/10.30693/SMJ.2018.7.4.24 인용 PDF KSCI

Implementation of A Mobile Application for Spam SMS Filtering Using Set-Based POI Search Algorithm (집합 기반 POI 검색 알고리즘을 활용한 스팸 메시지 판별 모바일 앱 구현)

Ahn, Hye-yeong;Cho, Wan-zee;Lee, Jong-woo
- Journal of Digital Contents Society
- /
- v.16 no.5
- /
- pp.815-822
- /
- 2015
By the growing of SMS phishing victims, applications for processing spam messages are being released in succession. However most spam messages that cleverly modified the content like separating the consonants and vowels are fail to be filtered. In this paper, we implemented an application 'AntiSpam' which is able to identify spam strings in the text message to solve this problem. 'AntiSpam' searches spam strings in the text message by using set-based POI search algorithm, and then calculate the possibility of whether it is spam or not in accordance with the search results. In addition, it catches skillfully disguised spam messages in order to avoid missing the spam filtering. Users, who received a message, can check the result in spam message possibility decision result and the contents of the message and they can choose how to handling the message.
https://doi.org/10.9728/dcs.2015.16.5.815 인용 PDF KSCI

A SVM-based Spam Filtering System for Short Message Service (SMS) (휴대폰 SMS를 위한 SVM 기반의 스팸 필터링 시스템)

Joe, In-Whee;Shim, Hye-Taek
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.34 no.9B
- /
- pp.908-913
- /
- 2009
Mobile phones became important household appliance that cannot be without in our daily lives. And the short messaging service (SMS) in these mobile phones is 1.5 to 2 times more than the voice service. However, the spam filtering functions installed in mobile phones take a method to receive specific number patterns or words and recognize spam messages when those numbers or words are present. However, this method cannot properly filters various types of spam messages currently dispatched. This paper proposes a more powerful and more adaptive spam filtering system using SVM and thesaurus. The system went through a process of isolating words from sample data through pro-processing device and integrating meanings of isolated words using a thesaurus. Then it generated characteristics of integrated words through the chi-square statistics and studied the characteristics. The proposed system is realized in a Window environment and the performance is confirmed through experiments.
PDF KSCI

Search Result 24, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)