• Title/Summary/Keyword: Big data Processing

Search Result 1,063, Processing Time 0.039 seconds

Development of a New Sensor with Divided Multiple Long and Short Wires in Transient Hot-wire Technique (다수의 분할된 긴 열선과 짧은 열선을 갖는 새로운 비정상열선법 센서개발)

  • Lee, Shin-Pyo;Lee, Myung-Hoon
    • Transactions of the Korean Society of Mechanical Engineers B
    • /
    • v.28 no.5
    • /
    • pp.510-517
    • /
    • 2004
  • A fine hot-wire is used both as a heating element and a temperature sensor in transient hot-wire method. The traditional sensor system is unnecessarily big so that it takes large fluid volume to measure the thermal conductivity. To dramatically reduce this fluid volume, a new sensor fabrication and a data processing method are proposed in this article. Contrast to the conventional and most popular two wire sensor, the new sensor system is made up of divided multiple long and short wires. Through validation experiments, it is found that the measured thermal conductivities of the glycerin are exactly same each other between the conventional and proposed new method. Also some technical considerations in arranging the multiple wires are briefly discussed.

Text Mining and Sentiment Analysis for Predicting Box Office Success

  • Kim, Yoosin;Kang, Mingon;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.8
    • /
    • pp.4090-4102
    • /
    • 2018
  • After emerging online communications, text mining and sentiment analysis has been frequently applied into analyzing electronic word-of-mouth. This study aims to develop a domain-specific lexicon of sentiment analysis to predict box office success in Korea film market and validate the feasibility of the lexicon. Natural language processing, a machine learning algorithm, and a lexicon-based sentiment classification method are employed. To create a movie domain sentiment lexicon, 233,631 reviews of 147 movies with popularity ratings is collected by a XML crawling package in R program. We accomplished 81.69% accuracy in sentiment classification by the Korean sentiment dictionary including 706 negative words and 617 positive words. The result showed a stronger positive relationship with box office success and consumers' sentiment as well as a significant positive effect in the linear regression for the predicting model. In addition, it reveals emotion in the user-generated content can be a more accurate clue to predict business success.

Big Data Processing and Management Service on Cloud (클라우드 기반 대규모 데미터 처리 및 관리 기술)

  • Lee, M.Y.
    • Electronics and Telecommunications Trends
    • /
    • v.24 no.4
    • /
    • pp.41-54
    • /
    • 2009
  • 인터넷 서비스 데이터량의 지속적인 증가로 대량의 원시 데이터로부터 정보를 가공 처리하는 과정, 체계화된 정보의 저장 관리 및 유용한 정보를 추출하기 위한 분석 등에 분산 컴퓨팅 기술을 적용하는 움직임이 활발히 진행되고 있다. 기존의 RDBMS 기술, MPI 분산 처리 기술 등은 대규모 데이터 처리 환경에 적용하기에는 운영 환경, 기능/성능면에서 확장성 혹은 고비용 문제가 따른다. 그러므로 저가의 서버들로 구성된 대규모 클러스터 환경을 기반으로 분산 컴퓨팅 기술을 적용한 새로운 시스템들이 대규모 데이터 처리를 요하는 인터넷 서비스 응용에 이용되고 있다. 이를 기반으로 바이오인포매틱스, 과학 시뮬레이션, 비즈니스 인텔리전스 등 다른 응용 영역으로 확대하여 클라우드 서비스로 제공하려는 비즈니스 모델이 제시되고 있다. 본 논문에서는 이와 같은 분산 컴퓨팅 기술을 적용한 대규모 데이터 저장 관리 및 처리 기술 동향을 조사하고 클라우드 기반 서비스로의 발전 방향을 서술한다.

Architecture Design of Smart Mobile Platform for Industry (산업용 모바일 융합단말 플랫폼 구조 설계)

  • Park, Chong-Won
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2011.10a
    • /
    • pp.765-768
    • /
    • 2011
  • At present, the smart mobile device has been big recognition in general due to fusion, mobility and convenience. On the one hand Industy also needs smart mobile device because more and complex data processing. Hereupon this thesis will study reflected industry needs smart mobile pad's design structure, and applicable area to use this device.

  • PDF

Out-Of-Domain Detection Using Hierarchical Dirichlet Process

  • Jeong, Young-Seob
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.1
    • /
    • pp.17-24
    • /
    • 2018
  • With improvement of speech recognition and natural language processing, dialog systems are recently adapted to various service domains. It became possible to get desirable services by conversation through the dialog system, but it is still necessary to improve separate modules, such as domain detection, intention detection, named entity recognition, and out-of-domain detection, in order to achieve stable service offer. When it misclassifies an in-domain sentence of conversation as out-of-domain, it will result in poor customer satisfaction and finally lost business. As there have been relatively small number of studies related to the out-of-domain detection, in this paper, we introduce a new method using a hierarchical Dirichlet process and demonstrate the effectiveness of it by experimental results on Korean dataset.

Deep Convolutional Neural Network with Bottleneck Structure using Raw Seismic Waveform for Earthquake Classification

  • Ku, Bon-Hwa;Kim, Gwan-Tae;Min, Jeong-Ki;Ko, Hanseok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.1
    • /
    • pp.33-39
    • /
    • 2019
  • In this paper, we propose deep convolutional neural network(CNN) with bottleneck structure which improves the performance of earthquake classification. In order to address all possible forms of earthquakes including micro-earthquakes and artificial-earthquakes as well as large earthquakes, we need a representation and classifier that can effectively discriminate seismic waveforms in adverse conditions. In particular, to robustly classify seismic waveforms even in low snr, a deep CNN with 1x1 convolution bottleneck structure is proposed in raw seismic waveforms. The representative experimental results show that the proposed method is effective for noisy seismic waveforms and outperforms the previous state-of-the art methods on domestic earthquake database.

Profane or Not: Improving Korean Profane Detection using Deep Learning

  • Woo, Jiyoung;Park, Sung Hee;Kim, Huy Kang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.1
    • /
    • pp.305-318
    • /
    • 2022
  • Abusive behaviors have become a common issue in many online social media platforms. Profanity is common form of abusive behavior in online. Social media platforms operate the filtering system using popular profanity words lists, but this method has drawbacks that it can be bypassed using an altered form and it can detect normal sentences as profanity. Especially in Korean language, the syllable is composed of graphemes and words are composed of multiple syllables, it can be decomposed into graphemes without impairing the transmission of meaning, and the form of a profane word can be seen as a different meaning in a sentence. This work focuses on the problem of filtering system mis-detecting normal phrases with profane phrases. For that, we proposed the deep learning-based framework including grapheme and syllable separation-based word embedding and appropriate CNN structure. The proposed model was evaluated on the chatting contents from the one of the famous online games in South Korea and generated 90.4% accuracy.

Research Trends in Quantum Machine Learning (양자컴퓨팅 & 양자머신러닝 연구의 현재와 미래)

  • J.H. Bang
    • Electronics and Telecommunications Trends
    • /
    • v.38 no.5
    • /
    • pp.51-60
    • /
    • 2023
  • Quantum machine learning (QML) is an area of quantum computing that leverages its principles to develop machine learning algorithms and techniques. QML is aimed at combining traditional machine learning with the capabilities of quantum computing to devise approaches for problem solving and (big) data processing. Nevertheless, QML is in its early stage of the research and development. Thus, more theoretical studies are needed to understand whether a significant quantum speedup can be achieved compared with classical machine learning. If this is the case, the underlying physical principles may be explained. First, fundamental concepts and elements of QML should be established. We describe the inception and development of QML, highlighting essential quantum computing algorithms that are integral to QML. The advent of the noisy intermediate-scale quantum era and Google's demonstration of quantum supremacy are then addressed. Finally, we briefly discuss research prospects for QML.

A Study on Big Data Processing and UI Visualization for Web-based Chatbot Media (웹기반 챗봇 미디어를 위한 빅데이터 처리와 UI 시각화 연구)

  • Koh, Seok-Joo;Lee, Kang-Bin;Kim, Kyoung-Min;Park, Jun-heon;Jung, Tae-hyun;Park, Jae-hwa
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.07a
    • /
    • pp.613-615
    • /
    • 2020
  • 최근 확장된 인공지능 생태계를 바탕으로 전 분야에 챗봇 서비스(Chatbot Service)의 활용이 증가했다. 이에 따라 챗봇서비스의 활용 분야 및 전달수단도 메신저 앱을 넘어 온라인 웹, 모바일 어플리케이션 등 점차 다양화되는 추세이다. 디지털서비스의 혁신수단으로 인공지능 기반의 챗봇을 적극 도입 중이고 발전하는 챗봇 서비스에 발맞춰 챗봇이 제공하는 데이터도체계를 갖추고 있다. 이에 본 논문은 챗봇이 제공하는 데이터 중 웹을 기반으로 하는 데이터의 시각화 방안을 제시한다. 전국적으로 분포되어있는 방대한 양의 데이터를 처리하여 사용자에게 웹 미디어로 정보를 전달하기 위한 기술적 방법을 연구.개발하였다. 이는 웹을 기반으로 하는 챗봇뿐만 아니라 방대한 양의 정보를 처리해야하는 다양한 웹 미디어서비스에도 적용 가능하며 웹 미디어를더욱 보편화 할 수 있는 방법이다.

  • PDF

The Developing of Analytical Statistics System for the Efficiency of Defense Management (국방경영 효율화를 위한 분석형 통계시스템 구축)

  • Lee, Jung-Man
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.38 no.3
    • /
    • pp.87-94
    • /
    • 2015
  • Recently, management based on statistical data has become a big issue and the importance of the statistics has been emphasized for the management innovation in the defense area. However, the Military Management based on the statistics is hard to expect because of the shortage of the statistics in the military. There are many military information systems having great many data created in real time. Since the infrastructure for gathering data form the many systems and making statistics by using gathered data is not equipped, the usage of the statistics is poor in the military. The Analytical Defense Statistics System is designed to improve effectively the defense management in this study. The new system having the sub-systems of Data Management, Analysis and Service can gather the operational data from interlocked other Defense Operational Systems and produce Defense Statistics by using the gathered data beside providing statistics services. Additionally, the special function for the user oriented statistics production is added to make new statistics by handling many statistics and data. The Data Warehouse is considered to manage the data and Online Analytical Processing tool is used to enhance the efficiency of the data handling. The main functions of the R, which is a well-known analysis program, are considered for the statistical analysis. The Quality Management Technique is applied to find the fault from the data of the regular and irregular type. The new Statistics System will be the essence of the new technology like as Data Warehouse, Business Intelligence, Data Standardization and Statistics Analysis and will be helpful to improve the efficiency of the Military Management.