• Title/Summary/Keyword: 텍스트 연구

Search Result 3,471, Processing Time 0.036 seconds

Stock Price Prediction Using Sentiment Analysis: from "Stock Discussion Room" in Naver (SNS감성 분석을 이용한 주가 방향성 예측: 네이버 주식토론방 데이터를 이용하여)

  • Kim, Myeongjin;Ryu, Jihye;Cha, Dongho;Sim, Min Kyu
    • The Journal of Society for e-Business Studies
    • /
    • v.25 no.4
    • /
    • pp.61-75
    • /
    • 2020
  • The scope of data for understanding or predicting stock prices has been continuously widened from traditional structured format data to unstructured data. This study investigates whether commentary data collected from SNS may affect future stock prices. From "Stock Discussion Room" in Naver, we collect 20 stocks' commentary data for six months, and test whether this data have prediction power with respect to one-hour ahead price direction and price range. Deep neural network such as LSTM and CNN methods are employed to model the predictive relationship. Among the 20 stocks, we find that future price direction can be predicted with higher than the accuracy of 50% in 13 stocks. Also, the future price range can be predicted with higher than the accuracy of 50% in 16 stocks. This study validate that the investors' sentiment reflected in SNS community such as Naver's "Stock Discussion Room" may affect the demand and supply of stocks, thus driving the stock prices.

A Study on the Knowledge Acquisition from Local Companies and Job Seekers using Data Mining Techniques (데이터마이닝 기법을 이용한 지역 기업과 구직자로부터의 지식 도출에 관한 연구)

  • Kim, Jin-Sung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.22 no.2
    • /
    • pp.141-147
    • /
    • 2012
  • The purpose of the study is the acquisitions of knowledge related in job searching from local companies and job seekers using data mining techniques. At the first step, for the study, we had selected the local companies their headquarters are located in Jeonbuk province. Then we had picked the graduating students out from the high schools, colleges, and universities in the same area as the job seekers. After the targeting of the sample, we had surveyed 560 local companies and 14 schools for the collecting of the preliminary data. As the result of the survey, we could collect 173 responses from the companies and 551 responses from the job seekers. At the second step using data mining, we had adapted the C5.0 algorithm to extract the inference rules. Then we had used the Visual Basic (VB) programming language to visualize the rules at the third step. At the fourth step, we transformed the inference rules into DB tables. At the final step, we had executed the rule inferences to support the development of the long-term human resources development (HRD) strategies. As the result of the study, we could suggest the helpful information to the HRD directors and job seekers in designing their strategies in managing their jobs and career development.

Development of an Integrated DataBase System of Marine Geological and Geophysical Data Around the Korean Peninsula (한반도 해역 해양지질 및 지구물리 자료 통합 DB시스템 개발)

  • KIM, Sung-Dae;BAEK, Sang-Ho;CHOI, Sang-Hwa;PARK, Hyuk-Min
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.19 no.2
    • /
    • pp.47-62
    • /
    • 2016
  • An integrated database(DB) system was developed to manage the marine geological data and geophysical data acquired from around the Korean peninsula from 2009 to 2013. Geological data such as size analysis data, columnar section images, X-ray images, heavy metal data, and organic carbon data of sediment samples, were collected in the form of text files, excel files, PDF files and image files. Geophysical data such as seismic data, magnetic data, and gravity data were gathered in the form of SEG-Y binary files, image files and text files. We collected scientific data from research projects funded by the Ministry of Oceans and Fisheries, data produced by domestic marine organizations, and public data provided by foreign organizations. All the collected data were validated manually and stored in the archive DB according to data processing procedures. A geographic information system was developed to manage the spatial information and provide data effectively using the map interface. Geographic information system(GIS) software was used to import the position data from text files, manipulate spatial data, and produce shape files. A GIS DB was set up using the Oracle database system and ArcGIS spatial data engine. A client/server GIS application was developed to support data search, data provision, and visualization of scientific data. It provided complex search functions and on-the-fly visualization using ChartFX and specially developed programs. The system is currently being maintained and newly collected data is added to the DB system every year.

Speech Animation Synthesis based on a Korean Co-articulation Model (한국어 동시조음 모델에 기반한 스피치 애니메이션 생성)

  • Jang, Minjung;Jung, Sunjin;Noh, Junyong
    • Journal of the Korea Computer Graphics Society
    • /
    • v.26 no.3
    • /
    • pp.49-59
    • /
    • 2020
  • In this paper, we propose a speech animation synthesis specialized in Korean through a rule-based co-articulation model. Speech animation has been widely used in the cultural industry, such as movies, animations, and games that require natural and realistic motion. Because the technique for audio driven speech animation has been mainly developed for English, however, the animation results for domestic content are often visually very unnatural. For example, dubbing of a voice actor is played with no mouth motion at all or with an unsynchronized looping of simple mouth shapes at best. Although there are language-independent speech animation models, which are not specialized in Korean, they are yet to ensure the quality to be utilized in a domestic content production. Therefore, we propose a natural speech animation synthesis method that reflects the linguistic characteristics of Korean driven by an input audio and text. Reflecting the features that vowels mostly determine the mouth shape in Korean, a coarticulation model separating lips and the tongue has been defined to solve the previous problem of lip distortion and occasional missing of some phoneme characteristics. Our model also reflects the differences in prosodic features for improved dynamics in speech animation. Through user studies, we verify that the proposed model can synthesize natural speech animation.

Characteristics of Bearing Capacity under Square Footing on Two-layered Sand (2개층 사질토지반에서 정방형 기초의 지지력 특성)

  • 김병탁;김영수;이종현
    • Journal of the Korean Geotechnical Society
    • /
    • v.17 no.4
    • /
    • pp.289-299
    • /
    • 2001
  • 본 연구는 균질 및 2개층 비균질지반에서 사질토지반 상에 놓인 정방형 기초의 극한지지력과 침하에 대하여 고찰하였다. 본 연구는 얕은기초의 거동에 대한 정방형 기초의 크기, 지반 상대밀도, 기초 폭에 대한 상부층의 두께 비(H/B), 상부층 아래 경계면의 경사($\theta$) 그리고 지반강성비의 영향을 규명하기 위하여 모형실험을 수행하였다. 동일 상대밀도에서 지지력 계수($N_{{\gamma}}$)는 일정하지 않으며 기초 폭에 직접적으로 관련되며 지지력계수는 기초 폭이 증가함에 따라 감소하였다. 기초크기의 영향과 구속압력의 영향을 고려하는 Ueno 방법에 의한 극한지지력의 예측값은 고전적인 지지력 산정식보다 더 잘 일치하며 그 값은 실험값의 65% 이상으로 나타났다. $\theta$=$0^{\circ}$인 2개층 지반의 결과에 근거하여, 극한지지력에 대한 하부층 지반의 영향을 무시할 수 있는 한계 상부층 두께는 기초 폭의 2배로 결정되었다. 그러나, 73%의 상부층 상대밀도인 경우는 침하비($\delta$B) 0.05 이하에서만 이 결과가 유효하였다. 경계면이 경사진 2개층 지반의 결과에 근거하여, 상부층의 상대밀도가 느슨할수록 그리고 상부층의 두께가 클수록 극한지지력에 대한 경계면 경사의 영향은 크지 않는 것으로 나타났다. 경계면의 경사가 증가함에 따른 극한침하량의 변화는 경계면이 수평인 경우($\theta$=$0^{\circ}$)를 기준으로 0.82~1.2(상부층 $D_{r}$=73%인 경우) 그리고 0.9~1.07(상부층 $D_{r}$=50%인 경우) 정도로 나타났다.Markup Language 문서로부터 무선 마크업 언어 문서로 자동 변환된 텍스트를 인코딩하는 경우와 같이 특정한 응용 분야에서는 일반 문자열에 대한 확장 인코딩 기법을 적용할 필요가 있을 수 있다.mical etch-stop method for the etching of Si in TMAH:IPA;pyrazine solutions provides a powerful and versatile alternative process for fabricating high-yield Si micro-membranes. the RSC circle, but also to the logistics system in the SLC circle. Thus, the RSLC model can maximize combat synergy effects by integrating the RSC and the SLC. With a similar logic, this paper develops "A Revised System of Systems with Logistics (RSSL)" which combines "A New system of Systems" and logistics. These tow models proposed here help explain several issues such as logistics environment in future warfare, MOE(Measure of Effectiveness( on logistics performance, and COA(Course of Actions) for decreasing mass and increasing velocity. In particular, velocity in logistics is emphasized.

  • PDF

Sensitivity Enhancement of Polydiacetylene Vesicles through Control of Particle Size and Polymerization Temperature (입자크기와 중합온도 제어를 통한 폴리다이아세틸렌의 센싱감도 향상)

  • Lee, Gil Sun;Oh, Jae Ho;Ahn, Dong June
    • Korean Chemical Engineering Research
    • /
    • v.49 no.4
    • /
    • pp.400-404
    • /
    • 2011
  • Many studies on polydiacetylene(PDA) have been investigated to apply to chemical and biological sensors due to their unique optical properties of color change from blue to red and fluorescence change from non-fluorescence to red fluorescence. Especially, high sensitivity against specific molecules is very important to apply polydiacetylenes to various sensors. In this study, we examined the effect of sensitivity enhancement of 10,12-pentacosadynoic acid(PCDA) vesicles in detection ${\alpha}$-cyclodextrin(CD) according to control of vesicle size by filters with different pore sizes and polymerization temperature. Colorimetric response(CR) was calculated using visible spectrometer. In order to investigate the effect of vesicle size on sensitivity of PDA vesicles, two PCDA vesicles were filtered without filtration and with 0.22 ${\mu}m$ filter. The two PCDA vesicles were polymerized at $25^{\circ}C$ and were incubated with ${\alpha}$-CD(5 mM) for 30 min. The CRs of the former and latter vesicles were 31.4% and 74.0%, respectively. Then, two PCDA vesicles filtered with 0.22 ${\mu}m$ filter were polymerized at $25^{\circ}C$ and $5^{\circ}C$ and were reacted with ${\alpha}$-CD(5 mM) for 30 min to examine the effect of polymerization temperature. The CRs of the former and latter vesicles were 74.0 and 99.2%, respectively. This suggests that vesicle sizes and polymerization temperature are key factors in enhancing the sensitivity of PDA vesicles. In addition, these results are expected to be useful to apply the PDA vesicles as biosensors to detect DNA, protein, and cells.

Detection of Gene Interactions based on Syntactic Relations (구문관계에 기반한 유전자 상호작용 인식)

  • Kim, Mi-Young
    • The KIPS Transactions:PartB
    • /
    • v.14B no.5
    • /
    • pp.383-390
    • /
    • 2007
  • Interactions between proteins and genes are often considered essential in the description of biomolecular phenomena and networks of interactions are considered as an entre for a Systems Biology approach. Recently, many works try to extract information by analyzing biomolecular text using natural language processing technology. Previous researches insist that linguistic information is useful to improve the performance in detecting gene interactions. However, previous systems do not show reasonable performance because of low recall. To improve recall without sacrificing precision, this paper proposes a new method for detection of gene interactions based on syntactic relations. Without biomolecular knowledge, our method shows reasonable performance using only small size of training data. Using the format of LLL05(ICML05 Workshop on Learning Language in Logic) data we detect the agent gene and its target gene that interact with each other. In the 1st phase, we detect encapsulation types for each agent and target candidate. In the 2nd phase, we construct verb lists that indicate the interaction information between two genes. In the last phase, to detect which of two genes is an agent or a target, we learn direction information. In the experimental results using LLL05 data, our proposed method showed F-measure of 88% for training data, and 70.4% for test data. This performance significantly outperformed previous methods. We also describe the contribution rate of each phase to the performance, and demonstrate that the first phase contributes to the improvement of recall and the second and last phases contribute to the improvement of precision.

A Study on the Juxtaposition Technique in Nosan Lee Eun-sang's Sijo - Focusing on the Nosan Sijojip(時調集) - (노산 이은상 시조의 병치 기법 연구 - 노산 시조집을 중심으로 -)

  • Lee, Soon-Hee
    • Sijohaknonchong
    • /
    • v.44
    • /
    • pp.75-103
    • /
    • 2016
  • The purpose of this study is to demonstrate that the main creative attitude in Lee Eun-sang's Sijo relies upon the juxtaposition technique, with paying attention to juxtaposition of being found in the works of being put in the Nosan Sijojip(時調集, collection of Sijo poems), and that this creative attitude provides readers with the easiness for understanding. A type in the juxtaposition technique, which was shown in "Nosan Sijojip", was divided in the dimension of the anaphora in a meaning and the confrontation in a meaning. The anaphora of a meaning was classified into synonymous juxtaposition, comprehensive juxtaposition, specific juxtaposition and syntactic juxtaposition. The confrontation of a meaning was examined in the contradictory juxtaposition. Most of Lee Eun-sang's works are applying this juxtaposition technique. Also, the dynamic of image, which is indicated in juxtaposition, is what was influenced by the British and American imagism. This study will be able to solve problems that modern Sijo has to some extent, and will be helpful even for acquiring the identity in Sijo.

  • PDF

A Study on the Value Factors of Culture Consumers for Corporate Culture Marketing through Big Data Techniques (빅데이터 기법을 통한 기업 문화마케팅을 위한 문화소비자의 가치 요소 연구)

  • Oh, Se Jong
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.1
    • /
    • pp.31-36
    • /
    • 2020
  • Corporate Culture Marketing is a marketing tool that enhances a company's cultural image or conveys its image through culture. Culture Consumer value analysis is important predictive data in identifying the value and pursuit of life in individual consumption behavior, explaining the choice behavior of culture consumers, and serves as the basis for decision making. The research method was linked to the text mining and opinion mining techniques of big data, and extracted positive, negative and neutral words. The analysis targets culture consumers participating in concerts at Hyundai Card's 'Super Concert', which is subject to domestic consumers, and CJ ENM's 'KCON', which is subject to foreign consumers. The culture consumer value elements of corporate culture marketing are the basic conditions, and they were derived as 'Consensus Communication (Expression of Sensibility)', 'Participation Sharing(VIP Belonging)', 'Social Change Issue', 'Differentiating Services', 'Price Discount Benefit' and 'Location Quality'. In the future, we will need to foster 'Culture Technology Marketers' and apply them in areas such as arts management planning, cultural investment, cultural distribution, cultural space, Corporate Culture, CSR and K-pop marketing to enhance corporate interests and brand value and enhance brand value.

Study on the Methodology for Extracting Information from SNS Using a Sentiment Analysis (SNS 감성분석을 이용한 정보 추출 방법론에 관한 연구)

  • Hong, Doopyo;Jeong, Harim;Park, Sangmin;Han, Eum;Kim, Honghoi;Yun, Ilsoo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.16 no.6
    • /
    • pp.141-155
    • /
    • 2017
  • As the use of SNS becomes more active, many people are posting their thoughts about specific events in their SNS in the form of text. As a result, SNS is used in various fields such as finance and distribution to conduct service satisfaction surveys and consumer monitoring. However, in the transportation area, there are not enough cases to utilize unstructured data analysis such as emotional analysis. In this study, we developed an emotional analysis methodology that can be used in transportation by using highway VOC data, which is atypical data collected by Korea Expressway Corporation. The developed methodology consists of morpheme analysis, emotional dictionary construction, and emotional discrimination of the collected unstructured data. The developed methodology was verified using highway related tweet data. As a result of the analysis, it can be guessed that many information and information about the construction and the accident were related to the highway during the analysis period. Also, it seems that users complain about the delay caused by construction and accident.