• Title/Summary/Keyword: news data

Search Result 885, Processing Time 0.036 seconds

Named Entity Recognition and Dictionary Construction for Korean Title: Books, Movies, Music and TV Programs (한국어 제목 개체명 인식 및 사전 구축: 도서, 영화, 음악, TV프로그램)

  • Park, Yongmin;Lee, Jae Sung
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.3 no.7
    • /
    • pp.285-292
    • /
    • 2014
  • A named entity recognition method is used to improve the performance of information retrieval systems, question answering systems, machine translation systems and so on. The targets of the named entity recognition are usually PLOs (persons, locations and organizations). They are usually proper nouns or unregistered words, and traditional named entity recognizers use these characteristics to find out named entity candidates. The titles of books, movies and TV programs have different characteristics than PLO entities. They are sometimes multiple phrases, one sentence, or special characters. This makes it difficult to find the named entity candidates. In this paper we propose a method to quickly extract title named entities from news articles and automatically build a named entity dictionary for the titles. For the candidates identification, the word phrases enclosed with special symbols in a sentence are firstly extracted, and then verified by the SVM with using feature words and their distances. For the classification of the extracted title candidates, SVM is used with the mutual information of word contexts.

Genetic Clustering with Semantic Vector Expansion (의미 벡터 확장을 통한 유전자 클러스터링)

  • Song, Wei;Park, Soon-Cheol
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.3
    • /
    • pp.1-8
    • /
    • 2009
  • This paper proposes a new document clustering system using fuzzy logic-based genetic algorithm (GA) and semantic vector expansion technology. It has been known in many GA papers that the success depends on two factors, the diversity of the population and the capability to convergence. We use the fuzzy logic-based operators to adaptively adjust the influence between these two factors. In traditional document clustering, the most popular and straightforward approach to represent the document is vector space model (VSM). However, this approach not only leads to a high dimensional feature space, but also ignores the semantic relationships between some important words, which would affect the accuracy of clustering. In this paper we use latent semantic analysis (LSA)to expand the documents to corresponding semantic vectors conceptually, rather than the individual terms. Meanwhile, the sizes of the vectors can be reduced drastically. We test our clustering algorithm on 20 news groups and Reuter collection data sets. The results show that our method outperforms the conventional GA in various document representation environments.

The Study on Domestic Fashion Information Service Industry for Systematization of Fashion Trend Information Planning Process (패션정보기획의 체계화를 위한 국내 패션정보산업의 고찰)

  • Choi, Mi-Young;Son, Mi-Young
    • Fashion & Textile Research Journal
    • /
    • v.10 no.6
    • /
    • pp.926-935
    • /
    • 2008
  • The field of textile and fashion is regard to be sensitive to trend, however, the professional fashion information planning company for trend forecasting has not settled down in Korea. This study was designed to propose systemizing for fashion trend information planning in domestic fashion information service market. The empirical research was conducted by analysing in-depth interview data and news-scrap contents about each fashion information planning company. The result are as follows; First, fashion information service showed a little difference according to the type of fashion information companies, but they provided not only general fashion trends but also external market environmental information, survey-based consumer information and various segmented market research reports including academic information. Second, the fashion information planning process is largely divided into 3 stages; trend analysis, trend forecasting, trend application. The trend application step is the stage which connects the fashion information service industry to the fashion business. Thirdly, as a result of the competitive power evaluation for fashion information planning, the domestic fashion information planning companies came to reveal the fact that the possibility of carrying out and information analysis power were weak, however, how to present trend information had a relatively competitive. Consequently, this study is expected to play a role in understanding the importance of fashion trend information, and further ahead it would be helpful to organize the curriculum of fashion information planning subject in order to educate the future fashion executives.

Exploratory Study on the Success Factors of SPA Brands from Marketing Perspectives -Based on Grounded Theory- (SPA 브랜드의 마케팅 성공요인 탐색 -근거이론을 중심으로-)

  • Kim, Kyung Ran;Yang, Su Jin
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.39 no.2
    • /
    • pp.190-203
    • /
    • 2015
  • The fashion industry has been rearranged by Global SPA brands (like ZARA and H&M), which are powerful retailers that integrate the value chain ranging from manufacturing to sales. SPA brands can offer good quality of clothing at a reasonable price by cutting the margin between the supply chain. They are also called fast fashions since they make expedited efforts to respond to market trends and consumers. Despite the slow growth of the fashion industry in Korea, as global SPA brands rapidly expand market share, traditional fashion companies have launched several SPA brands such as MIXXO and SPAO (E-LAND), 8SECONDS (CHEIL INDUSTRIES). The few academic studies on this subject are focused on the analysis of secondary data such as news and books. The current research is qualitative and empirical attempts to explore the success factor of SPA brands with analysis of 1:1 in-depth interviews with experts who have worked for global SPAs such as Uniqlo, H&M, and ZARA, based on the grounded theory. The main phenomenon was shown to be that global SPA brands were popular since they offer a variety of products with a large assortment at reasonable and cheap prices in a large scale and multifunctional retail store. Most of them displayed main phenomena that can be realized due to the purchasing cycle of clothing that is shorter with consumers' regarding clothing as consumables. Global SPA brands had three types of marketing strategy: sellable product, sales strategy according to consumer response, and multifunctional stores. Each global SPA brand developed marketing strategies based on core competency and national conditions. The three success factors shorten the consumer decision making process of clothing. This study concludes with implications for practitioners of SPA brands born in Korea.

Cultural and Social Implications of Metrosexual Mode

  • Oh, Yun-Jeong;Cho, Kyu-Hwa
    • Journal of Fashion Business
    • /
    • v.10 no.3
    • /
    • pp.117-128
    • /
    • 2006
  • The purpose of this study was to understand changes of the current young generation's lifestyle, aesthetic attitude for an appearance, and way of thinking by making a close investigation into metrosexual, the recent mode, and find out its cultural and social implications. As a method of the study, the literature and the Internet data were reviewed. Articles from newspapers, magazines and the Internet were chosen roughly from the year 2000 to now because metrosexual mode remarkably boomed before and after 2000. Books related to the theory on the mode in a costume culture were referred. Also, articles in daily newspapers which dealt with cultural and social issues were reviewed, fashion magazines for men such as Esquire and GQ showing the new trend in men's lifestyle and fashion were examined, and the Internet providing us the latest news from cultural and social topics to fashion trends were investigated. The backgrounds of the rise of metrosexual mode were a collapse of stereotypes in various fields, spread of lookism in a visual image period, extension of commercialism, and expansion of men's character casual trend. Metrosexual was defined as an urban male with a strong aesthetic sense who spends a great deal of time and money on his appearance and lifestyle. His fashion style was characterized by slim and flowing silhouette, feminine and luxurious materials such as transparent chiffon, silk and cotton with a light and soft touch, and a knitted wear with a flowing line, a wide variety of vivid and pastel colors, floral and geometric patterns, and the decorative details like lace, beads, embroidery, and fur. From spread of this mode, two cultural and social implications were extracted. Firstly, the current young generation's aesthetic standards for the perfect man changed from macho man to considerate man who had a good appearance and this suggested that a conventional sex role broke down. Secondly, men began to explore for their own identity escaping from traditionally standardized masculinity that they had been forced to follow.

Awareness and satisfaction on dental implant treatment (임플란트 시술에 대한 인지도 및 만족도)

  • Kim, Soo-Kyung;Kim, Sun-Yi;Jeon, Hee-Young;Lee, Kyeong-Hee
    • Journal of Korean society of Dental Hygiene
    • /
    • v.13 no.3
    • /
    • pp.395-401
    • /
    • 2013
  • Objectives : The purpose of this research was to investigate the awareness of an adult on implants and the relevant factors which affect the satisfaction of a patient after an implant treatment. Methods : This study was conducted to 407 adult subjects in Seoul and Gyeonggi. A total of 384 data were analyzed except the questionnaires having poor responses or errors. Results : The acquaintance route of implant was TV advertisements, self-knowledge, internet, news, and newspapers. Dentist's ability to practice implant was the most important factor in patient's choice. The responents answered the expected lifespan of an implant was more than 5 years to 10 years. In terms of dental health management behavior on implants, the average response of the highest 4.07 points of 5 Likert scale. Generally women are more concerned with implant than men(p<0.01). The highly educated and elderly patients had tendency to receive more treatment(p<0.0001). Patients were more satisfactory after receiving regular checkups after treatment(p<0.05). The low expenses of implant satisfied the patients(p<0.05). Conclusions : As implant technology advances, the concern of patients on implants also increase. So reduction of cost can make the patients have access to the dentist and the patients' oral health must be improved through continuous dental care.

Topic Modeling on Fine Dust Issues Using LDA Analysis (LDA 기법을 이용한 미세먼지 이슈의 토픽모델링 분석)

  • Yoon, soonuk;Kim, Minchul
    • Journal of Energy Engineering
    • /
    • v.29 no.2
    • /
    • pp.23-29
    • /
    • 2020
  • In this study, the last 10 years of news data on fine dust was collected and 80 topics are selected through LDA analysis. As a result, weather-related information made up the main words for the topic, and we can see that fine dust becomes a big issue below 10 degrees Celsius. The frequency of exposure to the media and the maximum concentration of fine dust are correlated with positive. Topics related to fine dust reduction measures and the government's comprehensive measures over the past decade, topics related to products such as air purifiers related to fine dust, topics related to policies protecting vulnerable people from fine dust, and topics on fine dust reduction through R&D were found to be major topics. Measures against fine dust as a social issue can be seen to be closely related to the government's policy.

Connected Component-Based and Size-Independent Caption Extraction with Neural Networks (신경망을 이용한 자막 크기에 무관한 연결 객체 기반의 자막 추출)

  • Jung, Je-Hee;Yoon, Tae-Bok;Kim, Dong-Moon;Lee, Jee-Hyong
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.7
    • /
    • pp.924-929
    • /
    • 2007
  • Captions which appear in images include information that relates to the images. In order to obtain the information carried by captions, the methods for text extraction from images have been developed. However, most existing methods can be applied to captions with fixed height of stroke's width. We propose a method which can be applied to various caption size. Our method is based on connected components. And then the edge pixels are detected and grouped into connected components. We analyze the properties of connected components and build a neural network which discriminates connected components which include captions from ones which do not. Experimental data is collected from broadcast programs such as news, documentaries, and show programs which include various height caption. Experimental result is evaluated by two criteria : recall and precision. Recall is the ratio of the identified captions in all the captions in images and the precision is the ratio of the captions in the objects identified as captions. The experiment shows that the proposed method can efficiently extract captions various in size.

Control of Time-varying and Nonstationary Stochastic Systems using a Neural Network Controller and Dynamic Bayesian Network Modeling (신경회로망 제어기와 동적 베이시안 네트워크를 이용한 시변 및 비정치 확률시스템의 제어)

  • Cho, Hyun-Cheol;Lee, Jin-Woo;Lee, Young-Jin;Lee, Kwon-Soon
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.7
    • /
    • pp.930-938
    • /
    • 2007
  • Captions which appear in images include information that relates to the images. In order to obtain the information carried by captions, the methods for text extraction from images have been developed. However, most existing methods can be applied to captions with fixed height of stroke's width. We propose a method which can be applied to various caption size. Our method is based on connected components. And then the edge pixels are detected and grouped into connected components. We analyze the properties of connected components and build a neural network which discriminates connected components which include captions from ones which do not. Experimental data is collected from broadcast programs such as news, documentaries, and show programs which include various height caption. Experimental result is evaluated by two criteria : recall and precision. Recall is the ratio of the identified captions in all the captions in images and the precision is the ratio of the captions in the objects identified as captions. The experiment shows that the proposed method can efficiently extract captions various in size.

An Analysis on the Contents Related to Hypertension In the Television Broadcast (영상매체의 고혈압관련 기사 내용 분석)

  • Ko Il Sun;Kim Tae Wha;Kim Eui Sook;Lee Sun Mi;Lee Jung Ja
    • Journal of Korean Public Health Nursing
    • /
    • v.18 no.1
    • /
    • pp.90-102
    • /
    • 2004
  • The purpose of the study was to analyze the current status of hypertension related information on the mass-media. Data were collected on the hypertension related reports in three major broadcasting centers, KBS1$\cdot$2, MBC, SBS, for 2 years, 1999-2001. Sample of the study was 134 reports. The results were as follows: 1. There were differences in frequencies by broadcasting center and programs. KBS and 9PM News were highest in proportions, $62.6\%\;and\;37.3\%$ respectively. 2. In regard to reporting time, $90\%$ were reported in the afternoon. and $62.5\%$ of those reports were in 9 PM, followed by 8PM. & 7PM. and 6AM. 3. In regard to area of the report, $35.8\%$ belonged to social section, followed by $26.1\%$ science. $15.7\%$ international. and $11.2\%$ life and health. 4. In terms of monthly distribution, December, November, and August had higher proportion of reports than other months as well as fall and winter. 5. There were higher proportion of reports containing 'treatment and management' with 'complication management' targeted to 'patients' than 'prevention' targeted to 'general population' in terms of content of the report. In summary, MBC and SBS were more focused on 'treatment and management' with KBS more focused on 'prevention'. There were more 'prevention' related reports in summer, and 'complication management' reports in the morning with 'treatment and management' reports in the afternoon.

  • PDF