• Title/Summary/Keyword: 교차비

Search Result 1,168, Processing Time 0.025 seconds

Korean Word Sense Disambiguation using Dictionary and Corpus (사전과 말뭉치를 이용한 한국어 단어 중의성 해소)

  • Jeong, Hanjo;Park, Byeonghwa
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.1-13
    • /
    • 2015
  • As opinion mining in big data applications has been highlighted, a lot of research on unstructured data has made. Lots of social media on the Internet generate unstructured or semi-structured data every second and they are often made by natural or human languages we use in daily life. Many words in human languages have multiple meanings or senses. In this result, it is very difficult for computers to extract useful information from these datasets. Traditional web search engines are usually based on keyword search, resulting in incorrect search results which are far from users' intentions. Even though a lot of progress in enhancing the performance of search engines has made over the last years in order to provide users with appropriate results, there is still so much to improve it. Word sense disambiguation can play a very important role in dealing with natural language processing and is considered as one of the most difficult problems in this area. Major approaches to word sense disambiguation can be classified as knowledge-base, supervised corpus-based, and unsupervised corpus-based approaches. This paper presents a method which automatically generates a corpus for word sense disambiguation by taking advantage of examples in existing dictionaries and avoids expensive sense tagging processes. It experiments the effectiveness of the method based on Naïve Bayes Model, which is one of supervised learning algorithms, by using Korean standard unabridged dictionary and Sejong Corpus. Korean standard unabridged dictionary has approximately 57,000 sentences. Sejong Corpus has about 790,000 sentences tagged with part-of-speech and senses all together. For the experiment of this study, Korean standard unabridged dictionary and Sejong Corpus were experimented as a combination and separate entities using cross validation. Only nouns, target subjects in word sense disambiguation, were selected. 93,522 word senses among 265,655 nouns and 56,914 sentences from related proverbs and examples were additionally combined in the corpus. Sejong Corpus was easily merged with Korean standard unabridged dictionary because Sejong Corpus was tagged based on sense indices defined by Korean standard unabridged dictionary. Sense vectors were formed after the merged corpus was created. Terms used in creating sense vectors were added in the named entity dictionary of Korean morphological analyzer. By using the extended named entity dictionary, term vectors were extracted from the input sentences and then term vectors for the sentences were created. Given the extracted term vector and the sense vector model made during the pre-processing stage, the sense-tagged terms were determined by the vector space model based word sense disambiguation. In addition, this study shows the effectiveness of merged corpus from examples in Korean standard unabridged dictionary and Sejong Corpus. The experiment shows the better results in precision and recall are found with the merged corpus. This study suggests it can practically enhance the performance of internet search engines and help us to understand more accurate meaning of a sentence in natural language processing pertinent to search engines, opinion mining, and text mining. Naïve Bayes classifier used in this study represents a supervised learning algorithm and uses Bayes theorem. Naïve Bayes classifier has an assumption that all senses are independent. Even though the assumption of Naïve Bayes classifier is not realistic and ignores the correlation between attributes, Naïve Bayes classifier is widely used because of its simplicity and in practice it is known to be very effective in many applications such as text classification and medical diagnosis. However, further research need to be carried out to consider all possible combinations and/or partial combinations of all senses in a sentence. Also, the effectiveness of word sense disambiguation may be improved if rhetorical structures or morphological dependencies between words are analyzed through syntactic analysis.

Proposal of Localization Policy Based on the Status of Chinese's Research Facilities and Equipment Construction in Korean Basic and Analytical Science Field (국내 기초·분석과학 분야 내 중국산 연구시설·장비 구축 현황에 따른 국산화 정책 제언)

  • Kim, Chang-Yong;Chung, Taewon;Kong, Jaehyun;Park, Chan-Soo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.6
    • /
    • pp.460-471
    • /
    • 2019
  • The aim of this study was to examine the scale and market share of Chinese's research facility & equipment in the domestic research equipment market of basic and analytical science field for analyzing the difference of the number and amount of construction by year of acquisition, national research facility equipment standard classification code, and type of institution based on the information of the research equipment invested by the Korean government for the past 14 years. In addition, we analyzed the correlation among the year of acquisition, equipment standard classification code, and type of institution variables. As of January 1 2019, from 2005 to 2018, 50 Chinese's research facilities & equipments (main equipment with a construction cost of 30 million won or more) built in the basic and analytical science fields were selected for this study and their number of construction, amount of construction, year of acquisition, type of institution, and standard classification code were analyzed. Differences of the number and amount of construction with-in and by year of acquisition, standard classification code, and type of institution were tested using a single sample Chi-square test, Mann-Whitney U test, and Kruskal-wallis test. The correlation among the three variables was analyzed by using the Chi-square test of cross-tabulation analysis. And there was a statistically significant correlation among the year of acquisition, standard classification code, and type of institution (p<.05). Compared to the 2000s, in the 2010s, high-priced Optical Electronics/Video Equipment was installed at private universities, private enterprises, and government-affiliated research institute. Therefore, the domestic construction status of Chinese's research facility & equipment in the basic science and analytical science field is less than that of the domestic ones, but the number and the amount of construction are increasing statistically. So it is necessary for the government to be able to recognize the possibility that the Chinese's research facility and equipment can encroach on the domestic research industry market and to prepare related provision.

The manage of a public office who 'Junsangseo(典牲署)' and Official Road(官路) of lower officials(參下官) at the 17th - 18th century (17~18세기 전생서(典牲署)의 관직 운영과 참하관(參下官)의 관로(官路))

  • Na, young hun
    • (The)Study of the Eastern Classic
    • /
    • no.69
    • /
    • pp.45-82
    • /
    • 2017
  • This paper aims at concrete examination of the 'Official Road(官路)' of the late Joseon Dynasty through government administration of the 17th - 18th century 'Junsangseo(典牲署)'. Until now, the study of the central political system in the Joseon Dynasty was mainly studied by the politically important bureaucrat 'Dangsanggwan(堂上官)', and even if he studied the 'Official Road(官路)', he was a student from the a graduate of Mungwa(文科) and the 'Clean and imfortant Official(淸要職)' connected with it It was examined mainly. As a result, this research attempts to elucidate the routes of 'non - Clean and imfortant Official(非淸要職)' who have not been studied so far. However, it is difficult to deal with all the 'lower officials(參下官)' reaching 263 in total, so it was targeted at the 'Junsangseo(典牲署)' where the 'List of official(先生案)' exists in the 17th - 18th century. In chapter 2, we examined the historical value of 'List of official(先生案)' and were able to secure the confidence of the materials. In Chapter 3, we specifically examined the origins of officials from the 'Junsangseo(典牲署)', the official route, and the occupation. As a result, the 'Junsangseo(典牲署)' 'lower officials(參下官)' was predominantly from the 'Munum(門蔭)'. In addition, I confirmed that I was stepping on a public road that roughly promoted one rank. The number of days in office has also been promoted satisfying the court occupation days. Although this is an analysis limited to 'Junsangseo(典牲署)', it seems that 'lower officials(參下官)' of 'Junsangseo(典牲署)' had gone through routes and routes that were roughly similar to the 'lower officials' of the main office. If we can assume this, we can understand the character of the late Joseon Dynasty 'lower officials(參下官)' by understanding the character of 'lower officials(參下官)' of 'Junsangseo(典牲署)'. To declare this, more case analysis is necessary, and it is necessary to convert a lot of 'List of official(先生案)' data scattered nationwide into DB.

Prediction of Distribution Changes of Carpinus laxiflora and C. tschonoskii Based on Climate Change Scenarios Using MaxEnt Model (MaxEnt 모델링을 이용한 기후변화 시나리오에 따른 서어나무 (Carpinus laxiflora)와 개서어나무 (C. tschonoskii)의 분포변화 예측)

  • Lee, Min-Ki;Chun, Jung-Hwa;Lee, Chang-Bae
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.23 no.1
    • /
    • pp.55-67
    • /
    • 2021
  • Hornbeams (Carpinus spp.), which are widely distributed in South Korea, are recognized as one of the most abundant species at climax stage in the temperate forests. Although the distribution and vegetation structure of the C. laxiflora community have been reported, little ecological information of C. tschonoskii is available. Little effort was made to examine the distribution shift of these species under the future climate conditions. This study was conducted to predict potential shifts in the distribution of C. laxiflora and C. tschonoskii in 2050s and 2090s under the two sets of climate change scenarios, RCP4.5 and RCP8.5. The MaxEnt model was used to predict the spatial distribution of two species using the occurrence data derived from the 6th National Forest Inventory data as well as climate and topography data. It was found that the main factors for the distribution of C. laxiflora were elevation, temperature seasonality, and mean annual precipitation. The distribution of C. tschonoskii, was influenced by temperature seasonality, mean annual precipitation, and mean diurnal rang. It was projected that the total habitat area of the C. laxiflora could increase by 1.05% and 1.11% under RCP 4.5 and RCP 8.5 scenarios, respectively. It was also predicted that the distributional area of C. tschonoskii could expand under the future climate conditions. These results highlighted that the climate change would have considerable impact on the spatial distribution of C. laxiflora and C. tschonoskii. These also suggested that ecological information derived from climate change impact assessment study can be used to develop proper forest management practices in response to climate change.

Biasing Factors in Self-Report Assessment of Bullying/Victimization: Examining Variability in Involvement Rates by Testing Conditions (자기보고식 괴롭힘 경험률 평가의 편향요인 탐색: 평가조건 변인을 중심으로)

  • Lee, Donghyung
    • Korean Journal of School Psychology
    • /
    • v.15 no.3
    • /
    • pp.459-488
    • /
    • 2018
  • The self-report assessment has been most commonly used to estimate bullying/victimization (B/V) rates in most domestic and international prevalence studies. However, the presence of many potential biasing factors in such an assessment method, including specific operationalization/measurement strategies and testing conditions, has become an issue due to a considerable variability in reported involvement rates across studies. This study analyzed self-reported B/V involvement rates on Olweus Bullying Questionnaire (OBQ) among 690 Korean middle school students by gender and two different cut-offs (generous vs. strict cut-offs) and examined if the involvement rates were significantly varied by testing conditions such as presentation vs. omission of a precise definition of B/V, anonymous vs. non-anonymous/confidential administration, and the use of global vs. specific questions. Chi-square analyses revealed that boys displayed higher involvement rates on global measures of B/V and on items related to direct forms of B/V, with no significant gender differences on specific measures of relational B/V rates. It was also found that a global rate of bullying and specific rates of verbal B/V were 111% to 157% higher when no definition was provided. However, anonymous vs. non-anonymous administration had no significant impacts on rates of involvement, except for one item; there were also no significant differences in reported degrees of frankness and perceived confidentiality of their responses across two adminstration conditions. Finally, when involvement rates were assessed by using specific vs. global items, they were 68% to 148% higher with binominal correlations in low to moderate ranges. Findings also indicated that global items had a high specificity but a relatively low sensitivity. Implications of these findings were fully discussed for researchers and practitioners in the field of B/V assessment.

Preference of Elementary School Students Compared by Dietitians' Perception in School Lunch Program (학교급식 음료 선호도에 대한 초등학생과 영양사의 인식 비교)

  • Bae, Moon-Hee;Seo, Sun-Hee
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.36 no.8
    • /
    • pp.1083-1093
    • /
    • 2007
  • The purpose of this study was to investigate the difference between students' beverage preference and dietitians' perception in elementary school lunch program. This study was conducted in three phases: (1) questionnaire development and survey administration to elementary school students (2) survey administration to dietitians who were in charge of the elementary school food service, and (3) comparison of beverage preferences between elementary school students and dietitians. In phase I, 703 elementary school students in Seoul were surveyed from July 11 to July 19. In Phase II, 100 school food service dietitians in Seoul participated by mail survey from September 15 to October 30, 2006. Based on the results, elementary school students tended to show a neutral milk preference (mean=3.04), whereas dietitians perceived that elementary school students had lower milk preference (mean=2.67). Also dietitians perceived higher yogurt preference (mean=4.27) than the real elementary school students' preference (mean=4.02). T-test results showed the gender difference on milk and yogurt preference. Male students had higher milk preference (t=4.912, p<0.001) and yogurt preference (t=3.621, p<0.001) than female students. Elementary school students showed high fruit juice preference (mean=4.34); however, dietitians perceived lower fruit juice preference of students (mean=3.92). There was no gender difference on fruit juice preference. Though elementary school students had higher fruit juice preference, the frequency of fruit juice served in school lunch was quite low. Over half of the dietitians reported that they served fruit juice less than once a semester. The results of this study indicated the existence of distinctive difference between students' fruit juice preference and school lunch menu offerings.

Construction of Consumer Confidence index based on Sentiment analysis using News articles (뉴스기사를 이용한 소비자의 경기심리지수 생성)

  • Song, Minchae;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.1-27
    • /
    • 2017
  • It is known that the economic sentiment index and macroeconomic indicators are closely related because economic agent's judgment and forecast of the business conditions affect economic fluctuations. For this reason, consumer sentiment or confidence provides steady fodder for business and is treated as an important piece of economic information. In Korea, private consumption accounts and consumer sentiment index highly relevant for both, which is a very important economic indicator for evaluating and forecasting the domestic economic situation. However, despite offering relevant insights into private consumption and GDP, the traditional approach to measuring the consumer confidence based on the survey has several limits. One possible weakness is that it takes considerable time to research, collect, and aggregate the data. If certain urgent issues arise, timely information will not be announced until the end of each month. In addition, the survey only contains information derived from questionnaire items, which means it can be difficult to catch up to the direct effects of newly arising issues. The survey also faces potential declines in response rates and erroneous responses. Therefore, it is necessary to find a way to complement it. For this purpose, we construct and assess an index designed to measure consumer economic sentiment index using sentiment analysis. Unlike the survey-based measures, our index relies on textual analysis to extract sentiment from economic and financial news articles. In particular, text data such as news articles and SNS are timely and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. There exist two main approaches to the automatic extraction of sentiment from a text, we apply the lexicon-based approach, using sentiment lexicon dictionaries of words annotated with the semantic orientations. In creating the sentiment lexicon dictionaries, we enter the semantic orientation of individual words manually, though we do not attempt a full linguistic analysis (one that involves analysis of word senses or argument structure); this is the limitation of our research and further work in that direction remains possible. In this study, we generate a time series index of economic sentiment in the news. The construction of the index consists of three broad steps: (1) Collecting a large corpus of economic news articles on the web, (2) Applying lexicon-based methods for sentiment analysis of each article to score the article in terms of sentiment orientation (positive, negative and neutral), and (3) Constructing an economic sentiment index of consumers by aggregating monthly time series for each sentiment word. In line with existing scholarly assessments of the relationship between the consumer confidence index and macroeconomic indicators, any new index should be assessed for its usefulness. We examine the new index's usefulness by comparing other economic indicators to the CSI. To check the usefulness of the newly index based on sentiment analysis, trend and cross - correlation analysis are carried out to analyze the relations and lagged structure. Finally, we analyze the forecasting power using the one step ahead of out of sample prediction. As a result, the news sentiment index correlates strongly with related contemporaneous key indicators in almost all experiments. We also find that news sentiment shocks predict future economic activity in most cases. In almost all experiments, the news sentiment index strongly correlates with related contemporaneous key indicators. Furthermore, in most cases, news sentiment shocks predict future economic activity; in head-to-head comparisons, the news sentiment measures outperform survey-based sentiment index as CSI. Policy makers want to understand consumer or public opinions about existing or proposed policies. Such opinions enable relevant government decision-makers to respond quickly to monitor various web media, SNS, or news articles. Textual data, such as news articles and social networks (Twitter, Facebook and blogs) are generated at high-speeds and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. Although research using unstructured data in economic analysis is in its early stages, but the utilization of data is expected to greatly increase once its usefulness is confirmed.

Management of Critical Control Points to Improve Microbiological Quality of Potentially Hazardous Foods Prepared by Restaurant Operations (외식업체에서 제공하는 잠재적 위험 식품의 미생물적 품질향상을 위한 중점관리점 관리방안)

  • Chun, Hae-Yeon;Choi, Jung-Hwa;Kwak, Tong-Kyung
    • Korean journal of food and cookery science
    • /
    • v.30 no.6
    • /
    • pp.774-784
    • /
    • 2014
  • The purpose of this study was to present management guidelines for critical control points by analyzing microbiological hazardous elements through screening Potentially Hazardous Foods (PHF) menus in an effort improve the microbiological quality of foods prepared by restaurant operations. Steamed spinach with seasoning left at room temperature presents a range of risk temperatures which microorganisms could flourish, and it exceeded all microbiological safety limits in our study. On the other hand, steamed spinach with seasoning stored in a refrigerator had Aerobic Plate Counts of $2.86{\pm}0.5{\log}\;CFU/g$ and all other microbiological tests showed that their levels were below the limit. The standard plate counts of raw materials of lettuce and tomato were $4.66{\pm}0.4{\log}\;CFU/g$ and $3.08{\pm}0.4{\log}\;CFU/g$, respectively. Upon washing, the standard plate counts were $3.12{\pm}0.6{\log}\;CFU/g$ and $2.10{\pm}0.3{\log}\;CFU/g$, respectively, but upon washing after chlorination, those were $2.23{\pm}0.3{\log}\;CFU/g$ and $0.72{\pm}0.7{\log}\;CFU/g$, respectively. The standard plate counts of baby greens, radicchio and leek were $6.02{\pm}0.5{\log}\;CFU/g$, $5.76{\pm}0.1{\log}\;CFU/g$ and $6.83{\pm}0.5{\log}\;CFU/g$, respectively. After 5 minutes of chlorination, the standard plate counts were $4.10{\pm}0.6{\log}\;CFU/g$, $5.14{\pm}0.1{\log}\;CFU/g$ and $5.30{\pm}0.3{\log}\;CFU/g$, respectively. After 10 minutes of chlorination treatment, the standard plate counts were $2.58{\pm}0.3{\log}\;CFU/g$, $4.27{\pm}0.6{\log}\;CFU/g$, and $4.18{\pm}0.5{\log}\;CFU/g$, respectively. The microbial levels decreased as the time of chlorination increased. This study showed that the microbiological quality of foods was improved with the proper practices of time-temperature control, sanitization control, seasoning control, and personal and surface sanitization control. It also presents management guidelines for the control of potentially hazardous foods at the critical control points in the process of restaurant operations.

The Problems, Confidence and Satisfaction of Teachers on Implementation of "Technology and Home Economics" Subject in the 7th Curriculum (제7차 "기술.가정" 교과 운영에 대한 교사의 애로점, 교수 활동 자신감 및 만족도 -대구광역시 중.고교 "기술.가정" 담당 교사를 중심으로-)

  • Jang Hyun-Sook;Choi Ji-Hye
    • Journal of Korean Home Economics Education Association
    • /
    • v.18 no.1 s.39
    • /
    • pp.17-29
    • /
    • 2006
  • The purpose of this research was to examine the problems, confidence and satisfaction of teachers on the subject ${\ulcorner}technology and home economics{\lrcorner}$ in the 7th national curriculm. For this research, questionnaires were sent by post to teachers who teach technology and home economics in middle schools and high schools. The collected questionnaires were technically analyzed by SPSS/WIN 10.0 program, which measured frequency, percentage, average, standard deviation. According to the types of data, they were also analyzed by t-test and cross tabulation analyses. The results of this research were summarized as follows. 1) There were two teaching types of technology and home economics: the partial charge and the whole charge teaching according to teachers' majors, and both types occurred in similar percentage. The partial charge teaching means that teachers majoring in technology teach only the technology part and teachers majoring in home economics teach only the home economics part when they teach the same subject, technology and home economics. These days the partial charge teaching more often occurs in national or public schools than in private schools, and in coeducational schools than in girls' or boys' schools. 2) The major problems of teaching technology and home economics were caused in order by teachers' lack of skills and knowledge which we not their own major, the lack of students' interests and teaching materials, and burden of tests. 3) Teachers' confidence in teaching the contents of the subject, technology and home economics, made a significant difference according to their majors. Teachers whose major was technology felt more confident when they taught the chapters of the textbooks related to their major, technology, while teachers whose major was home economics felt more confident when they taught the chapters of the textbooks related to their major, home economics. According to implementation types, the partial charge teaching gave higher confidence to the teachers than the whole charge one in teaching almost all the chapters of the textbook. 4) According to implementation types, teachers' satisfaction was showed to be higher in the partial charge teaching than in the whole charge one.

  • PDF

Decomposition of Leaf Litter Containing Heavy Metals in the Andong Serpentine Area, Korea (안동 사문암지대의 중금속 함유 낙엽의 분해)

  • Ryou, Sae-Han;Kim, Jeong-Myung;Cha, Sang-Seub;Shim, Jae-Kuk
    • Korean Journal of Environment and Ecology
    • /
    • v.24 no.4
    • /
    • pp.426-435
    • /
    • 2010
  • The present study attempts to compare the soil chemical characteristics and biological activities (i.e. microbial biomass and soil enzyme activities), and litter decomposition rate of Arundinella hirta and Miscanthus sinensis var. purpurascens) collected from serpentine and non-serpentine sites by litter bag techniques at serpentine and non-serpentine field experiment sites over a 9-month period. The serpentine soil showed higher pH and soil alkaliphosphatase activity, and lower soil dehydrogenase and urease activities than the non-serpentine soil. Microbial biomass-N at the serpentine soil was larger than the non-serpentine soil, although the microbial biomass-C and microbial biomass-N represented no significant difference between serpentine and non-serpentine soil. These results suggest that the larger microbial biomass-N caused the lower C/N in serpentine soil. At the end of the experiment, the litter samples of A. hirta and M. sinensis collected from serpentine soil revealed a 39.8% and 38.5% mass loss, and the litter sample from non-serpentine soil also showed a 41.1% and 41.7% mass loss at the serpentine site. On the other hand, at the non-serpentine site, 42.2%, 37.4%, and 46.8%, 44.8% were respectively shown. These results demonstrate that the litter decomposition rate is more intensely affected by the heavy metal content of leaf litter than soil contamination. Moreover, the litter collected from the serpentine soil had a lower C/N, whereas the litter decomposition rate was slower than the litter from the non-serpentine soil, because the heavy metal inhibition activities on the litter decomposition process were more conspicuous than the effect of litter qualities such as C/N ratio or lignin/N. The nutrient element content in the decomposing litter was gradually leached out, but heavy metals and Mg were accumulated in the decaying litter. This phenomenon was conspicuous at the serpentine site during the process of decomposition.