• Title/Summary/Keyword: 재현 정확도

Search Result 1,459, Processing Time 0.025 seconds

A Methodology for Extracting Shopping-Related Keywords by Analyzing Internet Navigation Patterns (인터넷 검색기록 분석을 통한 쇼핑의도 포함 키워드 자동 추출 기법)

  • Kim, Mingyu;Kim, Namgyu;Jung, Inhwan
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.123-136
    • /
    • 2014
  • Recently, online shopping has further developed as the use of the Internet and a variety of smart mobile devices becomes more prevalent. The increase in the scale of such shopping has led to the creation of many Internet shopping malls. Consequently, there is a tendency for increasingly fierce competition among online retailers, and as a result, many Internet shopping malls are making significant attempts to attract online users to their sites. One such attempt is keyword marketing, whereby a retail site pays a fee to expose its link to potential customers when they insert a specific keyword on an Internet portal site. The price related to each keyword is generally estimated by the keyword's frequency of appearance. However, it is widely accepted that the price of keywords cannot be based solely on their frequency because many keywords may appear frequently but have little relationship to shopping. This implies that it is unreasonable for an online shopping mall to spend a great deal on some keywords simply because people frequently use them. Therefore, from the perspective of shopping malls, a specialized process is required to extract meaningful keywords. Further, the demand for automating this extraction process is increasing because of the drive to improve online sales performance. In this study, we propose a methodology that can automatically extract only shopping-related keywords from the entire set of search keywords used on portal sites. We define a shopping-related keyword as a keyword that is used directly before shopping behaviors. In other words, only search keywords that direct the search results page to shopping-related pages are extracted from among the entire set of search keywords. A comparison is then made between the extracted keywords' rankings and the rankings of the entire set of search keywords. Two types of data are used in our study's experiment: web browsing history from July 1, 2012 to June 30, 2013, and site information. The experimental dataset was from a web site ranking site, and the biggest portal site in Korea. The original sample dataset contains 150 million transaction logs. First, portal sites are selected, and search keywords in those sites are extracted. Search keywords can be easily extracted by simple parsing. The extracted keywords are ranked according to their frequency. The experiment uses approximately 3.9 million search results from Korea's largest search portal site. As a result, a total of 344,822 search keywords were extracted. Next, by using web browsing history and site information, the shopping-related keywords were taken from the entire set of search keywords. As a result, we obtained 4,709 shopping-related keywords. For performance evaluation, we compared the hit ratios of all the search keywords with the shopping-related keywords. To achieve this, we extracted 80,298 search keywords from several Internet shopping malls and then chose the top 1,000 keywords as a set of true shopping keywords. We measured precision, recall, and F-scores of the entire amount of keywords and the shopping-related keywords. The F-Score was formulated by calculating the harmonic mean of precision and recall. The precision, recall, and F-score of shopping-related keywords derived by the proposed methodology were revealed to be higher than those of the entire number of keywords. This study proposes a scheme that is able to obtain shopping-related keywords in a relatively simple manner. We could easily extract shopping-related keywords simply by examining transactions whose next visit is a shopping mall. The resultant shopping-related keyword set is expected to be a useful asset for many shopping malls that participate in keyword marketing. Moreover, the proposed methodology can be easily applied to the construction of special area-related keywords as well as shopping-related ones.

Customer Acceptance Procedure for Clinac (21EX-Platinum)

  • Hong, Dong-Ki;Lee, Woo-Seok;Kwon, Kyung-Tae;Park, Kwang-Ho;Kim, Chung-Man
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.16 no.2
    • /
    • pp.43-61
    • /
    • 2004
  • Purpose : For qualify improvement in radiotherapy, it is important to set up and evaluate equipment (linac) accurately. In addition, technicians are needed to be fully aware of the equipment's detailed quality and its manual. Therefore, the result of ATP is evaluated and introduced, in order that the technicians are skilled by participating in quality assurance (QA) and understanding the quality of the equipment before clinical use. Method and Material : QA for LINAC 21EX (Varian, US) was done with suppliers its procedure was divided into radiation survey, mechanical test, radiation isocenter test, bean performance, dosimetry, and enhanced dynamic wedge and using X-omat film (Kodak), multidata, densitometer, and electrometer. QA of MLC (Millennium, 120 leaf) attached to LINAC and EPID (Portal vision) were done separately. Result : The leakage dose by survey meter was below the tolerance. In mechanical test, collimater, gantry, and couch rotation were less than 1mm, and the angles were ${\pm}0.1^{\circ}$ for digital and ${\pm}0.5^{\circ}$ for mechanical. The alignment test of the light field and crosshair were evaluated less than 1mm. The (a)symmetrical jaw field was less than ${\pm}0.5mm$. The radiation isocenter test using X-mat film was less than 1mm. The consistency of light field and radiation field was less than ${\pm}0.1mm$. PDD for photon energy was less than ${\pm}1\%$ and for electron energy of $90\%,\;80\%,\;50\%,\;and\;30\%$ were evaluated within the tolerance. Flatness for photon and electron energy was evaluated $2.3\%$ (tolerance $3\%$) and $3\%$ (tolerance $4.5\%$), respectively, and symmetry was $0.45\%$ (tolerance $2\%$) and $0.3\%$ (tolerance $2\%$), respectively. Dosimetry test for short term, MU setting, rep rate, and dose rate accuracy of photon and electron energy was within the tolerance depending on energy, MU, and gantry angle. Conclusion : Accuracy and safety for clinical use of Clinac 21EX was verified through customer acceptance procedure and the quality of the equipment was found out. These can reduce the difficulties in using the equipment. Furthermore, it is useful for clinically treatment of patients by technicians' active participations.

  • PDF

지점우량 자료의 분포형 설정과 내용안전년수에 따르는 확률강우량에 관한 고찰 - 국내 3개지점 서울, 부산 및 대구를 중심으로 -

  • Lee, Won-Hwan;Lee, Gil-Chun;Jeong, Yeon-Gyu
    • Water for future
    • /
    • v.5 no.1
    • /
    • pp.27-36
    • /
    • 1972
  • This thesis is the study of the rainfall probability depth in the major areas of Korea, such as Seoul, Pusan and Taegu. The purpose of the paper is to analyze the rainfall in connection with the safe planning of the hydraulic structures and with the project life. The methodology used in this paper is the statistical treatment of the rainfall data in the above three areas. The scheme of the paper is the following. 1. The complementation of the rainfall data We tried to select the maximm values among the values gained by the three methods: Fourier Series Method, Trend Diagram Method and Mean Value Method. By the selection of the maximum values we tried to complement the rainfall data lacking in order to prevent calamities. 2. The statistical treatment of the data The data are ordered by the small numbers, transformed into log, $\sqrt{}, \sqrt[3]{}, \sqrt[4], and$\sqrt[5], and calculated their statistical values through the electronic computer. 3. The examination of the distribution types and the determination of the optimum distibution types By the $x^2-Test$ the distribution types of rainfall data are examined, and rejected some part of the data in order to seek the normal rainfall distribution types. In this way, the optimum distribution types are determined. 4. The computation of rainfall probability depth in the safety project life We tried to study the interrelation between the return period and the safety project life, and to present the rainfall probability depth of the safety project life. In conclusion we set up the optimum distribution types of the rainfall depths, formulated the optimum distributions, and presented the chart of the rainfall probability depth about the factor of safety and the project life.ct life.

  • PDF

Estimation of Surface fCO2 in the Southwest East Sea using Machine Learning Techniques (기계학습법을 이용한 동해 남서부해역의 표층 이산화탄소분압(fCO2) 추정)

  • HAHM, DOSHIK;PARK, SOYEONA;CHOI, SANG-HWA;KANG, DONG-JIN;RHO, TAEKEUN;LEE, TONGSUP
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.24 no.3
    • /
    • pp.375-388
    • /
    • 2019
  • Accurate evaluation of sea-to-air $CO_2$ flux and its variability is crucial information to the understanding of global carbon cycle and the prediction of atmospheric $CO_2$ concentration. $fCO_2$ observations are sparse in space and time in the East Sea. In this study, we derived high resolution time series of surface $fCO_2$ values in the southwest East Sea, by feeding sea surface temperature (SST), salinity (SSS), chlorophyll-a (CHL), and mixed layer depth (MLD) values, from either satellite-observations or numerical model outputs, to three machine learning models. The root mean square error of the best performing model, a Random Forest (RF) model, was $7.1{\mu}atm$. Important parameters in predicting $fCO_2$ in the RF model were SST and SSS along with time information; CHL and MLD were much less important than the other parameters. The net $CO_2$ flux in the southwest East Sea, calculated from the $fCO_2$ predicted by the RF model, was $-0.76{\pm}1.15mol\;m^{-2}yr^{-1}$, close to the lower bound of the previous estimates in the range of $-0.66{\sim}-2.47mol\;m^{-2}yr^{-1}$. The time series of $fCO_2$ predicted by the RF model showed a significant variation even in a short time interval of a week. For accurate evaluation of the $CO_2$ flux in the Ulleung Basin, it is necessary to conduct high resolution in situ observations in spring when $fCO_2$ changes rapidly.

The Study on Restoration & Repair of the Seated Stone Statue of Buddha in the Samreoung Valley of Mt. Namsan (경주 남산 삼릉계 석불좌상 보존 및 복원 연구)

  • Jeong, Min Ho;Ji, Sung Jin
    • Korean Journal of Heritage: History & Science
    • /
    • v.43 no.3
    • /
    • pp.242-281
    • /
    • 2010
  • There are a large number of Buddhist cultural relics in Mt. Namsan. The cultural relics carry the spirit of people of Shila who dream of Buddhist Elysium and the establishment of Buddhist nation. In the valley and the top of the mountain and on various rock cliff, stone statues of Buddha and stone pagodas stand in harmony with nature. For that reason, Mt. Namsan is called an open-air museum. And it played an important role in establishing 'The UNESCO World Heritage' status for Gyeongdju in December 2000. But sadly, there are many stone relics that have eroded away and damaged from collapsing in the passage of time. The seated stone statue of Buddha in Samreoung valley of Mt. Namsan is one of them. It was created between the 8th and 9th century, and restored without much care nor extensive historical research in 1923. As a result, The face of the Buddha remained with concrete mortar and its nimbus fallen backward and destroyed. Therefore, restoration and repair as well as creation of a statue environment for the statue were urgent. So we immediately started in restoration and repair. First, through the archaeological excavation around the stone Buddha, we carried the stone Buddha on the original position. In order to restore the statues to its original glory created by the Unified Shila Dynasty, we created a restoration plan in corporation with art historians and historians, then restored the jaw and the damage nimbus. Second, we made the weathering & damage map of the stone Buddha. In order to prevent second damage, we cleaned the surface of contaminants with distilled water. Third, we studied restoration method to prevent artificial damage. We recreated parts of his face and halo. Then each parts of the statue were restored to their original position. In the whole process of restoration, we tried to use traditional techniques.

A Study on Superficial Dose of 6MV-FFF in HalcyonTM LINAC: Phantom Study (HalcyonTM 선형가속기 6MV-FFF 에너지의 표재 선량에 대한 고찰: Phantom Study)

  • Choi, Seong Hoon;Um, Ki Cheon;Yoo, Soon Mi;Park, Je Wan;Song, Heung Kwon;Yoon, In Ha
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.32
    • /
    • pp.31-39
    • /
    • 2020
  • Purpose: The aims of this study were to compare the superficial dose with Optically Stimulated Luminescence Dosimeter(OSLD) measurement and Treatment Planning System(TPS) calculation for 6MV-Flattening Filter Free(FFF) energy using HalcyonTM and TrueBeamTM. Materials and methods: Phantom study was performed using the CT images of human phantom. In the treatment planning system, the Planning Target Volume(PTV) was contoured which is similar to Glottic cancer. Furthermore, Point(M), Point(R), and Point(L) were contoured at the iso-center of head and neck region and 5mm bolus was applied to the body contour. Each treatment plans using 6MV-FFF energy from HalcyonTM and TrueBeamTM with static Intensity Modulated Radiation Therapy(IMRT) and Volumetric Modulated Arc Therapy(VMAT) were established with eclipse. To reproduce the same position as the TPS, OSLDs were placed at the iso-center point and 5mm bolus was applied to compare the error rate after the dose delivery. Result: The results of the study using human phantom are as follows. In case of HalcyonTM, the mean absolute error rates of the point dose using the treatment planning system and the dose measured by OSLD were 1.7%±1.2% for VMAT and 4.0±2.8% for IMRT. Also TrueBeamTM was identified as 2.4±0.4% and 8.6±1.8% respectively for VMAT and IMRT. Conclusion: Through the results of this study, TrueBeamTM confirmed that the average error rate was 2.4 times higher for VMAT and 3.6 times higher for IMRT than HalcyonTM. Therefore, based on the results of this study, If we need a more accurate dose assessment for the superficial dose, It is expected that using HalcyonTM would be better than TrueBeamTM.

Application of microwave water surface current meter for measuring agricultural water intake (농업용수 사용량 계측을 위한 전자파 표면유속계의 적용)

  • Baek, Jongseok;Kim, Chiyoung;Lee, Kisung;Kang, Hyunwoong;Song, Jaehyun
    • Journal of Korea Water Resources Association
    • /
    • v.53 no.12
    • /
    • pp.1071-1079
    • /
    • 2020
  • For integrated water management, it is essential to secure basic data such as the amount of agricultural water intake. The river water intake through the intake weir is carried out through the agricultural irrigation canal, and a method for measuring the quantity of water intake is required to suit the characteristics of the measuring points. In this study, the accuracy of the calculated flow data was determined by applying a microwave water surface current meter. The microwave water surface current meter is a method of calculating surface velocity using doppler effect, which is mainly used in high-velocities situations such as flood. Surface velocity is difficult to represent the average velocity of the entire section at low dicharges or high wind speeds, it is considered to be low in continuous utilization throughout the year, and it is necessary to verify whether the measurement using an microwave water surface curren meter is appropriate in agricultural irrigation canal. The data measured with an microwave water surface curren meter were compared with the actual flow data to calculate the intake data in agricultural irrigation canal. In agricultural irrigation canal, the low-level discharge calculated using an microwave water surface current meter at a minimum velocity of about 0.3 m/s and a minimum discharge of about 1.0 m3/s or higher was found to have a high tendency and accuracy compared to the standard discharge, especially when the high discharge was high. Although effective results can be obtained in terms of quantity at low discharge, it is deemed that subsequent studies are needed to calculate the average discharge of the cross section at low discharge, given that the trend of data is unstable. Through this study, it is suggested that it is appropriate to calculate the amount of water intake through the microwave water surface current meter in artificial waterways with a certain discharge or higher, so it is expected to be widely distributed as a method for measuring river water intake.

Gridded Expansion of Forest Flux Observations and Mapping of Daily CO2 Absorption by the Forests in Korea Using Numerical Weather Prediction Data and Satellite Images (국지예보모델과 위성영상을 이용한 극상림 플럭스 관측의 공간연속면 확장 및 우리나라 산림의 일일 탄소흡수능 격자자료 산출)

  • Kim, Gunah;Cho, Jaeil;Kang, Minseok;Lee, Bora;Kim, Eun-Sook;Choi, Chuluong;Lee, Hanlim;Lee, Taeyun;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.6_1
    • /
    • pp.1449-1463
    • /
    • 2020
  • As recent global warming and climate changes become more serious, the importance of CO2 absorption by forests is increasing to cope with the greenhouse gas issues. According to the UN Framework Convention on Climate Change, it is required to calculate national CO2 absorptions at the local level in a more scientific and rigorous manner. This paper presents the gridded expansion of forest flux observations and mapping of daily CO2 absorption by the forests in Korea using numerical weather prediction data and satellite images. To consider the sensitive daily changes of plant photosynthesis, we built a machine learning model to retrieve the daily RACA (reference amount of CO2 absorption) by referring to the climax forest in Gwangneung and adopted the NIFoS (National Institute of Forest Science) lookup table for the CO2 absorption by forest type and age to produce the daily AACA (actual amount of CO2 absorption) raster data with the spatial variation of the forests in Korea. In the experiment for the 1,095 days between Jan 1, 2013 and Dec 31, 2015, our RACA retrieval model showed high accuracy with a correlation coefficient of 0.948. To achieve the tier 3 daily statistics for AACA, long-term and detailed forest surveying should be combined with the model in the future.

Topic Modeling Insomnia Social Media Corpus using BERTopic and Building Automatic Deep Learning Classification Model (BERTopic을 활용한 불면증 소셜 데이터 토픽 모델링 및 불면증 경향 문헌 딥러닝 자동분류 모델 구축)

  • Ko, Young Soo;Lee, Soobin;Cha, Minjung;Kim, Seongdeok;Lee, Juhee;Han, Ji Yeong;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.2
    • /
    • pp.111-129
    • /
    • 2022
  • Insomnia is a chronic disease in modern society, with the number of new patients increasing by more than 20% in the last 5 years. Insomnia is a serious disease that requires diagnosis and treatment because the individual and social problems that occur when there is a lack of sleep are serious and the triggers of insomnia are complex. This study collected 5,699 data from 'insomnia', a community on 'Reddit', a social media that freely expresses opinions. Based on the International Classification of Sleep Disorders ICSD-3 standard and the guidelines with the help of experts, the insomnia corpus was constructed by tagging them as insomnia tendency documents and non-insomnia tendency documents. Five deep learning language models (BERT, RoBERTa, ALBERT, ELECTRA, XLNet) were trained using the constructed insomnia corpus as training data. As a result of performance evaluation, RoBERTa showed the highest performance with an accuracy of 81.33%. In order to in-depth analysis of insomnia social data, topic modeling was performed using the newly emerged BERTopic method by supplementing the weaknesses of LDA, which is widely used in the past. As a result of the analysis, 8 subject groups ('Negative emotions', 'Advice and help and gratitude', 'Insomnia-related diseases', 'Sleeping pills', 'Exercise and eating habits', 'Physical characteristics', 'Activity characteristics', 'Environmental characteristics') could be confirmed. Users expressed negative emotions and sought help and advice from the Reddit insomnia community. In addition, they mentioned diseases related to insomnia, shared discourse on the use of sleeping pills, and expressed interest in exercise and eating habits. As insomnia-related characteristics, we found physical characteristics such as breathing, pregnancy, and heart, active characteristics such as zombies, hypnic jerk, and groggy, and environmental characteristics such as sunlight, blankets, temperature, and naps.

Sorghum Field Segmentation with U-Net from UAV RGB (무인기 기반 RGB 영상 활용 U-Net을 이용한 수수 재배지 분할)

  • Kisu Park;Chanseok Ryu ;Yeseong Kang;Eunri Kim;Jongchan Jeong;Jinki Park
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_1
    • /
    • pp.521-535
    • /
    • 2023
  • When converting rice fields into fields,sorghum (sorghum bicolor L. Moench) has excellent moisture resistance, enabling stable production along with soybeans. Therefore, it is a crop that is expected to improve the self-sufficiency rate of domestic food crops and solve the rice supply-demand imbalance problem. However, there is a lack of fundamental statistics,such as cultivation fields required for estimating yields, due to the traditional survey method, which takes a long time even with a large manpower. In this study, U-Net was applied to RGB images based on unmanned aerial vehicle to confirm the possibility of non-destructive segmentation of sorghum cultivation fields. RGB images were acquired on July 28, August 13, and August 25, 2022. On each image acquisition date, datasets were divided into 6,000 training datasets and 1,000 validation datasets with a size of 512 × 512 images. Classification models were developed based on three classes consisting of Sorghum fields(sorghum), rice and soybean fields(others), and non-agricultural fields(background), and two classes consisting of sorghum and non-sorghum (others+background). The classification accuracy of sorghum cultivation fields was higher than 0.91 in the three class-based models at all acquisition dates, but learning confusion occurred in the other classes in the August dataset. In contrast, the two-class-based model showed an accuracy of 0.95 or better in all classes, with stable learning on the August dataset. As a result, two class-based models in August will be advantageous for calculating the cultivation fields of sorghum.