• Title/Summary/Keyword: the Third Age


Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.143-163
    • /
    • 2016
  • The demographics of Internet users are the most basic and important sources for target marketing or personalized advertisements on the digital marketing channels which include email, mobile, and social media. However, it gradually has become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although the marketing department is able to get the demographics using online or offline surveys, these approaches are very expensive, long processes, and likely to include false statements. Clickstream data is the recording an Internet user leaves behind while visiting websites. As the user clicks anywhere in the webpage, the activity is logged in semi-structured website log files. Such data allows us to see what pages users visited, how long they stayed there, how often they visited, when they usually visited, which site they prefer, what keywords they used to find the site, whether they purchased any, and so forth. For such a reason, some researchers tried to guess the demographics of Internet users by using their clickstream data. They derived various independent variables likely to be correlated to the demographics. The variables include search keyword, frequency and intensity for time, day and month, variety of websites visited, text information for web pages visited, etc. The demographic attributes to predict are also diverse according to the paper, and cover gender, age, job, location, income, education, marital status, presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used for prediction model building. However, this research has not yet identified which data mining method is appropriate to predict each demographic variable. Moreover, it is required to review independent variables studied so far and combine them as needed, and evaluate them for building the best prediction model. 
The objective of this study is to choose the clickstream attributes most likely to be correlated with demographics from the results of previous research, and then to identify which data mining method is best suited to predicting each demographic attribute. Among the demographic attributes, this paper focuses on predicting gender, age, marital status, residence, and job. From the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is composed of 4 steps. In the first step, we create user profiles which include 64 clickstream attributes and 5 demographic attributes. The second step performs dimension reduction of the clickstream variables to address the curse of dimensionality and the overfitting problem. We utilize three approaches, which are based on decision tree, PCA, and cluster analysis. In the third step, we build alternative predictive models for each demographic variable. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in terms of model accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and 5-fold cross validation was conducted to enhance the reliability of our experiments. The experimental results verify that there is a specific data mining method well suited to each demographic variable. For example, age prediction performs best when using decision tree based dimension reduction with a neural network, whereas the prediction of gender and marital status is most accurate when applying SVM without dimension reduction.
We conclude that the online behaviors of Internet users, captured through clickstream data analysis, can be used to predict their demographics and thereby be applied to digital marketing.
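The 5-fold cross validation mentioned in this abstract can be sketched roughly as follows. This is a minimal illustration in plain Python, not the authors' IBM SPSS Modeler workflow; the function names and the majority-class stand-in model are hypothetical and would be swapped for an SVM, neural network, or logistic regression in practice.

```python
import random

def k_fold_indices(n_samples, k=5, seed=0):
    """Split range(n_samples) into k shuffled, near-equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

def cross_validate(features, labels, fit, predict, k=5, seed=0):
    """Return the mean held-out accuracy over k folds."""
    folds = k_fold_indices(len(labels), k, seed)
    accuracies = []
    for held_out in folds:
        held = set(held_out)
        train = [i for i in range(len(labels)) if i not in held]
        # Fit on the other k-1 folds, then score on the held-out fold.
        model = fit([features[i] for i in train], [labels[i] for i in train])
        correct = sum(predict(model, features[i]) == labels[i] for i in held_out)
        accuracies.append(correct / len(held_out))
    return sum(accuracies) / k

# Hypothetical stand-in model: always predict the training fold's majority class.
def fit_majority(train_x, train_y):
    return max(set(train_y), key=train_y.count)

def predict_majority(model, x):
    return model
```

Comparing the value returned by `cross_validate` across alternative models (and across dimension reduction choices) mirrors the model selection step described in the abstract, under the assumption that accuracy is the only criterion.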

Associations of Communication Skills, Self-Efficacy on Clinical Performance and Empathy in Trainee Doctors (전공의 의료커뮤니케이션 능력과 진료수행 자기효능감, 공감능력과의 상관관계)

  • Kim, Doehyung;Kim, Min-Jeong;Lee, Haeyoung;Kim, Hyunseuk;Kim, Youngmi;Lee, Sang-Shin
    • Korean Journal of Psychosomatic Medicine
    • /
    • v.29 no.1
    • /
    • pp.49-57
    • /
    • 2021
  • Objectives : This study evaluated the medical communication skills of trainee doctors and analyzed the relationship between medical communication skills, self-efficacy on clinical performance (SECP) and empathy. Methods : A total of 106 trainee doctors from a university hospital participated. The questionnaire comprised self-evaluated medical communication skills, modified SECP and the Korean version of the Jefferson Scale of Empathy-Health Professionals version. The mean difference in medical communication skills scores according to gender, age, division (intern, internal medicine group or surgery group) and position (intern, first-/second- and third-/fourth-year residents) were analyzed. Pearson correlation coefficients were determined between medical communication skills, modified SECP and empathy. The effects of each variable on medical communication skills were verified using the structural equation model. Results : There were no statistically significant mean differences in self-evaluated medical communication skills according to gender, age, division or position. Medical communication skills had a significant positive correlation with modified SECP (r=0.782, p<0.001) and empathy (r=0.210, p=0.038). Empathy had a direct effect on modified SECP (β=0.30, p<0.01) and modified SECP had a direct effect on medical communication skills (β=0.80, p<0.001). Empathy indirectly influenced medical communication skills, mediating modified SECP (β=0.26, p<0.05). Conclusions : Medical communication skills are an important core curriculum of residency programs, as they have a direct correlation with SECP, which is needed for successful treatment. Moreover, the medical communication needs a new understanding that is out of empathy.

The Clinical Features of Endobronchial Tuberculosis - A Retrospective Study on 201 Patients for 6 years (기관지결핵의 임상상-201예에 대한 후향적 고찰)

  • Lee, Jae Young;Kim, Chung Mi;Moon, Doo Seop;Lee, Chang Wha;Lee, Kyung Sang;Yang, Suck Chul;Yoon, Ho Joo;Shin, Dong Ho;Park, Sung Soo;Lee, Jung Hee
    • Tuberculosis and Respiratory Diseases
    • /
    • v.43 no.5
    • /
    • pp.671-682
    • /
    • 1996
  • Background : Endobronchial tuberculosis is defined as tuberculous infection of the tracheobronchial tree with microbiological and histopathological evidence. Endobronchial tuberculosis has clinical significance due to its sequela of cicatricial stenosis, which causes atelectasis, dyspnea and secondary pneumonia and may mimic bronchial asthma and pulmonary malignancy. Method : The authors carried out, retrospectively, a clinical study on 201 patients confirmed with endobronchial tuberculosis who visited the Department of Pulmonary Medicine at Hanyang University Hospital from January 1990 to April 1996. The following results were obtained. Results : 1) A total of 201 patients (19.5%) were confirmed as having endobronchial tuberculosis among 1031 patients who had undergone flexible bronchofiberscopic examination. There were 55 male patients and 146 female patients, and the male to female ratio was 1 : 2.7. 2) The age distribution was as follows: there were 61 cases (30.3%) in the third decade, 40 cases (19.9%) in the fourth decade, 27 cases (13.4%) in the sixth decade, 21 cases (10.4%) in the fifth decade, 19 cases (9.5%) in the age group between 15 and 19 years, 19 cases (9.5%) in the seventh decade, and 14 cases (7.0%) over 70 years, in decreasing order. 3) The most common symptom, in 192 cases, was cough (74.5%), followed by sputum (55.2%), dyspnea (28.6%), chest discomfort (19.8%), fever (17.2%), and hemoptysis (11.5%), in decreasing order, and localized wheezing was heard in 15.6%. 4) On chest X-ray of 189 cases, consolidation was the most frequent finding (67.7%), followed by collapse (43.9%), cavitary lesion (11.6%), and pleural effusion (7.4%), in decreasing order, and there were no abnormal findings in 3.2%. 5) In the 76 pulmonary function tests, a normal pattern was found in 44.7%, a restrictive pattern in 39.5%, an obstructive pattern in 11.8%, and a combined pattern in 3.9%.
6) Among the total of 201 patients, bronchoscopy showed caseous pseudomembrane in 70 cases (34.8%), mucosal erythema and edema in 54 cases (26.9%), hyperplastic lesion in 52 cases (25.9%), fibrous stenosis in 22 cases (10.9%), and erosion or ulcer in 3 cases (1.5%). 7) In the total of 201 cases, bronchial washing AFB stain was positive in 103 cases (51.2%), and bronchial washing culture for tuberculous bacilli in 55 cases (27.4%). In the 99 bronchoscopic biopsies, AFB stain was positive in 36.4%, granuloma without positive AFB stain was found in 13.1%, chronic inflammation only in 36.4%, and non-diagnostic biopsy findings in 14.1%. Conclusions : Young female patients whose cough is resistant to general antitussive agents should be evaluated for endobronchial tuberculosis, even with a clear chest roentgenogram and negative sputum AFB stain. Furthermore, we would like to emphasize that the bronchoscopic approach is a substantially useful means of making a differential diagnosis of atelectasis in older patients of cancer age. At this time a standard endoscopic classification of endobronchial tuberculosis is needed, and well designed prospective studies are required to elucidate the effect of combination therapy using antituberculous chemotherapy with steroids on bronchial stenosis in patients with endobronchial tuberculosis.
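The percentages reported in this abstract can be cross-checked against the stated counts. The short sketch below only recomputes figures given in the text; the helper name `pct` is an illustrative choice, and rounding to one decimal place is an assumption about how the authors reported their values.

```python
def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)

confirmed, examined = 201, 1031   # confirmed cases / bronchoscopies performed
males, females = 55, 146

# Bronchoscopic findings as counts, taken from item 6 of the abstract.
findings = {
    "caseous pseudomembrane": 70,
    "mucosal erythema and edema": 54,
    "hyperplastic lesion": 52,
    "fibrous stenosis": 22,
    "erosion or ulcer": 3,
}

detection_rate = pct(confirmed, examined)            # share of examined patients confirmed
female_per_male = round(females / males, 1)          # the "1 : x" sex ratio
finding_pcts = {name: pct(n, confirmed) for name, n in findings.items()}
```

The five finding counts sum to the 201 confirmed patients, which suggests each patient was assigned exactly one bronchoscopic category.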


Lower Lung Field Tuberculosis (폐 하야 결핵)

  • Moon, Doo-Seop;Lim, Byung-Sung;Kim, Yeon-Soo;Kim, Seong-Min;Lee, Jae-Young;Lee, Dong-Suck;Sohn, Jang-Won;Lee, Kyung-Sang;Yang, Suck-Chul;Yoon, Ho-Joo;Shin, Dong-Ho;Park, Sung-Soo;Lee, Jung-Hee
    • Tuberculosis and Respiratory Diseases
    • /
    • v.44 no.2
    • /
    • pp.232-240
    • /
    • 1997
  • Background : Postprimary pulmonary tuberculosis is located mainly in the upper lobes. A tuberculous lesion involving the lower lobes usually arises from an upper lobe cavity through endobronchial spread. When tuberculosis is confined to the lower lung field, it often masquerades as pneumonia, lung cancer, bronchiectasis, or lung abscess. Thus the correct diagnosis may sometimes be delayed for a long time. Methods : We carried out, retrospectively, a clinical study on 50 patients confirmed with lower lung field tuberculosis who visited the Department of Pulmonary Medicine at Hanyang University Hospital from January 1992 to December 1994. The following results were obtained. Results : Lower lung field tuberculosis without concomitant upper lobe disease occurred in fifty patients, representing 6.9% of the total admissions with active pulmonary tuberculosis over a period of 3 years. It occurred most frequently in the third decade, but the age distribution was relatively even. The mean age was 43 years. Females were more frequently affected than males (male to female ratio 1 : 1.9). The most common symptom was cough (68%), followed by sputum (52%), fever (38%), and chest discomfort (30%). On chest X-ray of the 50 patients, consolidation was the most common finding (52%), followed by solitary nodule (22%), collapse (16%), and cavitary lesion (10%), in decreasing order. The disease was confined to the right side in 25 cases, the left side in 20 cases, and both sides in 5 cases. Endobronchial tuberculosis (1) Endobronchial involvement was proved by bronchoscopic examination in 20 of the 50 patients. (2) The mean age was 44 years and females were more affected than males (male to female ratio 1 : 3). Sputum AFB stain and Mycobacterium tuberculosis culture were positive in only 50% of cases; unlike upper lobe tuberculosis, additional diagnostic methods were needed. In our study, bronchoscopic examination and percutaneous fine needle aspiration biopsy increased diagnostic yield by 18% and 32%, respectively.
The most common associated condition was diabetes mellitus(18%) and others were anemia, anorexia nervosa, stomach cancer, and systemic steroid usage. Conclusion : When we find a lower lung field lesion, we should suspect tuberculosis if the patient has diabetes mellitus, anemia, systemic steroid usage, malignancy or other immune suppressed states. Because diagnostic yield of sputum AFB smear & Mycobacterium tuberculosis culture was low, additional diagnostic methods such as bronchoscopy and fine needle aspiration biopsy were needed.


Four-year change and tracking of serum lipids in Korean adolescents (강화지역 청소년의 4년간 혈청 지질의 변화와 지속성)

  • Lee, Kang-Hee;Suh, Il;Jee, Sun-Ha;Nam, Chung-Mo;Kim, Sung-Soon;Shim, Won-Heum;Ha, Jong-Won;Kim, Suk-Il;Kang, Hyung-Gon
    • Journal of Preventive Medicine and Public Health
    • /
    • v.30 no.1 s.56
    • /
    • pp.45-59
    • /
    • 1997
  • It has been known that there is a tracking phenomenon in the level of serum lipids. However, no study has been performed to examine the change and tracking of serum lipids in Korean adolescents. The purpose of this study is to examine the changes of serum lipids in Korean adolescents from 12 to 16 years of age, and to examine whether or not there is a tracking phenomenon in serum lipids level during the period. In 1992 serum lipids(total cholesterol(TC), triglyceride(TG), LDL cholesterol(LDL-C), HDL cholesterol(HDL-C)) were measured in 318 males, 365 females who were 12 years of age in Kangwha county, Korea. These participants have been followed up to 1996 and serum lipids level were examined in 1994 and 1996. Among the participants 162 males and 147 females completed all three examinations in fasting state. To examine the effect of eliminating adolescents with incomplete data, we compared serum lipids, blood pressure and anthropometric measures at baseline between adolescents with complete follow-up and adolescents who were withdrawn. To examine the change of serum lipids we compared mean values of serum lipids according to age in males and females. Repeated analysis of variance was used to test the change according to age. We used three methods to examine the existence of tracking. First, we analyzed the trends in serum lipids over 4-year period within quartile groups formed on the basis of the first-year serum lipids level to see whether or not the relative ranking of the mean serum lipids among the quartile groups remained in the same group for 4-year period. Second, we quantified the degree of tracking by calculating Spearman's rank correlation coefficient between every tests. Third, the persistence extreme quartile method was used. This method divides the population into quartile groups according to the initial level of blood lipids and then calculates the percent of the subjects who stayed in the same group at follow-up measurement. 
Decreases in TC and LDL-C levels were noted over the 4 years, primarily for boys. The level of HDL-C decreased between baseline and the first follow-up for both sexes. Tracking, as measured by both correlation coefficients and persistence in extreme quartiles, was evident for all of the lipids. The correlation coefficients of TC between baseline and 4 years later in boys and girls were 0.55 and 0.68, respectively, and the corresponding values for HDL-C were 0.58 and 0.69. More than 50% of adolescents who belonged to the highest quartile group in TC, HDL-C and LDL-C at baseline remained in the same group at the examination performed 2 years later, for both sexes. The probabilities of remaining in the same group were more than 35% when examined 4 years later. The tracking phenomenon of TG was less evident compared with the other lipids. The percentages of girls who stayed in the same group 2 years later and 4 years later were 42.9% and 25.7%, respectively. It was evident that serum lipid levels tracked in Korean adolescents. Research with longer follow-up will be needed to investigate the long-term change of lipids from adolescence to adulthood.
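Two of the tracking measures named in this abstract, Spearman's rank correlation and persistence in an extreme quartile, can be sketched as follows. This is an illustrative plain-Python version under simple assumptions (at least four subjects, quartile size taken as n // 4, ties at the quartile boundary broken by position); it is not the authors' analysis code, and the function names are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # average of the 1-based positions start..end
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared
        start = end + 1
    return ranks

def spearman_rho(baseline, follow_up):
    """Spearman's rank correlation, computed as the Pearson correlation of ranks."""
    rx, ry = average_ranks(baseline), average_ranks(follow_up)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5

def top_quartile_persistence(baseline, follow_up):
    """Share of subjects in the top quartile at baseline who are still in it at follow-up."""
    n = len(baseline)
    size = n // 4
    top_base = set(sorted(range(n), key=lambda i: baseline[i])[-size:])
    top_follow = set(sorted(range(n), key=lambda i: follow_up[i])[-size:])
    return len(top_base & top_follow) / size
```

Both functions take two equal-length lists holding the same subjects' lipid values at baseline and at a later examination, in the same order.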


Records Management and Archives in Korea : Its Development and Prospects (한국 기록관리행정의 변천과 전망)

  • Nam, Hyo-Chai
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.1 no.1
    • /
    • pp.19-35
    • /
    • 2001
  • After almost a century of discontinuity in the archival tradition of the Chosun dynasty, Korea entered a new age of records and archival management by legislating and executing a basic law (The Records and Archives Management of Public Agencies Act of 1999). The Annals of the Chosun dynasty recorded the major historical facts of five hundred years of national affairs. The Annals are a major accomplishment in human history and rare in the world. This was possible because the Annals were composed of records collected, selected and compiled from primary sources written by generations of historians. As important public records need to be preserved in their original forms in modern archives, we had to develop and establish a modern archival system to appraise and select important national records for archival preservation. However, the colonization of Korea deprived us of the opportunity to do so, and our fine archival tradition was not carried on. A centralized archival system began to develop with the establishment of GARS under the Ministry of Government Administration in 1969. GARS built a modern repository in Pusan in 1984, succeeding to the tradition of the History Archives of the Chosun dynasty. In 1998, GARS moved its headquarters to the Taejon Government Complex and acquired state-of-the-art audio-visual archives preservation facilities. From 1996, GARS introduced an automated archival management system to remedy the manual registration and management system, complementing preservation microfilming. Digitization of the holdings was the key project for providing digital images of archives to users. To do this, GARS purchased new computer/server systems and developed application software. In parallel, GARS drastically renovated its staffing toward a high level of professionalization by recruiting more archivists with historical and library science backgrounds. Conservators and computer system operators were also recruited.
The new archival law has been in effect since January 1, 2000. It brought the following changes to the field of records and archival administration in Korea. First, the law regulates the records and archives of all public agencies, including the Legislature, the Judiciary, the Administration, the constitutional institutions, the Army, Navy, Air Force, and the National Intelligence Service. A nation-wide unified records and archives management system became available. Second, public archives and records centers are to be established according to the level of the agency: a central archives at the national level, special archives for the National Assembly and the Judiciary, local government archives for metropolitan cities and provinces, and records centers or special records centers for administrative agencies. A records manager will be responsible for the records management of each administrative division. Third, the records in public agencies are registered in the computer system as they are produced. Therefore, the records are traceable and can be searched or retrieved easily through the internet or a computer network. Fourth, qualified records managers and archivists who are professionally trained in the field of records management and archival science must be assigned to guarantee the professional management of records and archives. Fifth, the illegal treatment of public records and archives constitutes a punishable crime. In the future, public records and archival management will develop along with the Korean government's 'Electronic Government Project.' The following changes are in prospect. First, public agencies will digitize paper records, audio-visual records, and publications as well as electronic documents, thus promoting administrative efficiency and productivity. Second, the National Assembly has already established its Special Archives. The Judiciary and the National Intelligence Service will follow.
More archives will be established at city and provincial levels. Third, the more our society develops into a knowledge-based information society, the more the records management function will become one of the important national government functions. As more universities, academic associations, and civil societies participate in promoting archival awareness and in establishing archival science, and as more people realize the importance of records and archives management up to the level of a national public campaign, records and archival management in Korea will develop in ways significantly distinguishable from present practice.

An Epidemiological Study on the Industrial Injuries among Metal Products Manufacturing Workers in Young-Dung-Po, Seoul (일부 금속 및 기계제품 제조업체 근로자들의 산업재해(1980~1981)에 관한 조사)

  • Lee, Jung-Hee
    • Journal of Preventive Medicine and Public Health
    • /
    • v.15 no.1
    • /
    • pp.187-196
    • /
    • 1982
  • The following are the results of a study on industrial accidents that occurred at 12 factories manufacturing metal products during the 2 years from January 1980 to December 1981 in the Yong-Dung-Po area of Seoul. 1. The incidence rate of industrial injuries was 45.7 per 1,000 workers in the sample group, and the rate for males (54.0) was about three times that for females (17.5). 2. By age group, the highest rate was observed in the group under 19 years old at 83.5, while the lowest was in the group in their 40s. 3. Workers with short work experience had a higher rate of injuries; in particular, the group with less than 1 year of experience showed the highest rate at 48.1%. 4. By working time, the highest incidence rates occurred 3 and 7 hours after the start of work, at 6.0 and 6.1 per 1,000 workers, respectively. 5. By day of the week, the highest incidence rate was observed on Monday at 8.4 per 1,000 workers, accounting for 18.3% of occurrences. 6. By month, the highest incidence was observed in July at 5.4 per 1,000 workers and the next in March at 4.8. These figures account for 11.8% and 10.5% of total occurrences, respectively. 7. By cause of injury, accidents caused by power driven machinery showed the highest rate at 37.5%, followed by handling without machinery at 17.2%, falling objects at 14.2%, striking against objects at 10.2%, and so on. 8. By body part affected, most injuries (84.3%) occurred on the upper and lower extremities, at 58.8% for the former and 25.5% for the latter. Fingers were most frequently injured, at 40.3%. Comparing the sides of the extremities affected, 55.0% of injuries were on the right side and 45.0% on the left side. 9.
By nature of injury, laceration and open wound were the most common at 34.0%, followed by fracture and dislocation at 31.9%, and sprain at 8.1%. 10. Treatment lasted less than one month in 68.9% of the injured cases: 14.5% recovered within 2 weeks, and 54.4% were treated for more than 2 weeks. The duration of treatment tended to be longer in larger industries. 11. The ratio of insured accidents to uninsured accidents was 1 to 4.7.


The Ontology Based, the Movie Contents Recommendation Scheme, Using Relations of Movie Metadata (온톨로지 기반 영화 메타데이터간 연관성을 활용한 영화 추천 기법)

  • Kim, Jaeyoung;Lee, Seok-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.25-44
    • /
    • 2013
  • Access to movie content has become easier and more frequent with the advent of smart TV, IPTV and web services that can be used to search for and watch movies. As a result, users increasingly search for movie content that matches their preferences. However, since the amount of available movie content is so large, users need considerable effort and time to find what they want. Hence, there has been much research on recommending personalized items through the analysis and clustering of user preferences and user profiles. In this study, we propose a recommendation system which uses an ontology-based knowledge base. Our ontology can represent not only relations between movie metadata but also relations between metadata and user profiles. The relations among metadata can indicate similarity between movies. To build the knowledge base, our ontology model considers two aspects: the movie metadata model and the user model. For the ontology-based movie metadata model, we select genre, actor/actress, keywords and synopsis as the main metadata, since these affect which movies users choose. The user model contains the demographic information of users and the relations between users and movie metadata. In our model, the movie ontology model consists of seven concepts (Movie, Genre, Keywords, Synopsis Keywords, Character, and Person), eight attributes (title, rating, limit, description, character name, character description, person job, person name) and ten relations between concepts. For our knowledge base, we input individual data for 14,374 movies for each concept in the contents ontology model. This movie metadata knowledge base is used to search for movies related to the metadata a user is interested in, and it can find similar movies through the relations between concepts. We also propose an architecture for movie recommendation. The proposed architecture consists of four components.
The first component searches for candidate movies based on the demographic information of the user. In this component, we assign users to groups according to demographic information, define the rules for deciding the group of a user, and generate the query used to search for candidate movies to recommend to each group. The second component searches for candidate movies based on user preference. When choosing a movie, users consider metadata such as genre, actor/actress, synopsis and keywords. Users input their preferences, and the system searches for movies based on them. Unlike existing movie recommendation systems, the proposed system can find similar movies through the relations between concepts. Each metadata item of the recommended candidate movies has a weight that is used to decide the recommendation order. The third component merges the results of the first and second components. In this step, we calculate the weight of each movie using the weight values of its metadata, and then sort the movies by weight. The fourth component analyzes the result of the third component, decides the level of contribution of each metadata item, and applies the contribution weight to the metadata. Finally, we use the result of this step as the recommendation for users. We test the usability of the proposed scheme using a web application, which we implemented for the experiment using JSP, JavaScript and the Protégé API. In our experiment, we collected results from 20 men and women ranging in age from 20 to 29, and we used 7,418 movies with a rating of at least 7.0. We provided the Top-5, Top-10 and Top-20 recommended movies to each user, and users then chose the movies they were interested in. The average number of movies chosen was 2.1 in the Top-5, 3.35 in the Top-10, and 6.35 in the Top-20.
This is better than the results obtained using each metadata item alone.
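The merge-and-sort step performed by the third component might look roughly like the sketch below. The abstract does not give the weighting formula, so summing the metadata weights per movie is an assumption made here for illustration; the function name, input shape, and sample weights are likewise hypothetical.

```python
def merge_and_rank(demographic_candidates, preference_candidates, top_n=5):
    """Merge two candidate sets and rank movies by summed metadata weight.

    Each argument maps a movie title to {metadata name: weight}.
    Summation is an assumed scoring rule, not the paper's stated formula.
    """
    totals = {}
    for candidates in (demographic_candidates, preference_candidates):
        for title, metadata_weights in candidates.items():
            totals[title] = totals.get(title, 0.0) + sum(metadata_weights.values())
    # Highest total weight first; ties broken alphabetically for a stable order.
    ranked = sorted(totals.items(), key=lambda item: (-item[1], item[0]))
    return [title for title, _ in ranked[:top_n]]

# Hypothetical candidates from the demographic and preference components.
by_demographics = {"A": {"genre": 1.0}, "B": {"genre": 0.5}}
by_preference = {"B": {"actor": 1.0, "keywords": 0.5}, "C": {"genre": 0.2}}
top_movies = merge_and_rank(by_demographics, by_preference, top_n=5)
```

A movie that appears in both candidate sets accumulates weight from both, which is one simple way to favor titles that match the user's group as well as the user's stated preferences.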

Treatment Outcome of Locally Advanced Non-small Cell Lung Cancer Patients Who Received Concurrent Chemoradiotherapy with Weekly Paclitaxel (Paclitaxel 매주 투여 및 방사선치료 동시요법을 받은 국소진행성 비소세포폐암 환자들의 치료 결과)

  • Kim, Su-Zy;Shim, Byoung-Yong;Kim, Chi-Hong;Song, So-Hyang;Ahn, Meyung-Im;Cho, Deog-Gon;Cho, Kyu-Do;Yoo, Jin-Young;Kim, Hoon-Kyo;Kim, Sung-Whan
    • Radiation Oncology Journal
    • /
    • v.24 no.4
    • /
    • pp.230-236
    • /
    • 2006
  • Purpose : To analyze the response, toxicity, patterns of failure and survival rate of patients with locally advanced non-small cell lung cancer who were treated with concurrent chemoradiotherapy with weekly paclitaxel. Materials and Methods : Twenty-three patients with locally advanced non-small cell lung cancer who received radical chemoradiotherapy from October 1999 to September 2004 were included in this retrospective study. Patients received a total of 55.4~64.8 Gy (median 64.8 Gy; 1.8 Gy per daily fraction, 5 days per week) over 7~8 weeks. Paclitaxel at 50 or 60 mg/m² was administered on days 1, 8, 15, 22, 29 and 36 of radiotherapy. Four weeks after the concurrent chemoradiotherapy, three cycles of consolidation chemotherapy consisting of paclitaxel 135 mg/m² and cisplatin 75 mg/m² were administered every 3 weeks. Results : Of the 23 patients, 3 refused treatment during the concurrent chemoradiotherapy. One patient died of bacterial pneumonia during the concurrent chemoradiotherapy. Grade 2 radiation esophagitis was observed in 4 patients (17%). Sixteen patients received consolidation chemotherapy. During the consolidation chemotherapy, 8 patients (50%) experienced grade 3 or 4 neutropenia and one of those patients died of neutropenic sepsis. The overall response rate for the 20 evaluable patients was 90%, including 4 complete responses (20%) and 14 partial responses (70%). Among the 18 responders, 9 had local failure, 3 had local and distant failure and 2 had distant failure only. Median progression-free survival time was 9.5 months and the 2-year progression-free survival rate was 19%. Eleven patients received second-line or third-line chemotherapy after treatment failure. The median overall survival time was 21 months. The 2-year and 5-year survival rates were 43% and 33%, respectively.
Age, performance status, and tumor size were significant prognostic factors for progression-free survival. Conclusion : Concurrent chemoradiotherapy with weekly paclitaxel showed a high response rate and a low toxicity rate. However, local failure occurred frequently after remission, and large tumor size was a poor prognostic factor. Further investigations are needed to improve local control.

Twitter Issue Tracking System by Topic Modeling Techniques (토픽 모델링을 이용한 트위터 이슈 트래킹 시스템)

  • Bae, Jung-Hwan;Han, Nam-Gi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.109-122
    • /
    • 2014
  • People are nowadays creating a tremendous amount of data on Social Network Service (SNS). In particular, the incorporation of SNS into mobile devices has resulted in massive amounts of data generation, thereby greatly influencing society. This is an unmatched phenomenon in history, and now we live in the Age of Big Data. SNS Data is defined as a condition of Big Data where the amount of data (volume), data input and output speeds (velocity), and the variety of data types (variety) are satisfied. If someone intends to discover the trend of an issue in SNS Big Data, this information can be used as a new important source for the creation of new values because this information covers the whole of society. In this study, a Twitter Issue Tracking System (TITS) is designed and established to meet the needs of analyzing SNS Big Data. TITS extracts issues from Twitter texts and visualizes them on the web. The proposed system provides the following four functions: (1) Provide the topic keyword set that corresponds to daily ranking; (2) Visualize the daily time series graph of a topic for the duration of a month; (3) Provide the importance of a topic through a treemap based on the score system and frequency; (4) Visualize the daily time-series graph of keywords by searching the keyword; The present study analyzes the Big Data generated by SNS in real time. SNS Big Data analysis requires various natural language processing techniques, including the removal of stop words, and noun extraction for processing various unrefined forms of unstructured data. In addition, such analysis requires the latest big data technology to process rapidly a large amount of real-time data, such as the Hadoop distributed system or NoSQL, which is an alternative to relational database. We built TITS based on Hadoop to optimize the processing of big data because Hadoop is designed to scale up from single node computing to thousands of machines. 
Furthermore, we use MongoDB, which is classified as a NoSQL database. In addition, MongoDB is an open source platform, document-oriented database that provides high performance, high availability, and automatic scaling. Unlike existing relational database, there are no schema or tables with MongoDB, and its most important goal is that of data accessibility and data processing performance. In the Age of Big Data, the visualization of Big Data is more attractive to the Big Data community because it helps analysts to examine such data easily and clearly. Therefore, TITS uses the d3.js library as a visualization tool. This library is designed for the purpose of creating Data Driven Documents that bind document object model (DOM) and any data; the interaction between data is easy and useful for managing real-time data stream with smooth animation. In addition, TITS uses a bootstrap made of pre-configured plug-in style sheets and JavaScript libraries to build a web system. The TITS Graphical User Interface (GUI) is designed using these libraries, and it is capable of detecting issues on Twitter in an easy and intuitive manner. The proposed work demonstrates the superiority of our issue detection techniques by matching detected issues with corresponding online news articles. The contributions of the present study are threefold. First, we suggest an alternative approach to real-time big data analysis, which has become an extremely important issue. Second, we apply a topic modeling technique that is used in various research areas, including Library and Information Science (LIS). Based on this, we can confirm the utility of storytelling and time series analysis. Third, we develop a web-based system, and make the system available for the real-time discovery of topics. The present study conducted experiments with nearly 150 million tweets in Korea during March 2013.
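The first TITS function, a daily-ranked keyword set, can be illustrated with a much simpler stand-in. The sketch below uses plain frequency counting after stop-word removal rather than the topic modeling, noun extraction, Hadoop, or MongoDB pipeline the abstract describes; the stop-word list, function name, and sample tweets are all hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical, deliberately tiny stop-word list for illustration only.
STOP_WORDS = {"the", "a", "is", "of", "and", "to", "in"}

def daily_keyword_ranking(tweets, top_k=3):
    """Rank keywords per day by frequency.

    tweets: iterable of (date_string, text) pairs.
    Returns {date_string: [(keyword, count), ...]} with at most top_k entries.
    """
    counts = defaultdict(Counter)
    for day, text in tweets:
        # Lowercase and strip basic punctuation; a real system would use a tokenizer.
        tokens = [t.strip(".,!?#@").lower() for t in text.split()]
        counts[day].update(t for t in tokens if t and t not in STOP_WORDS)
    return {
        day: sorted(c.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]
        for day, c in counts.items()
    }

sample_tweets = [
    ("2013-03-01", "The election is near"),
    ("2013-03-01", "Election debate tonight"),
    ("2013-03-02", "Rain in Seoul"),
]
ranking = daily_keyword_ranking(sample_tweets)
```

The per-day counts produced this way are also the kind of series that a daily time-series graph of a keyword would plot.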