• Title/Summary/Keyword: Big date

Search Result 106, Processing Time 0.022 seconds

Building Linked Big Data for Stroke in Korea: Linkage of Stroke Registry and National Health Insurance Claims Data

  • Kim, Tae Jung;Lee, Ji Sung;Kim, Ji-Woo;Oh, Mi Sun;Mo, Heejung;Lee, Chan-Hyuk;Jeong, Han-Young;Jung, Keun-Hwa;Lim, Jae-Sung;Ko, Sang-Bae;Yu, Kyung-Ho;Lee, Byung-Chul;Yoon, Byung-Woo
    • Journal of Korean Medical Science
    • /
    • v.33 no.53
    • /
    • pp.343.1-343.8
    • /
    • 2018
  • Background: Linkage of public healthcare data is useful in stroke research because patients may visit different sectors of the health system before, during, and after stroke. Therefore, we aimed to establish high-quality big data on stroke in Korea by linking acute stroke registry and national health claim databases. Methods: Acute stroke patients (n = 65,311) with claim data suitable for linkage were included in the Clinical Research Center for Stroke (CRCS) registry during 2006-2014. We linked the CRCS registry with national health claim databases in the Health Insurance Review and Assessment Service (HIRA). Linkage was performed using 6 common variables: birth date, gender, provider identification, receiving year and number, and statement serial number in the benefit claim statement. For matched records, linkage accuracy was evaluated using differences between hospital visiting date in the CRCS registry and the commencement date for health insurance care in HIRA. Results: Of 65,311 CRCS cases, 64,634 were matched to HIRA cases (match rate, 99.0%). The proportion of true matches was 94.4% (n = 61,017) in the matched data. Among true matches (mean age 66.4 years; men 58.4%), the median National Institutes of Health Stroke Scale score was 3 (interquartile range 1-7). When comparing baseline characteristics between true matches and false matches, no substantial difference was observed for any variable. Conclusion: We could establish big data on stroke by linking CRCS registry and HIRA records, using claims data without personal identifiers. We plan to conduct national stroke research and improve stroke care using the linked big database.

From Multimedia Data Mining to Multimedia Big Data Mining

  • Constantin, Gradinaru Bogdanel;Mirela, Danubianu;Luminita, Barila Adina
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.11
    • /
    • pp.381-389
    • /
    • 2022
  • With the collection of huge volumes of text, image, audio, video or combinations of these, in a word multimedia data, the need to explore them in order to discover possible new, unexpected and possibly valuable information for decision making was born. Starting from the already existing data mining, but not as its extension, multimedia mining appeared as a distinct field with increased complexity and many characteristic aspects. Later, the concept of big data was extended to multimedia, resulting in multimedia big data, which in turn attracted the multimedia big data mining process. This paper aims to survey multimedia data mining, starting from the general concept and following the transition from multimedia data mining to multimedia big data mining, through an up-to-date synthesis of works in the field, which is a novelty, from our best of knowledge.

Analyzing Box-Office Hit Factors Using Big Data: Focusing on Korean Films for the Last 5 Years

  • Hwang, Youngmee;Kim, Kwangsun;Kwon, Ohyoung;Moon, Ilyoung;Shin, Gangho;Ham, Jongho;Park, Jintae
    • Journal of information and communication convergence engineering
    • /
    • v.15 no.4
    • /
    • pp.217-226
    • /
    • 2017
  • Korea has the tenth largest film industry in the world; however, detailed analyses using the factors contributing to successful film commercialization have not been approached. Using big data, this paper analyzed both internal and external factors (including genre, release date, rating, and number of screenings) that contributed to the commercial success of Korea's top 10 ranking films in 2011-2015. The authors developed a WebCrawler to collect text data about each movie, implemented a Hadoop system for data storage, and classified the data using Map Reduce method. The results showed that the characteristic of "release date," followed closely by "rating" and "genre" were the most influential factors of success in the Korean film industry. The analysis in this study is considered groundwork for the development of software that can predict box-office performance.

Covid 19 News Data Analysis and Visualization

  • Hur, Tai-Sung;Hwang, In-Yong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.4
    • /
    • pp.37-43
    • /
    • 2022
  • In this paper, we calculate the word frequency by date and region using news data related to COVID-19 distributed for about 8 months from December 2019 to July 2020, and visualized the correlation with the current state data of COVID-19 patients using the results. News data was collected from Big Kids, a news big data system operated by the Korea Press Promotion Foundation. The visualization system proposed in this paper shows the news frequency of the selected region compared to the overall region, the key keyword of the selected region, the region of the main keyword, and the date change of the selected region. Through this visualization, the main keywords and trends of COVID-19 confirmed and infected people can be identified for previous events.

Hadoop System Design for Big data Processing of RFID Distribution (RFID/NFC 물류의 빅 데이터 처리를 위한 하둡 시스템의 설계)

  • Kim, Nam-Ho;Noh, Jin-Heon;Jeong, Hee-Ja
    • Smart Media Journal
    • /
    • v.2 no.3
    • /
    • pp.47-53
    • /
    • 2013
  • Recently convergence of IT in logistics system as a typical application RFID/NFC technology is being used, such as, according to the distribution of the flow is generated by a lot of big data. The Hadoop distributed system to collect data items produced by the parallel processing capabilities of logistics information and logistics information for the record management can create. Hadoop system to support the design and development of prototypes were approaching the possibility of its utilization.

  • PDF

A Study on Up-to-date Technology Development in Small & Medium Industries of Korea. (우리나라 중소기업의 첨단기술개발에 관한 연구)

  • 신현재;서승록
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.6 no.9
    • /
    • pp.45-59
    • /
    • 1983
  • This study focuses on the growth and development of small and medium industries of Korea, orienting to the development of up-to-date technology from now on and bolstering their competitive ability in the rapidly changing international markets. For this purpose, the small and medium industries should 1) develop high-level manpower of up-to-date technology, 2) make constant efforts to categorize and divide the fields of technology with big business groups to boost their competitiveness, 3) raise automation rate by turning all facilities into mechatronics, 4) positively develop software know-how, 5) jointly conduct researches in cooperation with venture capital and Governmental research institute, 6) categorize an systematize the industries. On the Governmental level, there should be 1) wide-ranging support and assistance in technology, finance, and the facilities, 2) positive opening of consumer market, 3) assistance in technical cooperation with other nations, 4) and such indirect assistance as fostering the fields of related technology.

  • PDF

Design of a Smart Application using Big Data (빅 데이터를 이용한 스마트 응용의 설계)

  • Oh, Sun-Jin
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.15 no.6
    • /
    • pp.17-24
    • /
    • 2015
  • With the rapid growth of Information technology and up-to-date wireless network application technologies, huge and various types of data are produced in every moment, the value and significance of the analysis techniques using big data are increased recently. Big data, which were useless since they were too huge to manage in the past, enables us to get new inspirations and values in various practical application areas through the development of big data computing devices and analytic tools. Nowadays, however, it is true that most of the big data are still wasted without properly analyzed and used. In the long run, the preliminary stipulations for finding inspirations and extracting new values from big data are securing big data analysis and application techniques to process big data efficiently. In this paper, we study accurate data analysis techniques and data process technologies those are able to extract needed inspirations and values from big data efficiently, then design the smart application that adopts these techniques practically.

A Study on the Frequency and Intensity Variations of Okhotsk High: Focused on the Korean Peninsula (오호츠크해고기압의 출현일과 강도의 변동에 관한 연구 -한반도에 영향을 미친 날을 중심으로-)

  • Cho, Li-Na;Lee, Seung-Ho
    • Journal of the Korean Geographical Society
    • /
    • v.46 no.1
    • /
    • pp.36-49
    • /
    • 2011
  • This paper aims to investigate the frequency and intensity variations of Okhotsk high pressure system focused on the Korean Peninsula. Weather chart (00UTC), daily weather data and reanalysis data were used. The first occurrence date of Okhotsk high pressure system tends to be earlier in those years that surrounding land air temperature in April is high. The frequency of Okhotsk high has recently decreased, and its intensity tends to be stronger when the difference between sea surface temperature and surrounding land air temperature is big. The frequency of Okhotsk high in April, May, June and July increases when surrounding land air temperature is high, and its intensity grows when the difference between surrounding land air temperature and sea surface temperature is big. The frequency of Okhotsk high may increase and its intensity may increase when the first occurrence date comes earlier. In June, however, the reverse may apply.

A Study on the Development of the Use Index of Closed School Facilities Using Big Data -Focused on Text-Mining Techniques- (빅데이터를 활용한 폐교시설의 지표 개발에 관한 연구 -텍스트마이닝 기법을 중심으로-)

  • Kim, Jae-Young;Lee, Jong-Kuk
    • The Journal of Sustainable Design and Educational Environment Research
    • /
    • v.18 no.2
    • /
    • pp.1-11
    • /
    • 2019
  • The purpose of this study is to make objective decisions in the use of closed schools through the development of utilization indicators for the efficient use of closed schools, which is expected to increase continuously. The research phase was largely carried out by drawing preliminary indicators for use in closed schools, drawing final indicators using big data, and quantifying indicators, and finally objectifying them through quantification. The institution intends to apply and verify the facility based on future indicators. This study has implications for the application of big data analysis methods that have not been attempted in planning and research for the use of closed school facilities to date.

An Effective Data Model for Forecasting and Analyzing Securities Data

  • Lee, Seung Ho;Shin, Seung Jung
    • International journal of advanced smart convergence
    • /
    • v.5 no.4
    • /
    • pp.32-39
    • /
    • 2016
  • Machine learning is a field of artificial intelligence (AI), and a technology that collects, forecasts, and analyzes securities data is developed upon machine learning. The difference between using machine learning and not using machine learning is that machine learning-seems similar to big data-studies and collects data by itself which big data cannot do. Machine learning can be utilized, for example, to recognize a certain pattern of an object and find a criminal or a vehicle used in a crime. To achieve similar intelligent tasks, data must be more effectively collected than before. In this paper, we propose a method of effectively collecting data.