• Title/Summary/Keyword: 과학 빅데이터

Search Result 519, Processing Time 0.026 seconds

An Introduction and Trend Analysis in Questions of Engineer Big Data Analyst (빅데이터분석 기사 국가기술자격 개요 및 출제 경향 분석)

  • Jang, Hee-Seon;Song, Ji Young
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.01a
    • /
    • pp.393-394
    • /
    • 2022
  • 본 논문에서는 과학기술정보통신부와 통계청에서 주관하고 한국산업인력공단에서 시행(한국데이터산업진흥원 위탁)하는 「빅데이터분석기사」에 대한 필기 및 실기 시험의 내용을 설명하고 지금까지 2회에 걸쳐 시행된 시험에 대한 문제점과 이에 대한 해결방안을 제시하였다. 2021년 처음 시행된 국가기술자격으로써 기존 자격증과의 차별성, 난이도 조정, 수험생들의 각종 민원 발생 등의 문제를 해결하기 위한 체계적인 시스템 마련이 요구되며, 향후 데이터 과학자들에 대한 수요 급증에 대비하기 위해 빅데이터분석 실무 능력을 평가하기 위한 바람직한 제도와 정책이 병행되어야 한다.

  • PDF

A Exploratory Study on Big-data based Election Campaign Strategy Model in South Korea (빅데이터 기반 선거캠페인 전략에 관한 탐색적 연구)

  • Noh, Kyoo-Sung
    • Journal of Digital Convergence
    • /
    • v.11 no.12
    • /
    • pp.113-120
    • /
    • 2013
  • The victory of Barack Obama in the presidential reelection, in which he got closer to voters by scientific election strategy based on data, is making a new paradigm of this scientific election mechanism. But it is within bounds to say that Korean election has developed based on emotional confrontation, rather than on the confrontation of policy or personal qualification. This study suggests a Big data-based election campaign strategy in an effort to reduce the harmful consequences of Korean election and to settle down a desirable campaign culture. To do so, this study examines the actual status and problems of Korean politics and election campaign. And then it designs a Korean election strategy model using Big data as an alternative to break through the problems. Last, it discusses the plan to utilize Big data.

How to Identify Customer Needs Based on Big Data and Netnography Analysis (빅데이터와 네트노그라피 분석을 통합한 온라인 커뮤니티 고객 욕구 도출 방안: 천기저귀 온라인 커뮤니티 사례를 중심으로)

  • Soonhwa Park;Sanghyeok Park;Seunghee Oh
    • Information Systems Review
    • /
    • v.21 no.4
    • /
    • pp.175-195
    • /
    • 2019
  • This study conducted both big data and netnography analysis to analyze consumer needs and behaviors of online consumer community. Big data analysis is easy to identify correlations, but causality is difficult to identify. To overcome this limitation, we used netnography analysis together. The netnography methodology is excellent for context grasping. However, there is a limit in that it is time and costly to analyze a large amount of data accumulated for a long time. Therefore, in this study, we searched for patterns of overall data through big data analysis and discovered outliers that require netnography analysis, and then performed netnography analysis only before and after outliers. As a result of analysis, the cause of the phenomenon shown through big data analysis could be explained through netnography analysis. In addition, it was able to identify the internal structural changes of the community, which are not easily revealed by big data analysis. Therefore, this study was able to effectively explain much of online consumer behavior that was difficult to understand as well as contextual semantics from the unstructured data missed by big data. The big data-netnography integrated model proposed in this study can be used as a good tool to discover new consumer needs in the online environment.

A Study of Big Data Transmission using Science DMZ (Science DMZ를 활용한 빅데이터의 효율적 공유 방법 연구)

  • Choi, Won-Jun;Kim, Jae-Hun;Kim, Sun-Young;Kim, Jeong-Hwan
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2017.11a
    • /
    • pp.73-74
    • /
    • 2017
  • 제4차 산업혁명(클라우드, 빅데이터, Internet of Things, 인공지능 등)과 관련된 다양한 기술이 개발되고 연구가 되면서 네트워크를 사용하는 다양한 기기(모바일, 클라우드 서버 등)에서 축적되는 데이터의 양도 점점 증가하고 있다. 데이터 전송 서비스를 지원하는 기관이나 데이터 전송을 위한 인프라를 지원하는 기관에서는 종종 데이터가 빅데이터가 되어갈수록 기관 간의 효율적인 데이터 전송이 어려워 직접 방문하여 필요한 정보만 전달 받기도 한다. 초고속 네트워크 시대에 우리는 과학기술정보의 효율적인 공유를 위한 신속한 전송이 필요하게 되었고 이러한 방법을 Science DMZ 환경에서 문제를 풀어 보고자 한다.

A Big Data Learning for Patent Analysis (특허분석을 위한 빅 데이터학습)

  • Jun, Sunghae
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.23 no.5
    • /
    • pp.406-411
    • /
    • 2013
  • Big data issue has been considered in diverse fields. Also, big data learning has been required in all areas such as engineering and social science. Statistics and machine learning algorithms are representative tools for big data learning. In this paper, we study learning tools for big data and propose an efficient methodology for big data learning via legacy data to practical application. We apply our big data learning to patent analysis, because patent is one of big data. Also, we use patent analysis result for technology forecasting. To illustrate how the proposed methodology could be applied in real domain, we will retrieve patents related to big data from patent databases in the world. Using searched patent data, we perform a case study by text mining preprocessing and multiple linear regression of statistics.

Multi-model Typhoon Simulation for Big Data Analysis and Prediction (빅데이터 분석 및 예측을 위한 멀티모델 태풍 시뮬레이션)

  • Kang, Ji-Sun;Yuk, Jin-Hee;Joh, Minsu
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2017.05a
    • /
    • pp.291-292
    • /
    • 2017
  • 한국과학기술정보연구원 융합기술연구본부 재난대응HPC연구센터에서는 초고성능컴퓨팅 기반의 풍수해 예측 및 피해 정보 생산기술을 연구개발하여 재난 재해에 대한 국가현안 대응 의사결정지원 시스템을 구축 중에 있다. HPC 기반의 풍수해 예측 시스템과 빅데이터 분석 기반의 피해 예측 시스템에 대한 연구를 독자적으로 진행하는 가운데, 최근 여러 분야에 적용되고 있는 빅데이터 분석 기술을 HPC 기반의 풍수해 예측 시스템에 적목시켜 더 정확하고 신속한 풍수해 예측 정보 생산에 기여하고자 한다. 본 연구는 빅데이터 분석을 위한 학습 데이터 생산을 목적으로 HPC 기반 태풍 예측의 주요 기상 인자들을 조정하여 서로 다른 성능의 예측 모델을 구축하고, 각 모델 별 태풍 시뮬레이션의 성능을 진단하였다. 향후 빅데이터 분석을 통한 예측 성능의 검증을 위해 HPC 기반 풍수해 예측 및 검증 데이터를 최대한 생산하고자 한다.

  • PDF

Performance Evaluation of All-Reduce Algorithms on Nurion System (누리온 시스템에서의 All-Reduce 알고리즘 성능평가)

  • Myung, Hunjoo;Jeong, Kimoon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.11a
    • /
    • pp.116-118
    • /
    • 2020
  • GPU 기술과 빅데이터의 성장에 힘입어 최근 딥러닝 기술은 괄목할만한 성장을 이루었고, 구글, 페이스북, 우버 등의 빅데이터를 보유한 업체들과 슈퍼컴퓨팅분야에서는 이러한 빅데이터를 빠른 시간 안에 학습하기 위해 분산 딥러닝 기술을 연구해오고 있다. 이러한 대규모 분산 딥러닝에서는 집합 통신, IO 부하 등이 주요 병목으로 알려져 있다. 본 연구에서는 분산 딥러닝에서 시도되고 있는 주요 All-Reduce 알고리즘들에 대해 누리온 시스템에서 성능평가를 수행하였고, 512노드 이상의 대규모에서는 2D-torus 알고리즘이 우수한 성능을 보였다.

Suggestions on how to convert official documents to Machine Readable (공문서의 기계가독형(Machine Readable) 전환 방법 제언)

  • Yim, Jin Hee
    • The Korean Journal of Archival Studies
    • /
    • no.67
    • /
    • pp.99-138
    • /
    • 2021
  • In the era of big data, analyzing not only structured data but also unstructured data is emerging as an important task. Official documents produced by government agencies are also subject to big data analysis as large text-based unstructured data. From the perspective of internal work efficiency, knowledge management, records management, etc, it is necessary to analyze big data of public documents to derive useful implications. However, since many of the public documents currently held by public institutions are not in open format, a pre-processing process of extracting text from a bitstream is required for big data analysis. In addition, since contextual metadata is not sufficiently stored in the document file, separate efforts to secure metadata are required for high-quality analysis. In conclusion, the current official documents have a low level of machine readability, so big data analysis becomes expensive.

Development and Application of Middle School STEAM Program Using Big Data of World Wide Telescope (WWT 빅데이터를 활용한 중학교 STEAM 프로그램 개발 및 적용)

  • You, Samgmi;Kim, Hyoungbum;Kim, Yonggi;Kim, Heoungtae
    • Journal of the Korean Society of Earth Science Education
    • /
    • v.14 no.1
    • /
    • pp.33-47
    • /
    • 2021
  • This study developed a big data-based STEAM (Science, Technology, Engineering, Art & Mathematics) program using WWT (World Wide Telescope), focusing on content elements of 'solar system', 'star and universe' in the 2015 revised science curriculum, and in order to find out the effectiveness of the STEAM program, analyzed creative problem solving, STEAM attitude, and STEAM satisfaction by applying it to one middle school 176 students simple random sampled. The results of this study are as follows. First, we developed a program to encourage students to actively and voluntarily participating, utilizing the astronomical data platform WWT. Second, in the paired t-test based on the difference between the pre- and post-scores of the creative problem solving measurement test, significant statistical test results were shown in 'idea adaptation', 'imaging', 'analogy', 'idea production' and 'elaboration' sub-factors except 'attention task' sub-factor (p < .05). Third, in the paired t-test based on the difference between the pre- and post-scores of the STEAM attitude test, significant statistical test results were shown in 'interest', 'communication', 'self-concept', 'self-efficacy' and 'science and engineering career choice' sub-factors except 'consideration' and 'usefulness / value recognition' sub-factors (p < .05). Fourth, in the STEAM satisfaction test conducted after class application, the average values of sub-factors were 3.16~3.90. The results indicated that students' understanding and interest in the science subject improved significantly through the big data-based STEAM program using the WWT.

Personal Recommendation Service Design Through Big Data Analysis on Science Technology Information Service Platform (과학기술정보 서비스 플랫폼에서의 빅데이터 분석을 통한 개인화 추천서비스 설계)

  • Kim, Dou-Gyun
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.28 no.4
    • /
    • pp.501-518
    • /
    • 2017
  • Reducing the time it takes for researchers to acquire knowledge and introduce them into research activities can be regarded as an indispensable factor in improving the productivity of research. The purpose of this research is to cluster the information usage patterns of KOSEN users and to suggest optimization method of personalized recommendation service algorithm for grouped users. Based on user research activities and usage information, after identifying appropriate services and contents, we applied a Spark based big data analysis technology to derive a personal recommendation algorithm. Individual recommendation algorithms can save time to search for user information and can help to find appropriate information.