• Title/Summary/Keyword: Big data Processing


Automated Story Generation with Image Captions and Recursive Calls (이미지 캡션 및 재귀호출을 통한 스토리 생성 방법)

  • Isle Jeon;Dongha Jo;Mikyeong Moon
    • Journal of the Institute of Convergence Signal Processing / v.24 no.1 / pp.42-50 / 2023
  • Technological development has driven digital innovation across the media industry, including production and editing techniques, and the era of OTT services and streaming has diversified how consumers view content. The convergence of big data and deep learning networks has made it possible to generate text automatically in formats such as news articles, novels, and scripts, but few studies have reflected the author's intention while generating contextually coherent stories. In this paper, we describe the flow of pictures in a storyboard using image caption generation techniques and automatically generate story-tailored scenarios using language models. Using image captioning based on a CNN and an attention mechanism, we generate sentences that describe the pictures on the storyboard and feed them into KoGPT-2, a natural language processing model, to automatically generate scenarios that match the planning intention. With this approach, scenarios customized to the author's intention and story can be created in large quantities, easing the burden of content creation, and artificial intelligence can take part in the overall process of digital content production, promoting media intelligence.

Comparative Analysis of Economic Efficiency by Major Sericultural Farming Areas in Korea (잠업단지의 경제효율에 관한 비교분석)

  • 이질현;김문협;강석권
    • Journal of Sericultural and Entomological Science / v.14 no.2 / pp.95-103 / 1972
  • The major purpose of this study is to collect information on economic efficiency in order to solve the problems faced by farmers and farming areas, and to provide scientific facts to farmers and related institutions for the further development of the sericultural sector in Korea. To obtain this information, 12 sample areas among 23 major sericultural farming areas, and 30 farm units in each area, were selected and analyzed. The field survey was carried out by members of the study team and graduate students in the Department of Sericultural Science using a prepared questionnaire. Cross-section and regression analysis methods were used to process the data. The major findings are as follows. 1. Sericultural earnings per Tanbo are, on average, 22,752 won in newly cultivated areas and 29,403 won in ordinary ones. Earnings differ greatly by area: 46,968 won in the Kumo mountain area compared with 16,798 won in the Yeoju and Yichun areas. As a general trend, small-scale farming units earn more and operate their farms more efficiently. 2. Cocoon production expenses per Tanbo are 16,737 won in newly cultivated areas and 19,802 won in ordinary areas. Farming expenses also differ greatly by area: 27,389 won in the Sudang area compared with 11,689 won in the Emjin area. 3. Sericultural income per Tanbo is 10,664 won in ordinary areas and 6,898 won in newly cultivated areas. Farmers in the Kumo mountain area earn the highest income, 21,164 won, and those in the Sudang area the lowest, 1,296 won. It can be generalized that farms of about 30-50 a earn higher incomes. 4. Land, labor, and capital productivities estimated by fitting Cobb-Douglas functions are higher in ordinary areas than in newly cultivated areas; labor productivity in particular is higher in ordinary areas. 5. The Changsung, Kwangna, Yunsun, and Kumo mountain areas are technically and economically efficient. The Sudang and Mujinchang areas are technically successful but economically inefficient, and the Emjin and Honam areas are technically inefficient but economically efficient. The Yeoju-Yichun, Chunwon, and West Kyongnam areas are technically and economically inefficient, and technical and economic improvement programs should be implemented for them. 6. The estimated internal rate of return (IRR) on capital investment in the Chongwon area is 23.5 percent. This is economically feasible if the opportunity cost of capital in the economy is taken to be 20 percent.
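The feasibility test in finding 6 compares an internal rate of return with the opportunity cost of capital. The sketch below shows one common way to compute an IRR, by bisection on the net present value. It is not the study's own procedure, and the cash flows are hypothetical values chosen only so that the result matches the reported 23.5 percent; the function names `npv`, `irr`, and `is_feasible` are likewise illustrative.

```python
from typing import Sequence


def npv(rate: float, cash_flows: Sequence[float]) -> float:
    """Net present value of cash flows, where cash_flows[t] occurs at year t."""
    return sum(cf / (1.0 + rate) ** t for t, cf in enumerate(cash_flows))


def irr(cash_flows: Sequence[float], lo: float = 0.0, hi: float = 1.0,
        tol: float = 1e-9) -> float:
    """Rate at which NPV is zero, found by bisection.

    Assumes the NPV changes sign exactly once between lo and hi.
    """
    if npv(lo, cash_flows) * npv(hi, cash_flows) > 0:
        raise ValueError("NPV does not change sign on [lo, hi]")
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if npv(lo, cash_flows) * npv(mid, cash_flows) <= 0:
            hi = mid  # root lies in the lower half
        else:
            lo = mid  # root lies in the upper half
    return (lo + hi) / 2.0


def is_feasible(cash_flows: Sequence[float], opportunity_cost: float) -> bool:
    """An investment is treated as feasible when its IRR exceeds the cost of capital."""
    return irr(cash_flows) > opportunity_cost


# Hypothetical cash flows (not data from the study): invest 100, receive 123.5 a year later.
flows = [-100.0, 123.5]
rate = irr(flows)
feasible = is_feasible(flows, 0.20)  # 20 percent opportunity cost, as in the abstract
```

With these illustrative inputs the IRR is 23.5 percent, which exceeds the 20 percent opportunity cost, so the check returns `True`.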


Design and Implementation of a Protocol for Interworking Open Web Application Store (개방형 웹 애플리케이션 스토어 연동을 위한 프로토콜의 설계 및 구현)

  • Baek, Jihun;Kim, Jihun;Nam, Yongwoo;Lee, HyungUk;Park, Sangwon;Jeon, Jonghong;Lee, Seungyoon
    • KIPS Transactions on Software and Data Engineering / v.2 no.10 / pp.669-678 / 2013
  • As portable devices have become popular, it is now common for a person to carry more than one, and smartphone use keeps growing. With the rapid spread of smartphones, overall use of smartphone applications has also increased. However, each application store still requires a different platform for developing and deploying applications. The application store market is divided into two large markets, Android and Apple, so developers have to build their applications on two different platforms, which nearly doubles the development cost. For other platforms with a still-small market share, such as Bada, the weakness is not cost but attracting developers to build applications for the platform. Web applications are emerging as a solution to these problems because they reduce the cost and time of developing applications for every platform. Web applications do not need to be subordinate to an application market's platform, which allows an application to run properly on every portable device and reduces development time and cost. Therefore, all application markets could be united into one large market through a protocol that connects the individual web application markets. However, there is still no standard for web application stores, and no current web application store can interwork with others. In this paper, we propose such a protocol, develop a prototype, and show that the protocol can remedy this weakness.

Big Data based Tourist Attractions Recommendation - Focus on Korean Tourism Organization Linked Open Data - (빅데이터 기반 관광지 추천 시스템 구현 - 한국관광공사 LOD를 중심으로 -)

  • Ahn, Jinhyun;Kim, Eung-Hee;Kim, Hong-Gee
    • Management & Information Systems Review / v.36 no.4 / pp.129-148 / 2017
  • Conventional exhibition management information systems recommend tourist attractions that are close to the place where an exhibition is held. Some attractions recommended by this location-based approach can be meaningless when they have nothing to do with the exhibition's topic. Our goal is to recommend attractions that are related to the content presented in the exhibition, which we call content-based recommendation. Although human exhibition curators can do this, the quality is limited by their manual effort and knowledge. We propose an automatic way of discovering attractions relevant to an exhibition of interest. Language resources are incorporated to discover attractions that are more meaningful. Because a typical single machine cannot handle such large-scale language resources efficiently, we implemented the algorithm on top of Apache Spark, a well-known distributed computing framework. As a user interface prototype, we implemented a web-based system that provides users with a list of relevant attractions while they browse exhibition information, available at http://bike.snu.ac.kr/WARP. We carried out a case study based on Korean Tourism Organization Linked Open Data with Korean Wikipedia as a language resource. Experimental results demonstrate the efficiency and effectiveness of the proposed system. The effectiveness was evaluated against well-known exhibitions. The proposed approach is expected to contribute to the development of both the exhibition and tourism industries by motivating exhibition visitors to become active tourists.


The Impact of CPO Characteristics on Organizational Privacy Performance (개인정보보호책임자의 특성이 개인정보보호 성과에 미치는 영향)

  • Wee, Jiyoung;Jang, Jaeyoung;Kim, Beomsoo
    • Asia Pacific Journal of Information Systems / v.24 no.1 / pp.93-112 / 2014
  • As personal data breaches have emerged as a problem both domestically and globally, more organizations are appointing chief privacy officers (CPOs). The relevant Korean laws, the 'Personal Data Protection Act' and 'the Act on Promotion of Information and Communication Network Utilization and Information Protection, etc.', require organizations that process personal data to appoint CPOs. As the importance of the CPO is increasingly emphasized, research on the CPO's characteristics and role is called for. Many studies have used Upper Echelon theory to examine the role of top management and its impact on organizational performance. This study investigates how the characteristics of the CPO influence organizational privacy performance. The definition of a CPO varies with industry, organization size, and the required responsibility and power. This study defines a CPO as 'a person who takes responsibility for all the duties on handling the organization's privacy.' It assumes that CPO characteristics such as role, personality, and background knowledge influence organizational privacy performance, and it applies to the CPO the findings on upper-echelon characteristics and the performance of executives (CEOs, CIOs, etc.). First, following Mintzberg and other managerial role classifications, information, strategic, and diplomacy roles are defined as the roles of the CPO. Second, from the "Big Five" taxonomy of individual personality suggested in 1990, extraversion and conscientiousness are drawn as the personality characteristics of the CPO. Third, prior studies suggest that a CPO needs combined knowledge of technology, law, and business, so technical, legal, and business background knowledge are drawn as the CPO's background knowledge. To test this model empirically, 120 samples collected from CPOs of domestic organizations are used. Factor analysis was carried out, convergent and discriminant validity were verified using SPSS and Smart PLS, and the causal relationships between the CPO's role, personality, and background knowledge and organizational privacy performance were analyzed. The results show that the CPO's diplomacy role and strategic role have significant impacts on organizational privacy performance. This indicates that the CPO needs to communicate actively with other organizations and that a differentiated organizational privacy policy or strategy is also important. Legal and technical background knowledge were also found to be significant determinants of organizational privacy performance. In addition, the CPO's conscientiousness has a positive impact on organizational privacy performance. The practical implications of this study are as follows. First, the research can serve as a yardstick when companies select CPOs and vest authority in them. Second, both companies and CPOs can use the results to judge which abilities CPOs should concentrate on to develop their careers. Cultural and social values, citizens' consensus on the right to privacy, and the expected role of the CPO will change over time. In future work, research based on long-term time-series analysis could reveal these changes and offer practical implications for policy making on information privacy by government and private organizations.

E-Discovery Process Model and Alternative Technologies for an Effective Litigation Response of the Company (기업의 효과적인 소송 대응을 위한 전자증거개시 절차 모델과 대체 기술)

  • Lee, Tae-Rim;Shin, Sang-Uk
    • Journal of Digital Convergence / v.10 no.8 / pp.287-297 / 2012
  • To prepare for the introduction of the E-Discovery system from the United States and to cope with possible changes in legal systems, we propose a general E-Discovery process and the essential tasks of each phase. The proposed process model is based on an analysis of well-known projects such as EDRM and The Sedona Conference, which are leading efforts to standardize E-Discovery task procedures and to supply guidelines to practitioners. In addition, we introduce machine learning algorithms, open-source libraries for information retrieval, and Hadoop-based distributed processing technologies for big data, and propose ways to apply them to E-Discovery work scenarios. This information will be useful to vendors and others who want to develop E-Discovery service solutions. It will also help company owners who want to rebuild their business processes, and it will enable people who are about to face a major lawsuit to handle the situation effectively.

De-cloaking Malicious Activities in Smartphones Using HTTP Flow Mining

  • Su, Xin;Liu, Xuchong;Lin, Jiuchuang;He, Shiming;Fu, Zhangjie;Li, Wenjia
    • KSII Transactions on Internet and Information Systems (TIIS) / v.11 no.6 / pp.3230-3253 / 2017
  • Android malware steals users' private information, and embedded unsafe advertisement (ad) libraries execute unsafe code that harms users. Most of the resulting traffic is HTTP and is mixed with normal traffic, which makes detecting malware and unsafe ad libraries a challenging problem. To address this problem, this work describes a novel HTTP traffic flow mining approach to detect and categorize Android malware and unsafe ad libraries. This work designed AndroCollector, which automatically executes an Android application (app) and collects its network traffic traces. From these traces, this work extracts HTTP traffic features along three important dimensions (quantitative, timing, and semantic) and uses these features to characterize malware and unsafe ad libraries. Based on these HTTP traffic features, this work describes a supervised classification scheme for detecting malware and unsafe ad libraries. In addition, to help network operators, this work describes a fine-grained categorization method that generates fingerprints from HTTP request methods for each malware family and unsafe ad library. This work evaluated the scheme using HTTP traffic traces collected from 10,778 Android apps. The experimental results show that the scheme can detect malware with 97% accuracy and unsafe ad libraries with 95% accuracy when tested on popular third-party Android markets.

Outdoor Positioning Estimation of Multi-GPS / INS Integrated System by EKF / UPF Filter Conversion (EKF/UPF필터 변환을 통한 Multi-GPS/INS 융합 시스템의 실외 위치추정)

  • Choi, Seung-Hwan;Kim, Gi-Jeung;Kim, Yun-Ki;Lee, Jang-Myung
    • Journal of Institute of Control, Robotics and Systems / v.20 no.12 / pp.1284-1289 / 2014
  • In this paper, an outdoor position estimation system was implemented using GPS (Global Positioning System) and INS (Inertial Navigation System). GPS position information contains many errors caused by interference from obstacles, weather, and the surrounding environment. To reduce these errors, multiple GPS receivers are used. The Discrete Wavelet Transform was also applied to the INS data to compensate for its error. The position of the mobile robot on a straight line is estimated with an EKF (Extended Kalman Filter). However, position estimation while running through a curve is less accurate than on a straight line because of the phase change during rotation. The curve is recognized through the rate of change of the heading angle, and the position estimation precision in the initial part of the curve is improved with a UPF (Unscented Particle Filter). With the UPF, a large number of particles requires a large amount of memory and slows processing, so it is used only for position estimation in the initial part of the curve. Thereafter, the position of the mobile robot in the curve is estimated by switching from the UPF back to the EKF. Experiments verify the superiority of the proposed system.
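The switching rule described in this abstract (EKF on straight segments, UPF only for the initial part of a curve, then back to EKF) can be sketched as a small selector. This is an illustration under stated assumptions, not the authors' implementation: the function name `choose_filters`, the threshold on the heading rate, and the fixed number of UPF steps are all hypothetical, and heading wrap-around (for example 359° to 1°) is ignored for brevity.

```python
from typing import List


def choose_filters(headings_deg: List[float], dt: float,
                   rate_threshold: float, upf_steps: int) -> List[str]:
    """Label each sample with the filter to run: "EKF" or "UPF".

    A curve is assumed to start when the absolute heading rate first
    exceeds rate_threshold (deg/s). The UPF is used for at most upf_steps
    samples after curve entry; all other samples use the EKF.
    """
    labels = ["EKF"]  # no heading rate is available for the first sample
    in_curve = False
    upf_left = 0
    for prev, cur in zip(headings_deg, headings_deg[1:]):
        turning = abs(cur - prev) / dt > rate_threshold
        if turning and not in_curve:
            upf_left = upf_steps  # curve entry: grant the UPF a step budget
        in_curve = turning
        if turning and upf_left > 0:
            labels.append("UPF")
            upf_left -= 1
        else:
            labels.append("EKF")
    return labels


# Hypothetical heading trace: straight, then a steady turn, then straight again.
trace = [0.0, 0.0, 5.0, 10.0, 15.0, 20.0, 20.0]
plan = choose_filters(trace, dt=1.0, rate_threshold=2.0, upf_steps=2)
```

Capping the UPF at a fixed step budget reflects the abstract's point that the particle filter is costly in memory and processing time, so it is reserved for the part of the curve where the EKF is least accurate.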

Accuracy Analysis of Unified Control Point Coordinate Using GAMIT/GLOBK Software (GAMIT/GLOBK를 활용한 통합기준점 성과 정확도 분석)

  • Jae Myoung, Cho;Hong Sik, Yun;Dong Ha, Lee
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.33 no.2 / pp.103-110 / 2015
  • This paper plans the adjustment of unified control points by comparing adjustment software for the integrated network and the national integrated network. Because different software has been applied each year, there may be errors in the survey data and in the interpretation of the data processing. To minimize these errors, we performed a precise network adjustment by consolidating the control points per observation session over the years. Before performing the final integrated network adjustment with the GPS analysis program (GLOBK), the Quasi-Observation Combination Analysis (QOCA), the Global Kalman filter VLBI, and GLOBK were compared and analyzed. The integrated network adjustment result indicates that the RMSE was relatively large along the vertical axis, at ±0.03 m, but ±0.006 m along the horizontal, which is not much different from the existing result.

A Study on the Method of Measuring Accessibility to Urban Open Spaces (도시 오픈스페이스의 접근성 측정에 관한 연구)

  • 안동만;최형석;김인호;조형준
    • Journal of the Korean Institute of Landscape Architecture / v.18 no.4 / pp.17-28 / 1991
  • The purpose of this study is to investigate and present a method for measuring public accessibility to urban open spaces. A basic assumption is that, for urban open space policies, accessibility is more important than per capita area. For simplicity, a residential area is assumed to have access to open space if it is within a certain distance of an urban open space. The official city planning map is overlaid with a 200 m grid, and each dwelling cell is checked to see whether it is within a certain distance of a cell categorized as urban open space. A computer program for widely available personal computers was developed for the data processing so that local governments without access to more sophisticated systems can carry out similar studies for their own jurisdictions. Five cities, big and small, old and new, were selected to test the proposed method. The dwelling areas of Ansan New Town have the highest accessibility to open spaces (93.4% of dwelling cells have an open space cell within 500 m). Seoul (91.2%), Suwon (78.2%), Pusan (73.8%), and Inchon (61.4%) have lower accessibility. If we assume that Ansan residents are evenly distributed over the dwelling area, 93.4% of the population has open space within a walking distance of 500 m. However, if we consider physical barriers that reduce accessibility, such as arterial roads, railroads, and streams, less than 93.4% of Ansan residents enjoy good access to open spaces. Although a more detailed analysis is needed to capture accessibility at the microscopic level, this method can serve as a useful tool for urban open space policy and for evaluating open space alternatives.
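The grid check described in this abstract can be sketched in a few lines. This is a minimal illustration, not the program developed in the study: the cell codes, the function name `accessibility_ratio`, the sample grid, and the use of straight-line distance between cell centers are all assumptions, since the abstract does not say how distance between cells was measured.

```python
from typing import List

DWELLING, OPEN_SPACE = "D", "O"  # hypothetical cell codes; any other character is ignored


def accessibility_ratio(grid: List[str], cell_size_m: float, max_dist_m: float) -> float:
    """Share of dwelling cells that have an open-space cell within max_dist_m.

    Distance is measured between cell centers as a straight line (an assumption).
    """
    cells = [(r, c, v) for r, row in enumerate(grid) for c, v in enumerate(row)]
    open_cells = [(r, c) for r, c, v in cells if v == OPEN_SPACE]
    dwellings = [(r, c) for r, c, v in cells if v == DWELLING]
    if not dwellings:
        return 0.0
    limit_sq = (max_dist_m / cell_size_m) ** 2  # squared distance limit, in cell units
    served = sum(
        1
        for r, c in dwellings
        if any((r - orow) ** 2 + (c - ocol) ** 2 <= limit_sq for orow, ocol in open_cells)
    )
    return served / len(dwellings)


# Hypothetical 4 x 5 map on a 200 m grid, checked against a 500 m distance.
sample = [
    "DD...",
    "D.O..",
    ".....",
    "....D",
]
ratio = accessibility_ratio(sample, cell_size_m=200.0, max_dist_m=500.0)
```

In this made-up map, three of the four dwelling cells lie within 500 m of the open-space cell, so the ratio is 0.75. Barriers such as arterial roads or streams, which the abstract notes would lower real accessibility, are not modeled here.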
