• 제목/요약/키워드: Mining project

검색결과 160건 처리시간 0.026초

텍스트마이닝을 활용한 건설실무정보의 특성 분석 - 건설기술, 사례, 원가절감 등 정보를 중심으로 - (Analysis on the Characteristics of Construction Practice Information Using Text Mining: Focusing on Information Such as Construction Technology, Cases, and Cost Reduction)

  • 정성윤;김진욱
    • 한국문헌정보학회지
    • /
    • 제56권4호
    • /
    • pp.205-222
    • /
    • 2022
  • 본 연구는 전문지식을 갖지 않은 건설기술자와 건설사업 참여자가 건설 실무에서 중요도가 높은 단어와 단어 간의 상호 연관관계를 쉽게 이해할 수 있도록 정보서비스를 개선하고자 하였다. 이를 위해 텍스트마이닝과 네트워크 중심성을 이용하여 건설기술정보시스템에서 가장 많이 사용하고 있는 기술정보, 사례정보 및 원가절감 등 건설실무정보에 대해 단어의 출현 빈도, 주제 모형화, 네트워크 중심성을 분석하였다. 이러한 분석을 통해 도로, 포장, 교량, 터널 등 도로공사와 관련한 설계, 시공, 사업관리, 시방·기준, 유지관리 등이 건설 실무에서 중요한 정보로 파악되었다. 또한, 연결 중심성과 고유벡터 중심성 측정을 통해 중요도가 높은 단어 간의 상관도를 분석하였다. 상관도 분석을 통해 기술정보를 확충한다면 보다 유용한 정보를 제공할 수 있다는 결과를 얻었다. 끝으로, 연구 결과가 갖는 제약과 이에 따른 추가적인 연구를 제시하였다.

Factors Clustering Approach to Parametric Cost Estimates And OLAP Driver

  • JaeHo, Cho;BoSik, Son;JaeYoul, Chun
    • 국제학술발표논문집
    • /
    • The 3th International Conference on Construction Engineering and Project Management
    • /
    • pp.707-716
    • /
    • 2009
  • The role of cost modeller is to facilitate the design process by systematic application of cost factors so as to maintain a sensible and economic relationship between cost, quantity, utility and appearance which thus helps in achieving the client's requirements within an agreed budget. There are a number of research on cost estimates in the early design stage based on the improvement of accuracy or impact factors. It is common knowledge that cost estimates are undertaken progressively throughout the design stage and make use of the information that is available at each phase, through the related research up to now. In addition, Cost estimates in the early design stage shall analyze the information under the various kinds of precondition before reaching the more developed design because a design can be modified and changed in all process depending on clients' requirements. Parametric cost estimating models have been adopted to support decision making in a changeable environment, in the early design stage. These models are using a similar instance or a pattern of historical case to be constituted in project information, geographic design features, relevant data to quantity or cost, etc. OLAP technique analyzes a subject data by multi-dimensional points of view; it supports query, analysis, comparison of required information by diverse queries. OLAP's data structure matches well with multiview-analysis framework. Accordingly, this study implements multi-dimensional information system for case based quantity data related to design information that is utilizing OLAP's technology, and then analyzes impact factors of quantity by the design criteria or parameter of the same meaning. On the basis of given factors examined above, this study will generate the rules on quantity measure and produce resemblance class using clustering of data mining. These sorts of knowledge-base consist of a set of classified data as group patterns, of which will be appropriate stand on the parametric cost estimating method.

  • PDF

Advanced Information Data-interactive Learning System Effect for Creative Design Project

  • Park, Sangwoo;Lee, Inseop;Lee, Junseok;Sul, Sanghun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권8호
    • /
    • pp.2831-2845
    • /
    • 2022
  • Compared to the significant approach of project-based learning research, a data-driven design project-based learning has not reached a meaningful consensus regarding the most valid and reliable method for assessing design creativity. This article proposes an advanced information data-interactive learning system for creative design using a service design process that combines a design thinking. We propose a service framework to improve the convergence design process between students and advanced information data analysis, allowing students to participate actively in the data visualization and research using patent data. Solving a design problem by discovery and interpretation process, the Advanced information-interactive learning framework allows the students to verify the creative idea values or to ideate new factors and the associated various feasible solutions. The student can perform the patent data according to a business intelligence platform. Most of the new ideas for solving design projects are evaluated through complete patent data analysis and visualization in the beginning of the service design process. In this article, we propose to adapt advanced information data to educate the service design process, allowing the students to evaluate their own idea and define the problems iteratively until satisfaction. Quantitative evaluation results have shown that the advanced information data-driven learning system approach can improve the design project - based learning results in terms of design creativity. Our findings can contribute to data-driven project-based learning for advanced information data that play a crucial role in convergence design in related standards and other smart educational fields that are linked.

스마트팜 열환경 모델링을 위한 Open source 기반 Data mining 기법 분석 (A Benchmark of Open Source Data Mining Package for Thermal Environment Modeling in Smart Farm(R, OpenCV, OpenNN and Orange))

  • 이준엽;오종우;이동훈
    • 한국농업기계학회:학술대회논문집
    • /
    • 한국농업기계학회 2017년도 춘계공동학술대회
    • /
    • pp.168-168
    • /
    • 2017
  • ICT 융합 스마트팜 내의 환경계측 센서, 영상 및 사양관리 시스템의 증가에도 불구하고 이들 장비에서 확보되는 데이터를 적절히 유효하게 활용하는 기술이 미흡한 실정이다. 돈사의 경우 가축의 복지수준, 성장 변화를 실시간으로 모니터링 및 예측할 수 있는 데이터 분석 및 모델링 기술 확보가 필요하다. 이를 위해선 가축의 생리적 변화 및 행동적 변화를 조기에 감지하고 가축의 복지수준을 실시간으로 감시하고 분석 및 예측 기술이 필요한데 이를 위한 대표적인 정보 통신 공학적 접근법 중에 하나가 Data mining 이다. Data mining에 대한 연구 수행에 필요한 다양한 소프트웨어 중에서 Open source로 제공이 되는 4가지 도구를 비교 분석하였다. 스마트 돈사 내에서 열환경 모델링을 목표로 한 데이터 분석에서 고려해야할 요인으로 데이터 분석 알고리즘 도출 시간, 시각화 기능, 타 라이브러리와 연계 기능 등을 중점 적으로 분석하였다. 선정된 4가지 분석 도구는 1) R(https://cran.r-project.org), 2) OpenCV(http://opencv.org), 3) OpenNN (http://www.opennn.net), 4) Orange(http://orange.biolab.si) 이다. 비교 분석을 수행한 운영체제는 Linux-Ubuntu 16.04.4 LTS(X64)이며, CPU의 클럭속도는 3.6 Ghz, 메모리는 64 Gb를 설치하였다. 개발언어 측면에서 살펴보면 1) R 스크립트, 2) C/C++, Python, Java, 3) C++, 4) C/C++, Python, Cython을 지원하여 C/C++ 언어와 Python 개발 언어가 상대적으로 유리하였다. 데이터 분석 알고리즘의 경우 소스코드 범위에서 라이브러리를 제공하는 경우 Cross-Platform 개발이 가능하여 여러 운영체제에서 개발한 결과를 별도의 Porting 과정을 거치지 않고 사용할 수 있었다. 빌트인 라이브러리 경우 순서대로 R 의 경우 가장 많은 수의 Data mining 알고리즘을 제공하고 있다. 이는 R 운영 환경 자체가 개방형으로 되어 있어 온라인에서 추가되는 새로운 라이브러리를 클라우드를 통하여 공유하기 때문인 것으로 판단되었다. OpenCV의 경우 영상 처리에 강점이 있었으며, OpenNN은 신경망학습과 관련된 라이브러리를 소스코드 레벨에서 공개한 것이 강점이라 할 수 있다. Orage의 경우 라이브러리 집합을 제공하는 것에 중점을 둔 다른 패키지와 달리 시각화 기능 및 망 구성 등 사용자 인터페이스를 통합하여 운영한 것이 강점이라 할 수 있다. 열환경 모델링에 요구되는 시간 복잡도에 대응하기 위한 부가 정보 처리 기술에 대한 연구를 수행하여 스마트팜 열환경 모델링을 실시간으로 구현할 수 있는 방안 연구를 수행할 것이다.

  • PDF

국가 융합 R&D 특성 분석에 관한 연구: 텍스트분석을 중심으로 (Feature Analyze and Research of National Convergence R&D: With Focus on the Text Mining)

  • 유기철;이태희;최상현;이정환
    • Journal of Information Technology Applications and Management
    • /
    • 제27권1호
    • /
    • pp.59-73
    • /
    • 2020
  • There is a growing interest in convergence. National R & D is also providing various policies and institutional support to promote convergence research. Convergence research, however, does not clearly specify its characteristics at the academic and government levels. This research proceeds with the process of collecting, refining, analyzing, modeling, verifying and visualizing national R & D data through the National Science and Technology Information Service (NTIS). The method is to derive the convergence research characteristics and to derive through text mining, focusing on the unstructured information of national R & D project data. The study confirmed that there was a difference in perception between the definition of converged research and the research site. In order to improve this, the research suggested that convergence among research subjects, collaboration among research topics reflecting various backgrounds and characteristics of researchers, and analysis of characteristics of convergence research using information were suggested in the process of establishing convergence policy.

바이오데이터베이스와 도구를 활용한 바이오인포매틱스의 동향 (Current Status of Bioinformatics on Bio-databases and it Tools)

  • 임달혁;전수경;박완규;이영주
    • Journal of Pharmaceutical Investigation
    • /
    • 제34권1호
    • /
    • pp.73-79
    • /
    • 2004
  • The union of information-technology and biology presents great possibilities to both applications of bio-information and development of science and technology. Also, meaningful analysis of bio-information brings about a new innovation in the field of bio-market with the advent and growth of bioinformatics. Hence, bioinformatics is the most import aspect for establishing a science-technology-oriented society in the $21^{st}$ century. This article provides trends in current state of bioinformatics. Technological development of bioinformatics for the rapid growth of bio-industry means that using bioinformatics, a biologist can process and store enormous amount of data such as current Human Genome Project and future data in the field of biology. We have manly looked at the tends of bio-information, databases and mining tools that are generally used, and strategies and directions for the future.

부상기술 예측을 위한 특허키워드정보분석에 관한 연구 - GHG 기술 중심으로 (Patent Keyword Analysis for Forecasting Emerging Technology : GHG Technology)

  • 최도한;김갑조;박상성;장동식
    • 디지털산업정보학회논문지
    • /
    • 제9권2호
    • /
    • pp.139-149
    • /
    • 2013
  • As the importance of technology forecasting while countries and companies manage the R&D project is growing bigger, the methodology of technology forecasting has been diversified. One of the forecasting method is patent analysis. This research proposes quick forecasting process of emerging technology based on keyword approach using text mining. The forecasting process is following: First, the term-document matrix is extracted from patent documents by using text mining. Second, emerging technology keyword are extracted by analyzing the importance of word from utilizing mean values and standard deviation values of the term and the emerging trend of word discovered from time series information of the term. Next, association between terms is measured by using cosine similarity. finally, the keyword of emerging technology is selected in consequence of the synthesized result and we forecast the emerging technology according to the results. The technology forecasting process described in this paper can be applied to developing computerized technology forecasting system integrated with various results of other patent analysis for decision maker of company and country.

데이터 마이닝 기반의 건설 생산성 예측 모델 개발 (The Development of a Construction Productivity Prediction Model Based on Data Mining)

  • 우기범;안지성;오세욱;김영석
    • 한국건설관리학회:학술대회논문집
    • /
    • 한국건설관리학회 2007년도 정기학술발표대회 논문집
    • /
    • pp.813-818
    • /
    • 2007
  • 건설 프로젝트에서 수집되는 생산성 정보는 공사 진행의 효율성 파악, 작업여건 및 투입자원의 분석, 프로젝트의 성과측정 등에 활용될 수 있을 뿐만 아니라 향후 공사계획 수립에 있어 유용하게 사용될 수 있는 매우 중요한 실적자료이다. 그러나 이와 같은 생산성 정보의 중요성에도 불구하고 기존의 국내 건설 산업은 생산성 데이터의 수집 및 측정방법 등이 아직 체계화 되어있지 못하고 생산성 데이터의 활용도 미진하며 이로 인해 대부분의 공사계획 수립을 현장관리자의 경험과 직관에 의존하고 있어 계획 대비 실적에 대한 신뢰도가 그만큼 저하될 수밖에 없는 실정이다. 따라서 본 연구에서는 실제 건설 생산성 데이터의 축적을 통해 이를 향후 공사계획 수립에 유용한 실적자료로서 활용할 수 있는 건설 생산성 예측모델을 제시하고자 한다.

  • PDF

Semantic Trajectory Based Behavior Generation for Groups Identification

  • Cao, Yang;Cai, Zhi;Xue, Fei;Li, Tong;Ding, Zhiming
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제12권12호
    • /
    • pp.5782-5799
    • /
    • 2018
  • With the development of GPS and the popularity of mobile devices with positioning capability, collecting massive amounts of trajectory data is feasible and easy. The daily trajectories of moving objects convey a concise overview of their behaviors. Different social roles have different trajectory patterns. Therefore, we can identify users or groups based on similar trajectory patterns by mining implicit life patterns. However, most existing daily trajectories mining studies mainly focus on the spatial and temporal analysis of raw trajectory data but missing the essential semantic information or behaviors. In this paper, we propose a novel trajectory semantics calculation method to identify groups that have similar behaviors. In our model, we first propose a fast and efficient approach for stay regions extraction from daily trajectories, then generate semantic trajectories by enriching the stay regions with semantic labels. To measure the similarity between semantic trajectories, we design a semantic similarity measure model based on spatial and temporal similarity factor. Furthermore, a pruning strategy is proposed to lighten tedious calculations and comparisons. We have conducted extensive experiments on real trajectory dataset of Geolife project, and the experimental results show our proposed method is both effective and efficient.

Prevention through Design (PtD) of integrating accident precursors in BIM

  • Chang, Soowon;Oh, Heung Jin;Lee, JeeHee
    • 국제학술발표논문집
    • /
    • The 9th International Conference on Construction Engineering and Project Management
    • /
    • pp.94-102
    • /
    • 2022
  • Construction workers are engaged in many activities that may expose them to serious hazards, such as falling, unguarded machinery, or being struck by heavy construction equipment. Despite extensive research in building information modeling (BIM) for safety management, current approaches, detecting safety issues after design completion, may limit the opportunities to prevent predictable and potential accidents when decisions of building materials and systems are made. In this respect, this research proposes a proactive approach to detecting safety issues from the early design phase. This research aims to explore accident precursors and integrate them into BIM for tracking safety hazards during the design development process. Accident precursors can be identified from construction incident reports published by OSHA using a text mining technique. Through BIM-integrated accident precursors, construction safety hazards can be identified during the design phase. The results will contribute to supporting a successful transition from the design stage to the construction stage that considers a safe construction workplace. This will advance the body of knowledge about construction safety management by elucidating a hypothesis that safety hazards can be detected during the design phase involving decisions about materials, building elements, and equipment. In addition, the proactive approach will help the Architecture, Engineering and Construction (AEC) industry eliminate occupational safety hazards before near-miss situations appear on construction sites.

  • PDF