참고문헌
- 한선화, "Science Big Data: Grand Challenges", IT 21 Global Conference, 2012
- 조성우, "Big Data 시대의 기술", 중앙연구소 Intelligent Knowledge Service
- CERN, http://cern.org
- Complete Genomics, www.completegenomics.com/
- 이명진, "빅 데이터 환경의 고급 분석 기법과 지원 기술 동향", 연세대학교 지식정보화연구소
- Suresh Srinivas, "HDFS Federation", Yahoo! Inc.
- 이미영, 분산 스트림 컴퓨팅 기술 동향 ,ETRI
- Bio Science, "Data Intensive Science: A New Paradigm for Biodiversity Studies"
- KAIST 그리드 미들웨어 연구 센터, "시멘틱 그리드 기반 의 생물정보 지식 발굴 시스템 구축 연구
- "Data Cleansing", http://en.wikipedia.org/wiki/Data_cleansing
- Erhard Rahm, Hong Hai Do, "Data Cleaning: Problems and Current Approaches", 2000
- Google, "Google Refine Tutorial"
- R. Catell, "Scalable SQL and NoSQL Data Stores", 2011
- S. Gilbert, N. Lynch, "Brewer's Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services"
- T. V. Ganesh, "When NoSQL makes better sense than MySQL", 2011
- "NoSQL", http://en.wikipedia.org/wiki/NoSQL
- F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, R. E. Gruber, "Bigtable: A Distributed Storage System for Structured Data", Google, Inc.
- Dhruba Borthakur, "The Hadoop Distributed File System: Architecture and Design"
- J. Dean, S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters", OSDI, 2004
- W. Y. Chen, Y. Song, H. Bai, C. J. Lin, E. Y. Chang, "Parallel Spectral Clustering in Distributed Systems"
- "map/Reduce 개념", http://nadayyh.springnote.com/pages/6064905
- A. Matsunaga, M. Tsugawa, J. Fortes, "CloudBlast: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Application"
- I. H. Witten, "Text Mining"
- "Cluster Analysis: Basic Concepts and Algorithms"
- J. Ekanayake, S. Pallickara, G. Fox, "MapReduce for Data Intensive Scientific Analyses"
- An Oracle White Paper, "Oracle: Big Data for the Enterprise"
- E. Pednault, Big Data Platforms, Tools, and Research at IBM
- IBM, Why IBM for Big Data
- IBM, "InfoSphere Streams", www-01.ibm.com
- OLAP,, http://www.terms.co.kr/OLAP.htm
- IBM, "IBM Netezza 1000, www-01.ibm.com"
- X. Fei, S. Lu, C. Lin, "A MapReduce-Enable Scientific Workflow Composition Framework
- J. Wang, D. Crawl, I. Altintas, "Kepler + Hadoop: A General Architecture Facilitating Data-Intensive Aplications in Scientific Workflow Systems