Search | Korea Science

Business Application of Convolutional Neural Networks for Apparel Classification Using Runway Image (합성곱 신경망의 비지니스 응용: 런웨이 이미지를 사용한 의류 분류를 중심으로)

Seo, Yian;Shin, Kyung-shik
- Journal of Intelligence and Information Systems
- /
- v.24 no.3
- /
- pp.1-19
- /
- 2018
Large amount of data is now available for research and business sectors to extract knowledge from it. This data can be in the form of unstructured data such as audio, text, and image data and can be analyzed by deep learning methodology. Deep learning is now widely used for various estimation, classification, and prediction problems. Especially, fashion business adopts deep learning techniques for apparel recognition, apparel search and retrieval engine, and automatic product recommendation. The core model of these applications is the image classification using Convolutional Neural Networks (CNN). CNN is made up of neurons which learn parameters such as weights while inputs come through and reach outputs. CNN has layer structure which is best suited for image classification as it is comprised of convolutional layer for generating feature maps, pooling layer for reducing the dimensionality of feature maps, and fully-connected layer for classifying the extracted features. However, most of the classification models have been trained using online product image, which is taken under controlled situation such as apparel image itself or professional model wearing apparel. This image may not be an effective way to train the classification model considering the situation when one might want to classify street fashion image or walking image, which is taken in uncontrolled situation and involves people's movement and unexpected pose. Therefore, we propose to train the model with runway apparel image dataset which captures mobility. This will allow the classification model to be trained with far more variable data and enhance the adaptation with diverse query image. To achieve both convergence and generalization of the model, we apply Transfer Learning on our training network. As Transfer Learning in CNN is composed of pre-training and fine-tuning stages, we divide the training step into two. First, we pre-train our architecture with large-scale dataset, ImageNet dataset, which consists of 1.2 million images with 1000 categories including animals, plants, activities, materials, instrumentations, scenes, and foods. We use GoogLeNet for our main architecture as it has achieved great accuracy with efficiency in ImageNet Large Scale Visual Recognition Challenge (ILSVRC). Second, we fine-tune the network with our own runway image dataset. For the runway image dataset, we could not find any previously and publicly made dataset, so we collect the dataset from Google Image Search attaining 2426 images of 32 major fashion brands including Anna Molinari, Balenciaga, Balmain, Brioni, Burberry, Celine, Chanel, Chloe, Christian Dior, Cividini, Dolce and Gabbana, Emilio Pucci, Ermenegildo, Fendi, Giuliana Teso, Gucci, Issey Miyake, Kenzo, Leonard, Louis Vuitton, Marc Jacobs, Marni, Max Mara, Missoni, Moschino, Ralph Lauren, Roberto Cavalli, Sonia Rykiel, Stella McCartney, Valentino, Versace, and Yve Saint Laurent. We perform 10-folded experiments to consider the random generation of training data, and our proposed model has achieved accuracy of 67.2% on final test. Our research suggests several advantages over previous related studies as to our best knowledge, there haven't been any previous studies which trained the network for apparel image classification based on runway image dataset. We suggest the idea of training model with image capturing all the possible postures, which is denoted as mobility, by using our own runway apparel image dataset. Moreover, by applying Transfer Learning and using checkpoint and parameters provided by Tensorflow Slim, we could save time spent on training the classification model as taking 6 minutes per experiment to train the classifier. This model can be used in many business applications where the query image can be runway image, product image, or street fashion image. To be specific, runway query image can be used for mobile application service during fashion week to facilitate brand search, street style query image can be classified during fashion editorial task to classify and label the brand or style, and website query image can be processed by e-commerce multi-complex service providing item information or recommending similar item.
https://doi.org/10.13088/jiis.2018.24.3.001 인용 PDF KSCI

A Deep Learning Based Approach to Recognizing Accompanying Status of Smartphone Users Using Multimodal Data (스마트폰 다종 데이터를 활용한 딥러닝 기반의 사용자 동행 상태 인식)

Kim, Kilho;Choi, Sangwoo;Chae, Moon-jung;Park, Heewoong;Lee, Jaehong;Park, Jonghun
- Journal of Intelligence and Information Systems
- /
- v.25 no.1
- /
- pp.163-177
- /
- 2019
As smartphones are getting widely used, human activity recognition (HAR) tasks for recognizing personal activities of smartphone users with multimodal data have been actively studied recently. The research area is expanding from the recognition of the simple body movement of an individual user to the recognition of low-level behavior and high-level behavior. However, HAR tasks for recognizing interaction behavior with other people, such as whether the user is accompanying or communicating with someone else, have gotten less attention so far. And previous research for recognizing interaction behavior has usually depended on audio, Bluetooth, and Wi-Fi sensors, which are vulnerable to privacy issues and require much time to collect enough data. Whereas physical sensors including accelerometer, magnetic field and gyroscope sensors are less vulnerable to privacy issues and can collect a large amount of data within a short time. In this paper, a method for detecting accompanying status based on deep learning model by only using multimodal physical sensor data, such as an accelerometer, magnetic field and gyroscope, was proposed. The accompanying status was defined as a redefinition of a part of the user interaction behavior, including whether the user is accompanying with an acquaintance at a close distance and the user is actively communicating with the acquaintance. A framework based on convolutional neural networks (CNN) and long short-term memory (LSTM) recurrent networks for classifying accompanying and conversation was proposed. First, a data preprocessing method which consists of time synchronization of multimodal data from different physical sensors, data normalization and sequence data generation was introduced. We applied the nearest interpolation to synchronize the time of collected data from different sensors. Normalization was performed for each x, y, z axis value of the sensor data, and the sequence data was generated according to the sliding window method. Then, the sequence data became the input for CNN, where feature maps representing local dependencies of the original sequence are extracted. The CNN consisted of 3 convolutional layers and did not have a pooling layer to maintain the temporal information of the sequence data. Next, LSTM recurrent networks received the feature maps, learned long-term dependencies from them and extracted features. The LSTM recurrent networks consisted of two layers, each with 128 cells. Finally, the extracted features were used for classification by softmax classifier. The loss function of the model was cross entropy function and the weights of the model were randomly initialized on a normal distribution with an average of 0 and a standard deviation of 0.1. The model was trained using adaptive moment estimation (ADAM) optimization algorithm and the mini batch size was set to 128. We applied dropout to input values of the LSTM recurrent networks to prevent overfitting. The initial learning rate was set to 0.001, and it decreased exponentially by 0.99 at the end of each epoch training. An Android smartphone application was developed and released to collect data. We collected smartphone data for a total of 18 subjects. Using the data, the model classified accompanying and conversation by 98.74% and 98.83% accuracy each. Both the F1 score and accuracy of the model were higher than the F1 score and accuracy of the majority vote classifier, support vector machine, and deep recurrent neural network. In the future research, we will focus on more rigorous multimodal sensor data synchronization methods that minimize the time stamp differences. In addition, we will further study transfer learning method that enables transfer of trained models tailored to the training data to the evaluation data that follows a different distribution. It is expected that a model capable of exhibiting robust recognition performance against changes in data that is not considered in the model learning stage will be obtained.
https://doi.org/10.13088/jiis.2019.25.1.163 인용 PDF KSCI HTML

On MIMO OTA Performance Characterization Method for Mobile Devices with Multiple Antennas (MIMO 무선 성능 성능평가 방법에 관한 연구)

Cho, Y.S.;Kim, Y.R.;No, S.P.;Shim, H.J.;Kim, I.K.
- Journal of the Institute of Electronics Engineers of Korea TC
- /
- v.49 no.5
- /
- pp.84-90
- /
- 2012
Since the major cellular data service providers in U.S, Japan as well as in Korea started the LTE (Long Term Evolution) service, there has been more strong need for the methods that can accurately measure the MIMO (Multi Input-Multi Outout) OTA (Over The Air) performance of LTE handsets because the performance of the MIMO antenna determines the packet data rates in the downlink and therefore the higher system throughput of the network. In this regard, there has been a lot discussions in 3GPP on the candidate MIMO OTA test solutions. In this paper, a faire comparison has been done for the conventional method, the Envelop Correlation Coefficient (ECC) measurements, and the anechoic chamber based MIMO OTA test solution, one of the candidate system being discussed in 3GPP. The evaluations and the comparisons are conducted by numerically and experimentally.
PDF KSCI

A Study on Implementation of Writing Supporting System(ICWS) for Interactive Storytelling Contents (인터렉티브 스토리텔링 콘텐츠 저작지원도구 설계 및 구현에 관한 연구)

Lee, Eun Ryoung;Kim, Kio Chung
- Journal of Digital Convergence
- /
- v.11 no.2
- /
- pp.263-269
- /
- 2013
This research paper is applying Writing Supporting System on the previous research study about writing tool data model on interactive storytelling about family Story. Family story writing supporting system enables users to create text, images, videos and digital contents based on experimental knowledge collected from the first and second generations. The paper about studies on writing tool system on family story, aims to create documentary based high quality contents about each family members and family history. At the same time, overcome generation gaps and the lack of creation infrastructures. Throughout this process, the author will contribute to the expansion of creation devices which can be applied in other researches and writing tools.
https://doi.org/10.14400/JDPM.2013.11.2.263 인용 PDF

A Study on Infra-Technology of RCP Interaction System

Kim, Seung-Woo;Choe, Jae-Il
- 제어로봇시스템학회:학술대회논문집
- /
- 2004.08a
- /
- pp.1121-1125
- /
- 2004
The RT(Robot Technology) has been developed as the next generation of a future technology. According to the 2002 technical report from Mitsubishi R&D center, IT(Information Technology) and RT(Robotic Technology) fusion system will grow five times larger than the current IT market at the year 2015. Moreover, a recent IEEE report predicts that most people will have a robot in the next ten years. RCP(Robotic Cellular Phone), CP(Cellular Phone) having personal robot services, will be an intermediate hi-tech personal machine between one CP a person and one robot a person generations. RCP infra consists of $RCP^{Mobility}$, $RCP^{Interaction}$, $RCP^{Integration}$ technologies. For $RCP^{Mobility}$, human-friendly motion automation and personal service with walking and arming ability are developed. $RCP^{Interaction}$ ability is achieved by modeling an emotion-generating engine and $RCP^{Integration}$ that recognizes environmental and self conditions is developed. By joining intelligent algorithms and CP communication network with the three base modules, a RCP system is constructed. Especially, the RCP interaction system is really focused in this paper. The $RCP^{interaction}$(Robotic Cellular Phone for Interaction) is to be developed as an emotional model CP as shown in figure 1. $RCP^{interaction}$ refers to the sensitivity expression and the link technology of communication of the CP. It is interface technology between human and CP through various emotional models. The interactive emotion functions are designed through differing patterns of vibrator beat frequencies and a feeling system created by a smell injection switching control. As the music influences a person, one can feel a variety of emotion from the vibrator's beats, by converting musical chord frequencies into vibrator beat frequencies. So, this paper presents the definition, the basic theory and experiment results of the RCP interaction system. We confirm a good performance of the RCP interaction system through the experiment results.
PDF

Proteomic Analysis of Recombinant Saccharomyces cerevisiae upon Iron Deficiency Induced via Human H-Ferritin Production

Seo, Hyang-Yim;Chang, Yu-Jung;Chung, Yun-Jo;Kim, Kyung-Suk
- Journal of Microbiology and Biotechnology
- /
- v.18 no.8
- /
- pp.1368-1376
- /
- 2008
In our previous study, the expression of active H-ferritins in Saccharomyces cerevisiae was found to reduce cell growth and reactive oxygen species (ROS) generation upon exposure to oxidative stress; such expression enhanced that of high-affinity iron transport genes (FET3 and FTR1). The results suggested that the recombinant cells expressing H-ferritins induced cytosolic iron depletion. The present study analyzes metabolic changes under these circumstances via proteomic methods. The YGH2 yeast strain expressing A-ferritin, the YGH2-KG (E62K and H65G) mutant strain, and the YGT control strain were used. Comparative proteomic analysis showed that the synthesis of 34 proteins was at least stimulated in YGH2, whereas the other 37 proteins were repressed. Among these, the 31 major protein spots were analyzed via nano-LC/MS/MS. The increased proteins included major heat-shock proteins and proteins related to endoplasmic reticulum-associated degradation (ERAD). On the other hand, the proteins involved with folate metabolism, purine and methionine biosynthesis, and translation were reduced. In addition, we analyzed the insoluble protein fractions and identified the fragments of Idh1p and Pgk1p, as well as several ribosomal assembly-related proteins. This suggests that intracellular iron depletion induces imperfect translation of proteins. Although the proteins identified above result from changes in iron metabolism (i.e., iron deficiency), definitive evidence for iron-related proteins remains insufficient. Nevertheless, this study is the first to present a molecular model for iron deficiency, and the results may provide valuable information on the regulatory network of iron metabolism.
PDF KSCI

The Evaluation of Method for Computerization of Clinical Informations of the Patients of the Department of Thoracic and Cardiovascular Surgery - About the practical method of coding and standardization of the structure of the database file(DBF) - (흉부외과환자 임상정보의 전산화 방법에 대한 고찰;데이터베이스 파일(DBF) 구조의 표준화및 코딩화 방안에 대하여)

Song, U-Cheol;Kim, Byeong-Ju;Hong, Gi-U
- Journal of Chest Surgery
- /
- v.25 no.10
- /
- pp.989-1000
- /
- 1992
The concepts of modern type computer are so called "General purpose, stored program and digital computer" that is proposed by Charles Babbage. ENIAC, the initial operational electronic digital computer model, was produced in 1946. During the last 50 years, an epoch-making development of the personal computer was marked. The computerization of all levels of society is going on and also computerization of the general hospital and medical college is developing. But patient data management system for clinician is not used generally. We suggest the use of computer aided data management application programs for the clinical informations of the patients of the Department of Thoracic and Cardiovascular Surgery for better management and to make best of medical informations, to co-operate with the current of this times, and to prepare against the Hospital Information Systems[HIS], actively. Also, we suggest to standardize the format and structure of database files to store the clinical data of the patients By standardization of the database files, we can integrate and relate the data of the individual department or hospital, build up the regional or national statistics of the patients easily, and promote the generation of application programs. The medical network by the communication and computer would be utilized to collect the database files. And finally, we suggest the use of code system to input and search the informations about the diagnosis and operation such as the code system of International Classfication of Disease[WHO] and the table of the classfication of operation of the Ministry of Health and Social Affairs, Korea. In this article, we tried to show the new standards, the essential items for computerization of clinical informations of the patients of the Department of Thoracic and Cardiovascular Surgery.r Surgery.
PDF

Exergy analysis on the power recovery of LNG supply system (냉열 에너지의 동력 회수에 대한 엑서지 해석 방법에 관한 연구)

Park, Il-Hwan;Kim, Choon-Seong
- The Journal of Korean Institute for Practical Engineering Education
- /
- v.3 no.1
- /
- pp.9-14
- /
- 2011
The expansion work that is wasted through the irreversible expansion through the PC valve of decompression process of the natural gas governor station can be recovered by replacing the process by an isentropic expansion. The energy and exergy analyses for the two decompression process models of power producing and current decompression process model are presented. Analysis results showed that the exergy by gas supply is 56.29%, the exergy by producing power is 32.12 % in case of preheating system and 22.52% in case of non-preheating system. The dead exergy at the PCV is generated much more network. As these results, the usefulness of exergy analysis is verified.
PDF

A Study on the Overseas Expansion Strategy of u-City based on BIM/GIS (BIM/GIS 기반 u-City 해외진출 전략 연구)

Na, Joon Yeop;Lee, Woo Sik;Hong, Chang Hee;Hwang, Jung Rae
- Spatial Information Research
- /
- v.20 no.6
- /
- pp.119-127
- /
- 2012
u-City is next generation city which can innovate functions of city. It can realize increase of convenience, improvement of life quality and safety guarantee by convergence of information technologies and ubiquitous service with urban space. Market of u-City is in range of rapid growth and u-City can make enormous synergy effects by accompanying construction technologies with spatial information, sensor technologies, communications network and related equipments. In this study, we analyzed the domestic/abroad status, researches and element technologies involved in u-City. And, we suggested overseas expansion strategy of u-City such as selection and analysis of target nations, packaging method of u-City service models and application of BIM/GIS connection technologies in terms of u-City construction and operation.
https://doi.org/10.12672/ksis.2012.20.6.119 인용 PDF KSCI

Development on Public Participation GIS 2.0 Application Based on Google Map for Android Smart-phones (구글맵기반 사용자 참여형 안드로이드폰 GIS 2.0 응용프로그램 개발)

Kim, Byeong-Su;Kim, Jong-Hoon
- The Journal of Korean Association of Computer Education
- /
- v.14 no.4
- /
- pp.11-20
- /
- 2011
Recently, the interests of mobile GIS(Geographic Information System) technology is increasing with the spread of wireless network, the improvement of mobile device's performances and the growth of demands about mobile services. If we refer mobile GIS system classified as 3rd generation with the success of Web 2.0, it is suggested that supplies of information from the server side couldn't enhance users' participations and the usage of service. In this study, we suggested the model of GIS 2.0 application based on Google map for Android smart-phones, which users could make, save and share information using photos and text by themselves. We were able to grasp the expectation of GIS 2.0 by expert assessment of this application and got to the core of conditions and implications for the success of GIS 2.0.
PDF

Search Result 619, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)