Search | Korea Science

The way to make training data for deep learning model to recognize keywords in product catalog image at E-commerce (온라인 쇼핑몰에서 상품 설명 이미지 내의 키워드 인식을 위한 딥러닝 훈련 데이터 자동 생성 방안)

Kim, Kitae;Oh, Wonseok;Lim, Geunwon;Cha, Eunwoo;Shin, Minyoung;Kim, Jongwoo
- Journal of Intelligence and Information Systems
- /
- v.24 no.1
- /
- pp.1-23
- /
- 2018
From the 21st century, various high-quality services have come up with the growth of the internet or 'Information and Communication Technologies'. Especially, the scale of E-commerce industry in which Amazon and E-bay are standing out is exploding in a large way. As E-commerce grows, Customers could get what they want to buy easily while comparing various products because more products have been registered at online shopping malls. However, a problem has arisen with the growth of E-commerce. As too many products have been registered, it has become difficult for customers to search what they really need in the flood of products. When customers search for desired products with a generalized keyword, too many products have come out as a result. On the contrary, few products have been searched if customers type in details of products because concrete product-attributes have been registered rarely. In this situation, recognizing texts in images automatically with a machine can be a solution. Because bulk of product details are written in catalogs as image format, most of product information are not searched with text inputs in the current text-based searching system. It means if information in images can be converted to text format, customers can search products with product-details, which make them shop more conveniently. There are various existing OCR(Optical Character Recognition) programs which can recognize texts in images. But existing OCR programs are hard to be applied to catalog because they have problems in recognizing texts in certain circumstances, like texts are not big enough or fonts are not consistent. Therefore, this research suggests the way to recognize keywords in catalog with the Deep Learning algorithm which is state of the art in image-recognition area from 2010s. Single Shot Multibox Detector(SSD), which is a credited model for object-detection performance, can be used with structures re-designed to take into account the difference of text from object. But there is an issue that SSD model needs a lot of labeled-train data to be trained, because of the characteristic of deep learning algorithms, that it should be trained by supervised-learning. To collect data, we can try labelling location and classification information to texts in catalog manually. But if data are collected manually, many problems would come up. Some keywords would be missed because human can make mistakes while labelling train data. And it becomes too time-consuming to collect train data considering the scale of data needed or costly if a lot of workers are hired to shorten the time. Furthermore, if some specific keywords are needed to be trained, searching images that have the words would be difficult, as well. To solve the data issue, this research developed a program which create train data automatically. This program can make images which have various keywords and pictures like catalog and save location-information of keywords at the same time. With this program, not only data can be collected efficiently, but also the performance of SSD model becomes better. The SSD model recorded 81.99% of recognition rate with 20,000 data created by the program. Moreover, this research had an efficiency test of SSD model according to data differences to analyze what feature of data exert influence upon the performance of recognizing texts in images. As a result, it is figured out that the number of labeled keywords, the addition of overlapped keyword label, the existence of keywords that is not labeled, the spaces among keywords and the differences of background images are related to the performance of SSD model. This test can lead performance improvement of SSD model or other text-recognizing machine based on deep learning algorithm with high-quality data. SSD model which is re-designed to recognize texts in images and the program developed for creating train data are expected to contribute to improvement of searching system in E-commerce. Suppliers can put less time to register keywords for products and customers can search products with product-details which is written on the catalog.
https://doi.org/10.13088/jiis.2018.24.1.001 인용 PDF KSCI

Reproducibility of Hemispheric Language Dominance by Noun, Verb, Adjective and Adverb Generation Paradigms in Functional Magnetic Resonance Imaging of Normal Volunteers (정상성인의 뇌기능적 자기공명영상에서 명사, 동사, 형용사 그리고 부사 만들기 과제들에 대한 언어영역편재화의 재현성에 관한 연구)

In Chan Song;Kee Hyun Chang;Chun Kee Chung;Sang Hyun Lee;Moon Hee Han
- Investigative Magnetic Resonance Imaging
- /
- v.5 no.1
- /
- pp.24-32
- /
- 2001
Purpose : We investigated the reproducibility of language lateralization by 4 different word generation paradigms or the rest contents in each paradigm using functional magnetic resonance imaging in normal volunteers Materials and Methods Nine normal volunteers with left-handedness (mean age: 25 yrs) were examined on a 1.57 MR unit using a single-shot gradient echo epibold sequence. Four different word generation paradigms of noun, verb, adjective and adverb were used in each normal volunteer for investigating language system. In each paradigm, two different rest contents consisted of only seeing the " +" symbol or reading the meaningless letters. Each task consisted of 96 phases including 3 activations and 6 rests of 2 different contents. Two activation maps in one task were obtained under two different rest contents using the correlation method. We evaluated the detection rates of Broca and Wernicke areas and the differences of language lateralization among four different word generation paradigms, or between the rest contents. Results : The detection rates of Broca and Wernicke areas were over 67 % in 4 different language paradigms and there was no significant difference of them among language paradigms, or between two different rest contents. Language dominances, in all 4 different language paradigms, were shown to be consistent in 66 %, but were contrary with language paradigms in some subjects. The rest contents made no significant effect on dominant language dominance determination, but the success rates of the dominant language dominances determined from 4 language paradigms were higher in reading the meaningless letter (100%, n=9) than in only seeing "+" on screen at the rest task (78%, n=7).
PDF

Mutagenicity Study of DA-3030, A New Recombinant Human G-CSF(rhG-CSF) (새로운 재조합 인 과립구 콜로니 자극인자 DA-3030의 변이원성연구)

강경구;최성학;김옥진;안병옥;백남기;김계원;김원배;양중익
- Biomolecules & Therapeutics
- /
- v.2 no.3
- /
- pp.286-291
- /
- 1994
The mutagenicity of DA-3030(rhG-CSF)was studied by reverse mutation test, chromosome aberration test and micronucleus test. The reverse mutatuon test in bacteria was performed using salmonella typhimurium strain TA100, TA98, TA1535 and TA1537 with rhG-CSF in any of the concentrations(150, 75, 37.5, 18.75, 9.375 and 4,6875 $\mu\textrm{g}$/plate), no increase in the number of revertant colonies in each strain was observed, irrespective of treatment with the metabolic activation system(S-9 mix) The chromosome aberration test was carried out using CHL cells, cell line from chinese hamster lung. With 4 doses(75, 37.5, 18.75 and 9.375 $\mu\textrm{g}$/ml) of rhG-/CSF the cells were treated for 24 or 48 hours in the direct method or for 6 hours followed by 18 hour-expression time in the metabolic activation method. Results of the study showed, by the direct method or metabolic activation method, no trend toward increase in the number of aberrant metaphase. The micronucleus test was carried out using ICR mice at the age of 8 weeks. Three doses(862.5, 1725 and 3450 $\mu\textrm{g}$/kg) of DA-3030 were admintstered intraperitoneally with single shot and bone marrow cells were sampled at 24 hours after administration. Neither the number of polychromatic erythrocytes with micronuclei nor the ratio of normochromatic erythrocytes to polychromatic erythrocytes increased singinficantly in each dose, compared with a vehicle control. These results indicate that rhG-CSF has not mutagenic potential under the condiions.
PDF

Cerebrum Lateralization by Area based on the Intensity of BOLD Signal during Cognitive Performance (인지 기능 수행 시 BOLD 신호 크기에 기반 한 영역별 대뇌 편측화)

Chung Soon Cheol;Shon Jin Hun;Kim Ik Hyeon;Lee Soo Yeol
- Journal of the Korean Society for Precision Engineering
- /
- v.22 no.1
- /
- pp.183-192
- /
- 2005
This study compared cerebral lateralization index based on the area of neural activation with that based on the intensity of neural activation. For this purpose, 8 right-handed male college students (the mean age - 23.5 years) and 10 right-handed male college students (the mean age - 25.1 years) participated respectively in researches on visuospatial and verbal task brain function. Functional brain images were taken from 3T MRI using the single-shot EPI method. The result of measuring cerebral lateralization index based on the area of neural activation suggested that the right hemisphere is dominant in visuospatial tasks and the left one is in verbal tasks. However, the dominance is not sufficient to locate the exact part of the brain for these tasks. When cerebral lateralization index was computed based on the intensity of neural activation, it was derived that the area of cerebral lateralization closely related to visuospatial tasks is the superior parietal lobe, and the area of cerebral lateralization closely related to verbal tasks is the inferior and middle frontal lobes. Thus, cerebral lateralization index by area based on the intensity of neural activation as proposed by this study can determine the dominance of the cerebrum by area, so is helpful for accurate and quantitative determination of cerebral lateralization.
PDF KSCI

The Feasibility of Event-Related Functional Magnetic Resonance Imaging of Power Hand Grip Task for Studying the Motor System in Normal Volunteers; Comparison with Finger Tapping Task

Song, In-Chan;Chang, Kee-Hyun;Han, Moon-Hee
- Proceedings of the KSMRM Conference
- /
- 2001.11a
- /
- pp.111-111
- /
- 2001
목적： To evaluate the feasibility of the event-related functional MR study using power grip studying the hand motor system 대상 및 방법： Event-related functional MRI was performed on a 1.5T MR unit in seven norm volunteers (man=7, right-handedness=2, left-handedness=5, mean age： 25 years). A single-shot GRE-EPI sequence (TR/TE/flip angle： 1000ms/40ms/90, FOV = 240 mm matrix= 64$\times$64, slice thickness/gap = 5mm/0mm, 7 true axial slices) was used for functiona MR images. A flow-sensitive conventional gradient echo sequence (TR/TE/flip angl 50ms/4ms/60) was used for high-resolution anatomical images. To minimize the gross hea motion, neck-holders (MJ-200, USA) were used. A series of MR images were obtained in axial planes covering motor areas. To exclude motion-corrupted images, all MR images wer surveyed in a movie procedure and evaluated using the estimation of center of mass of ima signal intensities. Power grip task consisted of the powerful grip of all right fingers and hand movement ta used very fast right finger tapping at a speed of 3 per 1 second. All tasks were visual-guid by LCD projector (SHARP, Japan). Two tasks consisted of 134 phases including 7 activatio and 8 rest periods. Active stimulations were performed during 2 seconds and rest period were 15 seconds and total scan time per one task was 2 min 14 sec. Statistical maps we obtained using cross-correlation method. Reference vector was time-shifted by 4 seconds an Gaussian convolution with a FWHM of 4 seconds was applied to it. The threshold in p val for the activation sites was set to be 0.001. All mapping procedures were peformed usin homemade program an IDL (Research Systems Inc., USA) platform. We evaluated the activation patterns of the motor system of power grip compared to hand movement in t event-related functional MRI.
PDF

Comparative Proteomic Profiling of Pancreatic Ductal Adenocarcinoma Cell Lines

Kim, Yikwon;Han, Dohyun;Min, Hophil;Jin, Jonghwa;Yi, Eugene C.;Kim, Youngsoo
- Molecules and Cells
- /
- v.37 no.12
- /
- pp.888-898
- /
- 2014
Pancreatic cancer is one of the most fatal cancers and is associated with limited diagnostic and therapeutic modalities. Currently, gemcitabine is the only effective drug and represents the preferred first-line treatment for chemotherapy. However, a high level of intrinsic or acquired resistance of pancreatic cancer to gemcitabine can contribute to the failure of gemcitabine treatment. To investigate the underlying molecular mechanisms for gemcitabine resistance in pancreatic cancer, we performed label-free quantification of protein expression in intrinsic gemcitabine-resistant and -sensitive human pancreatic adenocarcinoma cell lines using our improved proteomic strategy, combined with filter-aided sample preparation, single-shot liquid chromatography-mass spectrometry, enhanced spectral counting, and a statistical method based on a power law global error model. We identified 1931 proteins and quantified 787 differentially expressed proteins in the BxPC3, PANC-1, and HPDE cell lines. Bioinformatics analysis identified 15 epithelial to mesenchymal transition (EMT) markers and 13 EMT-related proteins that were closely associated with drug resistance were differentially expressed. Interestingly, 8 of these proteins were involved in glutathione and cysteine/methionine metabolism. These results suggest that proteins related to the EMT and glutathione metabolism play important roles in the development of intrinsic gemcitabine resistance by pancreatic cancer cell lines.
https://doi.org/10.14348/molcells.2014.0207 인용 PDF KSCI

Study on Enhancements to Ultrasonic Data Imaging Using Full Matrix Capture Technique (Full Matrix Capture 기법을 통한 초음파신호 영상화 향상 연구)

Lee, Tae-Hun;Yoon, Byung-Sik;Lee, Jeong-Seok
- Journal of the Korean Society for Nondestructive Testing
- /
- v.35 no.5
- /
- pp.299-306
- /
- 2015
A conventional phased array system can control an ultrasonic beam electronically by adjusting the excitation time delay of individual elements in a multi-element probe and produce an ultrasonic image. In Contrast, full matrix capture (FMC) is a data acquisition process that allows receiving ultrasonic signals from one single shot of the phased array transducer element through all the other elements and captures the complete dataset from every possible transmit-receive combination. This FMC data can be used to create the ultrasonic image in post processing. It is possible to produce not only images equivalent to conventional phased array image but also total focusing method (TFM) images with improved resolution and sharpness, which is virtually focused at any point in a region of interest. In this paper, the system that can perform FMC by using a conventional phased array instrument is developed, and a study was conducted on the imaging algorithms to reconstruct sector B-scan and TFM images from FMC dataset.
https://doi.org/10.7779/JKSNT.2015.35.5.299 인용 PDF KSCI

A Tile-Image Merging Algorithm of Tiled-Display Recorder using Time-stamp (타임 스탬프를 이용한 타일드 디스플레이 기록기의 타일 영상 병합 알고리즘)

Choe, Gi-Seok;Nang, Jong-Ho
- Journal of KIISE:Computer Systems and Theory
- /
- v.36 no.5
- /
- pp.327-334
- /
- 2009
The tiled-display system provides a high resolution display which can be used in different applications in co-working area. The systems used in the co-working field usually save the user logs, and these log information not only makes the maintenance of the tiled-display system easier, but also can be used to check the progress of the co-working. There are three main steps in the proposed tiled display log recorder. The first step is to capture the screen shots of the tiles and send them for merging. The second step is to merge the captured tile images to form a single screen shot of the tiled-display. The final step is to encode the merged tile images to make a compressed video stream. This video stream could be stored for the logs of co-working or be streamed to remote users. Since there could be differences in capturing time of tile images, the quality of merged tiled-display could be degraded. This paper proposes a time stamp-based metric to evaluate the quality of the video stream, and a merging algorithm that could upgrade the quality of the video stream with respect to the proposed quality metrics.
PDF KSCI

A Survey on the Magnitude of the Sound, Ground Vibration and Properly Delayed Interval of a Plasma Rock-Splitting Machine driven by Electric Shocks (플라즈마 지발 전력충격파암기의 적정 지발시차 및 진동과 소음크기 고찰)

Won, Yeon-Ho;Kang, Choo-Won;Kim, Il-Jung
- Explosives and Blasting
- /
- v.27 no.1
- /
- pp.7-20
- /
- 2009
In this study, 5 steps of different delay intervals are applied to a plasma rock-breaking machine that is driven by electric shocks in order to improve the workability of the traditional single-shot type plasma rock-breaking operation. The sequential steps use the electrolyte volume per delay of 1, 2, 3, 4, 5 kg and it has been analyzed to measure the delay time and level of the ground vibration and noise according to exploding. The delay time of the rock-breaking machine by an electric shock of 5 steps has used about 40~50ms at the electrolyte connected from 1 to 3 holes, about 70~80ms at the electrolyte connected from 4 to 5 holes. It is identified that the extents of the ground vibration is low to 1 over 3~6 compared with that of the emulsion explosives.
PDF KSCI

Quantitative Measurement of Soot concentration by Two-Wavelength Correction of Laser-Induced Incandescence Signals (2파장 보정 Laser-Induced Incandescence 법을 이용한 매연 농도 측정)

정종수
- Transactions of the Korean Society of Automotive Engineers
- /
- v.5 no.3
- /
- pp.54-65
- /
- 1997
To quantify the LII signals from soot particle of flames in diesel engine cylinder, a new method has been proposed for correcting LII signal attenuated by soot particles between the measuring point and the detector. It has been verified by an experiment on a laminar jet ethylene-air diffusion flame. Being proportional to the attenuation, the ratio of LII signal at two different detection wavelengths can be used to correct the measured LIIsignal and obtain the unattenuated LII signal, from which the soot volume fraction in the flame can be estimated. Both the 1064-nm and frequency-doubled 532-nm beams from the Nd : YAG laser are used. Single-shot, one-dimensional(1-D) line images are recorded on the intensified CCD camera, with the rectangular-profile laser beam using 1-mm-diameter pinhole. Two broadband optical interference filters having the center wavelengths of 647 nm and 400 nm respectively and a bandwidth of 10 nm are used. This two-wavelength correction has been applied to the ethylene-air coannular laminar diffusion flame, previously studied on soot formation by the laser extinction method in this laboratory. The results by the LII measurement technique and the conventional laser extinction method at the height of 40 nm above the jet exit agreed well with each other except around outside of the peaks of soot concentration, where the soot concentration was relatively high and resulting attenuation of the LII signal was large. The radial profile shape of soot concentration was not changed a lot, but the absolute value of the soot volume fraction around outside edge changed from 4ppm to 6.5 ppm at r=2.8mm after correction. This means that the attenuation of LII signal was approximately 40% at this point, which is higher than the average attenuation rate of this flame, 10~15%.
PDF

Search Result 234, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)