• 제목/요약/키워드: vision recognition

Search Result 1,037, Processing Time 0.027 seconds

Development of compound eye image quality improvement based on ESRGAN (ESRGAN 기반의 복안영상 품질 향상 알고리즘 개발)

  • Taeyoon Lim;Yongjin Jo;Seokhaeng Heo;Jaekwan Ryu
    • Journal of the Korea Computer Graphics Society
    • /
    • v.30 no.2
    • /
    • pp.11-19
    • /
    • 2024
  • Demand for small biomimetic robots that can carry out reconnaissance missions without being exposed to the enemy in underground spaces and narrow passages is increasing in order to increase the fighting power and survivability of soldiers in wartime situations. A small compound eye image sensor for environmental recognition has advantages such as small size, low aberration, wide angle of view, depth estimation, and HDR that can be used in various ways in the field of vision. However, due to the small lens size, the resolution is low, and the problem of resolution in the fused image obtained from the actual compound eye image occurs. This paper proposes a compound eye image quality enhancement algorithm based on Image Enhancement and ESRGAN to overcome the problem of low resolution. If the proposed algorithm is applied to compound eye image fusion images, image resolution and image quality can be improved, so it is expected that performance improvement results can be obtained in various studies using compound eye cameras.

Inexpensive Visual Motion Data Glove for Human-Computer Interface Via Hand Gesture Recognition (손 동작 인식을 통한 인간 - 컴퓨터 인터페이스용 저가형 비주얼 모션 데이터 글러브)

  • Han, Young-Mo
    • The KIPS Transactions:PartB
    • /
    • v.16B no.5
    • /
    • pp.341-346
    • /
    • 2009
  • The motion data glove is a representative human-computer interaction tool that inputs human hand gestures to computers by measuring their motions. The motion data glove is essential equipment used for new computer technologiesincluding home automation, virtual reality, biometrics, motion capture. For its popular usage, this paper attempts to develop an inexpensive visual.type motion data glove that can be used without any special equipment. The proposed approach has the special feature; it can be developed as a low-cost one becauseof not using high-cost motion-sensing fibers that were used in the conventional approaches. That makes its easy production and popular use possible. This approach adopts a visual method that is obtained by improving conventional optic motion capture technology, instead of mechanical method using motion-sensing fibers. Compared to conventional visual methods, the proposed method has the following advantages and originalities Firstly, conventional visual methods use many cameras and equipments to reconstruct 3D pose with eliminating occlusions But the proposed method adopts a mono vision approachthat makes simple and low cost equipments possible. Secondly, conventional mono vision methods have difficulty in reconstructing 3D pose of occluded parts in images because they have weak points about occlusions. But the proposed approach can reconstruct occluded parts in images by using originally designed thin-bar-shaped optic indicators. Thirdly, many cases of conventional methods use nonlinear numerical computation image analysis algorithm, so they have inconvenience about their initialization and computation times. But the proposed method improves these inconveniences by using a closed-form image analysis algorithm that is obtained from original formulation. Fourthly, many cases of conventional closed-form algorithms use approximations in their formulations processes, so they have disadvantages of low accuracy and confined applications due to singularities. But the proposed method improves these disadvantages by original formulation techniques where a closed-form algorithm is derived by using exponential-form twist coordinates, instead of using approximations or local parameterizations such as Euler angels.

Survey on a Disposal Method of Contact Lenses after Use (콘택트렌즈 사용 후 폐기처분에 대한 실태 조사)

  • Park, Il-nam;Kwon, Min-sun;Park, Ji-woong;Lee, Ki-Seok;Jung, Mi-A;Lee, Hae-Jung
    • The Korean Journal of Vision Science
    • /
    • v.20 no.4
    • /
    • pp.553-560
    • /
    • 2018
  • Purpose : To investigate a disposal method of disposing contact lenses and the recognition of environmental pollution by micro plastics which may be caused by the wrong disposal method of domestic contact lens wearers. Methods : Two hundred sixty one adults(124 males, 137 females, mean age $21.48{\pm}3.14years$) were participated in this study. They were given the questionnaire survey on contact lenses purchasing place, type of contact lenses, duration of wearing contact lenses, the disposal method of disposing contact lenses and the recognition of the occurrence of environmental pollution. Results : It appeared that eyeglass shop(50.0%) and contact lens shop(48.3%) were the main purchasing places, and the most common type of contact lenses were disposable lenses(38.5%) and daily wearing lenses(52.5%). On the duration of wearing contact lenses they answered more than 5 years(29.3%), less than 1 year (26.0%), less than 1 year to less than 3 years (26.0%), and on wearing a contact lens during a week they did 1-2 days (32.0%), 1 week (28.0%), 5-6 days (22.4%) and 3-4 days (17.6%). It was shown "no(78.3%)" and "yes(21.7%)" to the questionnaire of whether they received information or education about a disposal method at the place where the contact lens was purchased, and "no(87.5%)" and "yes(12.5%)" to the questionnaire of whether they received information or education from schools, public institutions or public media such as the internet. As for the disposal methods, landfill waste(45.6%), recycled garbage(29.6%), and drainage(16.8%) from the sink or toilet responded in order. Although men were more educated and informed about disposal than women (t=3.63189, p<0.00001), women were more aware of environmental pollution(t=2.44269, p=0.01605). Conclusion : In order to reduce the environmental pollution issue caused by the contact lens which does not decompose at the sewage treatment facility and become micro plastics, it is urgent to provide information about correct disposal methods after using contact lenses and to educate contact lens wearers.

A Study Concerning the Background of Formation in Deleuze's System (들뢰즈 체계의 형성 배경에 대한 연구 - 칸트 선험철학 체계 그 심연으로부터의 역류 -)

  • Kim, Dae-hyeon
    • Journal of the Daesoon Academy of Sciences
    • /
    • v.37
    • /
    • pp.329-355
    • /
    • 2021
  • The objective of this paper is to reveal that the formation of Deleuze's system is a result of a back flow of the 'ideal of pure reason' in Kant's system. I will try to seize upon the keyword in his main book, Difference and Repetition, and examine the aspect of mutual transformation between Deleuze's transcendental empiricism and Kant's transcendentalism. When analyzing Deleuze's system, most researchers tend to focus on anti-Hegelianism, but it is proper that Kant be adopted as the start when tracing the way of deployment directly. Fundamentally, Deleuze is different from Hegel in his approach to observing entire ground of thought. Even if Deleuze surely has the capability of becoming in the dialectical context, his systemic environment wherein dialectics is applied is different even at the onset. While Hegel follows the way of origin and copy or a system that begins from a preceding point of origin, Deleuze follows a way of copy and recopy or a system that begins without a point of origin. This characteristic of Deleuze's system originates directly from idealistic play. In fact, we can anticipate and identify in his book that he refers to Kant who accepted the tradition of empiricism. Therefore, the main contents of this paper is to present an overview of Kant's influence on Deleuze's system. While tracing ideas back to Kant's system, the cohabitation of empiricism and rationalism, which Kant felicitously revoiced, there emerges a definitude of world recognition. This occurs through cohabitation, and this is both deconstructed and integrated by Deleuze, and therein definitude is turned into a vision of prosperity. To the vision of prosperity that spans definitude to recognition, a philosopher has the right to select a philosophical system because selection methodology in philosophy is not a problem of legitimacy so much as the needs of the times. Deleuze's choice resulted in the opening of pandora's box in an abyss and secret contents have in turn risen sharply.

Deep Learning OCR based document processing platform and its application in financial domain (금융 특화 딥러닝 광학문자인식 기반 문서 처리 플랫폼 구축 및 금융권 내 활용)

  • Dongyoung Kim;Doohyung Kim;Myungsung Kwak;Hyunsoo Son;Dongwon Sohn;Mingi Lim;Yeji Shin;Hyeonjung Lee;Chandong Park;Mihyang Kim;Dongwon Choi
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.143-174
    • /
    • 2023
  • With the development of deep learning technologies, Artificial Intelligence powered Optical Character Recognition (AI-OCR) has evolved to read multiple languages from various forms of images accurately. For the financial industry, where a large number of diverse documents are processed through manpower, the potential for using AI-OCR is great. In this study, we present a configuration and a design of an AI-OCR modality for use in the financial industry and discuss the platform construction with application cases. Since the use of financial domain data is prohibited under the Personal Information Protection Act, we developed a deep learning-based data generation approach and used it to train the AI-OCR models. The AI-OCR models are trained for image preprocessing, text recognition, and language processing and are configured as a microservice architected platform to process a broad variety of documents. We have demonstrated the AI-OCR platform by applying it to financial domain tasks of document sorting, document verification, and typing assistance The demonstrations confirm the increasing work efficiency and conveniences.

Deep Learning Architectures and Applications (딥러닝의 모형과 응용사례)

  • Ahn, SungMahn
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.127-142
    • /
    • 2016
  • Deep learning model is a kind of neural networks that allows multiple hidden layers. There are various deep learning architectures such as convolutional neural networks, deep belief networks and recurrent neural networks. Those have been applied to fields like computer vision, automatic speech recognition, natural language processing, audio recognition and bioinformatics where they have been shown to produce state-of-the-art results on various tasks. Among those architectures, convolutional neural networks and recurrent neural networks are classified as the supervised learning model. And in recent years, those supervised learning models have gained more popularity than unsupervised learning models such as deep belief networks, because supervised learning models have shown fashionable applications in such fields mentioned above. Deep learning models can be trained with backpropagation algorithm. Backpropagation is an abbreviation for "backward propagation of errors" and a common method of training artificial neural networks used in conjunction with an optimization method such as gradient descent. The method calculates the gradient of an error function with respect to all the weights in the network. The gradient is fed to the optimization method which in turn uses it to update the weights, in an attempt to minimize the error function. Convolutional neural networks use a special architecture which is particularly well-adapted to classify images. Using this architecture makes convolutional networks fast to train. This, in turn, helps us train deep, muti-layer networks, which are very good at classifying images. These days, deep convolutional networks are used in most neural networks for image recognition. Convolutional neural networks use three basic ideas: local receptive fields, shared weights, and pooling. By local receptive fields, we mean that each neuron in the first(or any) hidden layer will be connected to a small region of the input(or previous layer's) neurons. Shared weights mean that we're going to use the same weights and bias for each of the local receptive field. This means that all the neurons in the hidden layer detect exactly the same feature, just at different locations in the input image. In addition to the convolutional layers just described, convolutional neural networks also contain pooling layers. Pooling layers are usually used immediately after convolutional layers. What the pooling layers do is to simplify the information in the output from the convolutional layer. Recent convolutional network architectures have 10 to 20 hidden layers and billions of connections between units. Training deep learning networks has taken weeks several years ago, but thanks to progress in GPU and algorithm enhancement, training time has reduced to several hours. Neural networks with time-varying behavior are known as recurrent neural networks or RNNs. A recurrent neural network is a class of artificial neural network where connections between units form a directed cycle. This creates an internal state of the network which allows it to exhibit dynamic temporal behavior. Unlike feedforward neural networks, RNNs can use their internal memory to process arbitrary sequences of inputs. Early RNN models turned out to be very difficult to train, harder even than deep feedforward networks. The reason is the unstable gradient problem such as vanishing gradient and exploding gradient. The gradient can get smaller and smaller as it is propagated back through layers. This makes learning in early layers extremely slow. The problem actually gets worse in RNNs, since gradients aren't just propagated backward through layers, they're propagated backward through time. If the network runs for a long time, that can make the gradient extremely unstable and hard to learn from. It has been possible to incorporate an idea known as long short-term memory units (LSTMs) into RNNs. LSTMs make it much easier to get good results when training RNNs, and many recent papers make use of LSTMs or related ideas.

Enhancing the performance of the facial keypoint detection model by improving the quality of low-resolution facial images (저화질 안면 이미지의 화질 개선를 통한 안면 특징점 검출 모델의 성능 향상)

  • KyoungOok Lee;Yejin Lee;Jonghyuk Park
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.2
    • /
    • pp.171-187
    • /
    • 2023
  • When a person's face is recognized through a recording device such as a low-pixel surveillance camera, it is difficult to capture the face due to low image quality. In situations where it is difficult to recognize a person's face, problems such as not being able to identify a criminal suspect or a missing person may occur. Existing studies on face recognition used refined datasets, so the performance could not be measured in various environments. Therefore, to solve the problem of poor face recognition performance in low-quality images, this paper proposes a method to generate high-quality images by performing image quality improvement on low-quality facial images considering various environments, and then improve the performance of facial feature point detection. To confirm the practical applicability of the proposed architecture, an experiment was conducted by selecting a data set in which people appear relatively small in the entire image. In addition, by choosing a facial image dataset considering the mask-wearing situation, the possibility of expanding to real problems was explored. As a result of measuring the performance of the feature point detection model by improving the image quality of the face image, it was confirmed that the face detection after improvement was enhanced by an average of 3.47 times in the case of images without a mask and 9.92 times in the case of wearing a mask. It was confirmed that the RMSE for facial feature points decreased by an average of 8.49 times when wearing a mask and by an average of 2.02 times when not wearing a mask. Therefore, it was possible to verify the applicability of the proposed method by increasing the recognition rate for facial images captured in low quality through image quality improvement.

The Audience Behavior-based Emotion Prediction Model for Personalized Service (고객 맞춤형 서비스를 위한 관객 행동 기반 감정예측모형)

  • Ryoo, Eun Chung;Ahn, Hyunchul;Kim, Jae Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.73-85
    • /
    • 2013
  • Nowadays, in today's information society, the importance of the knowledge service using the information to creative value is getting higher day by day. In addition, depending on the development of IT technology, it is ease to collect and use information. Also, many companies actively use customer information to marketing in a variety of industries. Into the 21st century, companies have been actively using the culture arts to manage corporate image and marketing closely linked to their commercial interests. But, it is difficult that companies attract or maintain consumer's interest through their technology. For that reason, it is trend to perform cultural activities for tool of differentiation over many firms. Many firms used the customer's experience to new marketing strategy in order to effectively respond to competitive market. Accordingly, it is emerging rapidly that the necessity of personalized service to provide a new experience for people based on the personal profile information that contains the characteristics of the individual. Like this, personalized service using customer's individual profile information such as language, symbols, behavior, and emotions is very important today. Through this, we will be able to judge interaction between people and content and to maximize customer's experience and satisfaction. There are various relative works provide customer-centered service. Specially, emotion recognition research is emerging recently. Existing researches experienced emotion recognition using mostly bio-signal. Most of researches are voice and face studies that have great emotional changes. However, there are several difficulties to predict people's emotion caused by limitation of equipment and service environments. So, in this paper, we develop emotion prediction model based on vision-based interface to overcome existing limitations. Emotion recognition research based on people's gesture and posture has been processed by several researchers. This paper developed a model that recognizes people's emotional states through body gesture and posture using difference image method. And we found optimization validation model for four kinds of emotions' prediction. A proposed model purposed to automatically determine and predict 4 human emotions (Sadness, Surprise, Joy, and Disgust). To build up the model, event booth was installed in the KOCCA's lobby and we provided some proper stimulative movie to collect their body gesture and posture as the change of emotions. And then, we extracted body movements using difference image method. And we revised people data to build proposed model through neural network. The proposed model for emotion prediction used 3 type time-frame sets (20 frames, 30 frames, and 40 frames). And then, we adopted the model which has best performance compared with other models.' Before build three kinds of models, the entire 97 data set were divided into three data sets of learning, test, and validation set. The proposed model for emotion prediction was constructed using artificial neural network. In this paper, we used the back-propagation algorithm as a learning method, and set learning rate to 10%, momentum rate to 10%. The sigmoid function was used as the transform function. And we designed a three-layer perceptron neural network with one hidden layer and four output nodes. Based on the test data set, the learning for this research model was stopped when it reaches 50000 after reaching the minimum error in order to explore the point of learning. We finally processed each model's accuracy and found best model to predict each emotions. The result showed prediction accuracy 100% from sadness, and 96% from joy prediction in 20 frames set model. And 88% from surprise, and 98% from disgust in 30 frames set model. The findings of our research are expected to be useful to provide effective algorithm for personalized service in various industries such as advertisement, exhibition, performance, etc.

Development of Robotic Inspection System over Bridge Superstructure (교량 상판 하부 안전점검 로봇개발)

  • Nam Soon-Sung;Jang Jung-Whan;Yang Kyung-Taek
    • Proceedings of the Korean Institute Of Construction Engineering and Management
    • /
    • autumn
    • /
    • pp.180-185
    • /
    • 2003
  • The increase of traffic over a bridge has been emerged as one of the most severe problems in view of bridge maintenance, since the load effect caused by the vehicle passage over the bridge has brought out a long-term damage to bridge structure, and it is nearly impossible to maintain operational serviceability of bridge to user's satisfactory level without any concern on bridge maintenance at the phase of completion. Moreover, bridge maintenance operation should be performed by regular inspection over the bridge to prevent structural malfunction or unexpected accidents front breaking out by monitoring on cracks or deformations during service. Therefore, technical breakthrough related to this uninterested field of bridge maintenance leading the public to the turning point of recognition is desperately needed. This study has the aim of development on automated inspection system to lower surface of bridge superstructures to replace the conventional system of bridge inspection with the naked eye, where the monitoring staff is directly on board to refractive or other type of maintenance .vehicles, with which it is expected that we can solve the problems essentially where the results of inspection are varied to change with subjective manlier from monitoring staff, increase stabilities in safety during the inspection, and make contribution to construct data base by providing objective and quantitative data and materials through image processing method over data captured by cameras. By this system it is also expected that objective estimation over the right time of maintenance and reinforcement work will lead enormous decrease in maintenance cost.

  • PDF

Future Development Strategies for KODISA Journals: Overview of 2016 and Strategic Plans for the Future (KODISA 학술지 성장전략: 2016 개관 및 미래 성장개요)

  • Hwang, Hee-Joong;Lee, Jung-Wan;Youn, Myoung-Kil;Kim, Dong-Ho;Lee, Jong-Ho;Shin, Dong-Jin;Kim, Byung-Goo;Kim, Tae-Joong;Lee, Yong-Ki;Kim, Wan-Ki
    • Journal of Distribution Science
    • /
    • v.15 no.5
    • /
    • pp.75-83
    • /
    • 2017
  • Purpose - With the rise of the fourth industrial revolution, it has converged with the existing industrial revolution to give shape to increased accessibility of knowledge and information. As a result, it has become easier for scholars to actively persue and compile research in various fields. This current study aims to focus and assess the current standing of KODISA: the Journal of Distribution Science (JDS), International Journal of Industrial Distribution & Business(IJIDB), the East Asian Journal of Business Management (EAJBM), the Journal of Asian Finance, Economics and Business (JAFEB) in a rapidly evolving era. Novel strategies for creating the future vision of KODISA 2020 will also be examined. Research design, data, and methodology - The current research will analyze published journals of KODISA in order to offer a vision for the KODISA 2020 future. In part 1, this paper will observe the current address of the KODISA journal and its overview of past achievements. Next, part 2 will discuss the activities that will be needed for journals of KODISA, JDS, IJIDB, EAJBM, JAFEB to branch out internationally and significant journals will be statistically analyzed in part 3. The last part 4 will offer strategies for the continued growth of KODISA and visions for KODISA 2020. Results - Among the KODISA publications, IJIDB was second, JDS was 23rd (in economic publications of 54 journals), and EAJBM was 22nd (out of 79 publications in management field journals). This shows the high quality of the KODISA publication journals. According to 2016 publication analysis, JDS, IJIDB, etc. each had 157 publications, 15 publications, 16 publications, and 28 publications. In the case of JDS, it showed an increase of 14% compared to last year. Additionally, JAFEB showed a significant increase of 68%. This shows that compared to other journals, it had a higher rate of paper submission. IJIDB and EAJBM did not show any significant increases. In JDS, it showed many studies related to the distribution, management of distribution, and consumer behavior. In order to increase the status of the KODISA journal to a SCI status, many more international conferences will open to increase its international recognition levels. Second, the systematic functions of the journal will be developed further to increase its stability. Third, future graduate schools will open to foster future potential leaders in this field and build a platform for innovators and leaders. Conclusions - In KODISA, JDS was first published in 1999, and has been registered in SCOPUS February 2017. Other sister publications within the KODISA are preparing for SCOPUS registration as well. KODISA journals will prepare to be an innovative journal for 2020 and the future beyond.