• Title/Summary/Keyword: open-set recognition

Automatic Recognition of the Front/Back Sides and Stalk States for Mushrooms(Lentinus Edodes L.) (버섯 전후면과 꼭지부 상태의 자동 인식)

  • Hwang, H.;Lee, C.H.
    • Journal of Biosystems Engineering / v.19 no.2 / pp.124-137 / 1994
  • As with most agricultural products, the visual features of a mushroom (Lentinus Edodes, L.) are critical in grading and sorting. Because these features are complex and varied, grading and sorting of mushrooms have been done manually by human experts. To realize automatic handling and grading of mushrooms in real time, a computer vision system should be used, and the visual information captured by the camera must be processed efficiently and robustly. Since the visual features of a mushroom are distributed over the front and back sides, recognizing the sides and the state of the stalk, including the stalk orientation, from the captured image is a primary step in automatic task processing. In this paper, an efficient and robust recognition process that identifies the front and back sides and the state of the stalk was developed, and its performance was compared with other recognition trials. First, recognition was attempted with rules built from experimental heuristics, using quantitative features such as geometry and texture extracted from the segmented mushroom image. Next, neural-network-based learning recognition was performed without extracting quantitative features. As network input, the segmented binary image obtained from combined-type automatic thresholding was tested first, and then the gray-valued raw camera image was used directly. The state of the stalk seriously affects the measured size of the mushroom cap; when the effect is serious, the stalk should be excluded from cap sizing. A stalk removal process followed by boundary regeneration of the cap image is therefore also presented. The neural-network-based processing of the gray-valued raw image gave successful results for the recognition task. The technology developed in this research may open a new way of quality inspection and sorting, especially for agricultural products whose visual features are fuzzy and not uniquely defined.

Incremental Face Annotation for Open Web Service (개방형 웹 서버스를 위한 증가적 얼굴 어노테이션)

  • Chai, Kwon-Taeg;Byun, Hye-Ran
    • Journal of KIISE:Software and Applications / v.36 no.8 / pp.673-682 / 2009
  • Recently, Social Network Sites (SNSs) based on photo sharing and publishing have been attracting increasing attention from academic and industry researchers. Unlike the face recognition environment addressed by existing work, the face annotation problem in SNSs is characterized by an image database that is updated daily, a limited number of training samples, and millions of users. Conventional approaches may therefore not cope with these problems. In this paper, we propose a face annotation method for sharing and publishing photographs that contain faces in a social network service, using random projection, non-linear regression, and representational state transfer. Our experiments on several databases show that the proposed method achieves an almost constant execution time with accuracy comparable to that of a PCA-SVM classifier.

Recognition of Machining Features on Prismatic Components (각주형 부품상의 가공 특징형상 인식)

  • 손영태;박면웅
    • Transactions of the Korean Society of Mechanical Engineers / v.17 no.6 / pp.1412-1422 / 1993
  • As part of the development of a process planning system for mold die manufacturing, a software system was developed that recognizes features and extracts shape parameters from design data produced by a solid modeller. The recognized feature data is fed to the process planning and operation planning system. Low-level geometry and topology data from a commercial CAD system is transformed into high-level machining feature data, a task that used to require a dedicated design system. The recognition algorithm is applied to design data in boundary representation produced by the core modeller ACIS, which has an object-oriented open architecture and is expected to become a common core modeller of next-generation CAD systems. The recognition algorithm has been formulated for 21 features on prismatic components, but the feature set can be expanded by adding rules for additional features.

Graphemes Segmentation for Arabic Online Handwriting Modeling

  • Boubaker, Houcine;Tagougui, Najiba;El Abed, Haikal;Kherallah, Monji;Alimi, Adel M.
    • Journal of Information Processing Systems / v.10 no.4 / pp.503-522 / 2014
  • In cursive handwriting recognition, script trajectory segmentation and modeling are an important task for large or open lexicon contexts, and they become more complicated in multi-writer applications. In this paper, we present a system for Arabic online handwriting modeling based on grapheme segmentation and the extraction of geometric features. The main contribution consists of adapting Fourier descriptors to model the open trajectory of the segmented graphemes. To segment the handwriting trajectory, the system first detects its baseline by checking combined geometric and logic conditions. The detected baseline is then used as a topological reference for extracting the particular points that delimit the graphemes' trajectories. Each segmented grapheme is then represented by a set of relevant geometric features: the vector of Fourier descriptors for trajectory shape modeling, normalized metric parameters that model the grapheme dimensions, its position with respect to the baseline, and codes describing its associated diacritics.
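
Fourier descriptors are normally defined for closed contours, so an open pen trajectory needs some adaptation before they apply. The abstract does not say how the authors adapt them. The sketch below uses one common workaround, which is an assumption here and not necessarily the paper's method: retrace the trajectory backwards to close it, then keep the magnitudes of the first few harmonics. The function name `open_trajectory_descriptors` and the parameter `n_coeffs` are hypothetical.

```python
import numpy as np

def open_trajectory_descriptors(points, n_coeffs=8):
    """Magnitude Fourier descriptors of an open 2-D trajectory (illustrative sketch)."""
    # Represent each (x, y) sample as a complex number x + iy.
    z = np.asarray([complex(x, y) for x, y in points])
    # Assumption: close the curve by walking back along it,
    # skipping the two endpoints so they are not duplicated.
    closed = np.concatenate([z, z[-2:0:-1]])
    coeffs = np.fft.fft(closed) / len(closed)
    # Skip the DC term (removes translation) and divide by the first
    # harmonic's magnitude (removes scale); magnitudes ignore rotation.
    harmonics = coeffs[1:n_coeffs + 1]
    return np.abs(harmonics) / np.abs(coeffs[1])
```

Under these assumptions the descriptor vector is unchanged when the trajectory is shifted or uniformly scaled, which is the property that makes such vectors usable as shape features.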

Comparison Study of the Performance of CNN Models with Multi-view Image Set on the Classification of Ship Hull Blocks (다시점 영상 집합을 활용한 선체 블록 분류를 위한 CNN 모델 성능 비교 연구)

  • Chon, Haemyung;Noh, Jackyou
    • Journal of the Society of Naval Architects of Korea / v.57 no.3 / pp.140-151 / 2020
  • When scheduling the shipbuilding process, it is important to identify the location of ship hull blocks together with their exact block identification numbers. Wrong information on the location or identification number of a hull block lowers productivity, because time is spent finding where the block actually is. To solve this problem, a system is needed that tracks the location of the blocks and identifies their identification numbers automatically. There has been much research on location tracking systems for hull blocks in the stockyard, but none on identifying the hull blocks in the stockyard. This study compares the performance of five Convolutional Neural Network (CNN) models with a multi-view image set on the classification of hull blocks, with the aim of identifying the blocks in the stockyard. The CNN models are open algorithms from the ImageNet Large-Scale Visual Recognition Competition (ILSVRC). Four scaled hull block models are used to acquire the images of ship hull blocks. The CNN models were trained, with and without transfer learning, on the original training data and on augmented versions of it. In total, 20 tests and predictions are performed, covering five CNN models and four training conditions. To compare the classification performance of the CNN models, accuracy and average F1-Score computed from the confusion matrix are adopted as the performance measures. In the comparison, the Resnet-152v2 model shows the highest accuracy and average F1-Score with both the full block prediction image set and the cropped block prediction image set.
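
The two performance measures named in this abstract can both be read off a confusion matrix. The following is a minimal sketch and not the authors' evaluation code. It assumes that "average F1-Score" means the unweighted (macro) average over classes, and the function name `accuracy_and_macro_f1` is hypothetical.

```python
def accuracy_and_macro_f1(confusion):
    """Return (accuracy, macro-averaged F1).

    confusion[i][j] is the number of samples of true class i
    that were predicted as class j.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(n))
    f1_scores = []
    for c in range(n):
        tp = confusion[c][c]
        fp = sum(confusion[r][c] for r in range(n)) - tp  # predicted c, true class differs
        fn = sum(confusion[c]) - tp                        # true c, predicted otherwise
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        f1_scores.append(2 * tp / denom if denom else 0.0)
    # Assumption: "average" F1 is the unweighted mean over classes.
    return correct / total, sum(f1_scores) / n
```

For a four-class problem such as the four scaled block models, `confusion` would be a 4×4 list of counts. The macro average weights every class equally, so a rarely seen block counts as much as a common one.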

The input device system with hand motion using hand tracking technique of CamShift algorithm (CamShift 알고리즘의 Hand Tracking 기법을 응용한 Hand Motion 입력 장치 시스템)

  • Jeon, Yu-Na;Kim, Soo-Ji;Lee, Chang-Hoon;Kim, Hyeong-Ryul;Lee, Sung-Koo
    • Journal of Digital Contents Society / v.16 no.1 / pp.157-164 / 2015
  • Existing input devices have been limited to the keyboard and mouse. Recently, however, new types of input device have been developed in response to user demand. Reflecting this trend, we propose a new type of input device that issues commands by analyzing hand motion in camera images, without any special device. After binarizing and tracking the skin-colored area with the CamShift method, the system separates the finger areas through labeling, assigns each finger area to one of four cardinal directions according to its angle from the palm center point, and recognizes the hand motion by counting them. When no specific background was set and no gloves were worn, the recognition rate remained at approximately 75 percent. When a specific background was set and the person wore red gloves, the recognition rate increased to 90.2 percent because of reduced noise.

Investigation of image preprocessing and face covering influences on motion recognition by a 2D human pose estimation algorithm (모션 인식을 위한 2D 자세 추정 알고리듬의 이미지 전처리 및 얼굴 가림에 대한 영향도 분석)

  • Noh, Eunsol;Yi, Sarang;Hong, Seokmoo
    • Journal of the Korea Academia-Industrial cooperation Society / v.21 no.7 / pp.285-291 / 2020
  • In manufacturing, humans are being replaced with robots, but expert skills remain difficult to convert into data, which makes them difficult to apply to industrial robots. One approach is visual motion recognition, but physical features may be judged differently depending on the image data. This study aimed to improve the accuracy of vision methods for estimating human posture. Three OpenPose vision models were applied: MPII, COCO, and COCO+foot. To identify the effects of face-covering accessories and image preprocessing on the Convolutional Neural Network (CNN) structure, the presence or absence of accessories, image size, and filtering were set as the parameters affecting the identification of a human's posture. For each parameter, image data were applied to the three models, and the errors between the actual and predicted values, as well as the percentage of correct keypoints (PCK), were calculated. The COCO+foot model showed the lowest sensitivity to all three parameters. A <50% (from 3024×4032 to 1512×2016 pixels) reduction in image size was considered acceptable. Emboss filtering, in combination with MPII, provided the best results (reduced error of <60 pixels).
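
The percentage of correct keypoints (PCK) mentioned in this abstract counts a predicted keypoint as correct when it lies close enough to the actual one. The sketch below is illustrative only. It assumes a fixed pixel threshold, whereas PCK is often defined relative to a body-size measure, and the abstract does not state which variant the authors used. The function name `pck` and its parameters are hypothetical.

```python
import math

def pck(predicted, actual, threshold):
    """Fraction of keypoints predicted within `threshold` pixels of the actual position.

    predicted, actual: equal-length lists of (x, y) pixel coordinates.
    """
    hits = sum(
        1
        for (px, py), (ax, ay) in zip(predicted, actual)
        if math.hypot(px - ax, py - ay) <= threshold  # Euclidean pixel error
    )
    return hits / len(actual)
```

The per-keypoint Euclidean distance inside the loop is the same quantity as the pixel error the abstract reports, so both measures can come from a single pass over the keypoints.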

The study of Parking Management System by Image Processing (영상인식을 이용한 주차 관리 시스템 연구)

  • Kim, Kun-Kook;Son, Woong-Gi;Lee, Min-Gyu;Han, Jung-Gu;Park, Yong-Wook
    • The Journal of the Korea institute of electronic communication sciences / v.12 no.4 / pp.651-656 / 2017
  • In this study, we designed a system that lets drivers check all information about the parking spaces at the entrance and find out whether a space is available, using an image recognition function that can also recognize car number plates accurately. In addition, we placed the webcam close to the car number plate so that the car number can be identified more quickly. Finally, with the webcam mounted high, the system helps prevent drivers from parking in the wrong place by displaying this on the screen.

A Selection of Threshold for the Generalized Hough Transform: A Probabilistic Approach (일반화된 허프변환의 임계값 선택을 위한 확률적 접근방식)

  • Chang, Ji Y.
    • Journal of the Institute of Electronics and Information Engineers / v.51 no.1 / pp.161-171 / 2014
  • When the Hough transform is applied to identify an instance of a given model, the output is typically a histogram of votes cast by a set of image features into a parameter space. The next step is to threshold the histogram of counts to hypothesize a given match. The question is "What is a reasonable choice of the threshold?" In a standard implementation of the Hough transform, the threshold is selected heuristically, e.g., as some fraction of the highest cell count. Setting the threshold too low can give rise to a false alarm of a given shape (Type I error). On the other hand, setting the threshold too high can result in mis-detection of a given shape (Type II error). In this paper, we derive two conditional probability functions of cell counts in the accumulator array of the generalized Hough transform (GHough) that can be used to select a principled threshold at the peak detection stage of the GHough.
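
The heuristic baseline described in this abstract, thresholding at some fraction of the highest cell count, can be sketched as follows. This illustrates only the standard heuristic, not the paper's probabilistic method. The function name `peaks_above_fraction` and the default fraction of 0.8 are hypothetical choices.

```python
def peaks_above_fraction(accumulator, fraction=0.8):
    """Return (row, col, votes) for accumulator cells with at least
    `fraction` of the highest vote count.

    accumulator: 2-D list of non-negative vote counts.
    """
    peak = max(max(row) for row in accumulator)
    threshold = fraction * peak  # heuristic: a fixed fraction of the maximum
    return [
        (r, c, votes)
        for r, row in enumerate(accumulator)
        for c, votes in enumerate(row)
        if votes >= threshold and votes > 0  # ignore empty cells
    ]
```

Lowering `fraction` admits more cells and raises the risk of false alarms (Type I error), while raising it risks missing true instances (Type II error). That is the trade-off the abstract describes.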

Outdoor Care System using WEMOS and Arduino MEGA (WEMOS와 아두이노 MEGA를 이용한 외출 케어 시스템)

  • Jeong-Geun Choi;Chang-Hyun Kim;Chan-Gyu Lee;Geon-Ho Choi;Boong-Joo Lee
    • The Journal of the Korea institute of electronic communication sciences / v.18 no.4 / pp.677-686 / 2023
  • In this paper, we study the design and implementation of a smart home outing care system that recognizes the user's purpose for going out and delivers useful information for the outing. RSS service data from the Korea Meteorological Administration is received in real time using an ESP8266, and a system is implemented that analyzes the data with an Arduino MEGA and provides weather information to the user. An app built with App Inventor helps the user pack the necessary items without forgetting any, and its settings can be changed according to the desired weather and purpose. The microphone was placed outside, which improved recognition by 12%, and the sensitivity of the pressure sensor was set to a maximum of 210 kΩ. If there is an obstacle between the doors, the doors open automatically. An ultrasonic sensor was placed on the ceiling of the drawer to detect the presence of an object within a range of 0.5 cm to 10 cm, and a camera was installed to study a security reinforcement system.