• Title/Summary/Keyword: Module Extraction

Search Result 211, Processing Time 0.023 seconds

Real-Time Arbitrary Face Swapping System For Video Influencers Utilizing Arbitrary Generated Face Image Selection

  • Jihyeon Lee;Seunghoo Lee;Hongju Nam;Suk-Ho Lee
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.2
    • /
    • pp.31-38
    • /
    • 2023
  • This paper introduces a real-time face swapping system that enables video influencers to swap their faces with arbitrary generated face images of their choice. The system is implemented as a Django-based server that uses a REST request to communicate with the generative model,specifically the pretrained stable diffusion model. Once generated, the generated image is displayed on the front page so that the influencer can decide whether to use the generated face or not, by clicking on the accept button on the front page. If they choose to use it, both their face and the generated face are sent to the landmark extraction module to extract the landmarks, which are then used to swap the faces. To minimize the fluctuation of landmarks over time that can cause instability or jitter in the output, a temporal filtering step is added. Furthermore, to increase the processing speed the system works on a reduced set of the extracted landmarks.

Development of RPA with Information Extraction Module (문서에서 정보 추출 기능을 갖는 RPA 개발)

  • Kim, Ki-Tae;Jeong, Su-Na;Lee, Se-Hoon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.07a
    • /
    • pp.435-436
    • /
    • 2021
  • 본 논문에서는 RPA(Robotic Process Automation) Tool 개발 과정 중 OCR기법을 활용한 영수증 인식 후 가계부 생성에 관한 자동화 처리 과정을 기술한다. 개발된 RPA 툴은 AI분야에 사용될 데이터의 데이터 전처리 기능을 제공하고 그 외에 반복적으로 사용되는 기능들의 자동화를 제공한다. 그 중 영수증을 이용하여 가계부 작성을 자동으로 처리해주는 기능은 반복적이고 시간이 많이 소요되는 작업으로 이 기능을 활용하면 작업의 수행시간을 단축하고 효율적인 관리가 가능하다.

  • PDF

Hardware Design for JBIG2 Encoder on Embedded System (임베디드용 JBIG2 부호화기의 하드웨어 설계)

  • Seo, Seok-Yong;Ko, Hyung-Hwa
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.2C
    • /
    • pp.182-192
    • /
    • 2010
  • This paper proposes the hardware IP design of JBIG2 encoder. In order to facilitate the next generation FAX after the standardization of JBIG2, major modules of JBIG2 encoder are designed and implemented, such as symbol extraction module, Huffman coder, MMR coder, and MQ coder. ImpulseC Codeveloper and Xilinx ISE/EDK program are used for the synthesis of VHDL code. To minimize the memory usage, 128 lines of input image are processed succesively instead of total image. The synthesized IPs are downloaded to Virtex-4 FX60 FPGA on ML410 development board. The four synthesized IPs utilize 36.7% of total slice of FPGA. Using Active-HDL tool, the generated IPs were verified showing normal operation. Compared with the software operation using microblaze cpu on ML410 board, the synthesized IPs are better in operation time. The improvement ratio of operation time between the synthesized IP and software is 17 times in case of symbol extraction IP, and 10 times in Huffman coder IP. MMR coder IP shows 6 times faster and MQ coder IP shows 2.2 times faster than software only operation. The synthesized H/W IP and S/W module cooperated to succeed in compressing the CCITT standard document.

A Thoracic Spine Segmentation Technique for Automatic Extraction of VHS and Cobb Angle from X-ray Images (X-ray 영상에서 VHS와 콥 각도 자동 추출을 위한 흉추 분할 기법)

  • Ye-Eun, Lee;Seung-Hwa, Han;Dong-Gyu, Lee;Ho-Joon, Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.1
    • /
    • pp.51-58
    • /
    • 2023
  • In this paper, we propose an organ segmentation technique for the automatic extraction of medical diagnostic indicators from X-ray images. In order to calculate diagnostic indicators of heart disease and spinal disease such as VHS(vertebral heart scale) and Cobb angle, it is necessary to accurately segment the thoracic spine, carina, and heart in a chest X-ray image. A deep neural network model in which the high-resolution representation of the image for each layer and the structure converted into a low-resolution feature map are connected in parallel was adopted. This structure enables the relative position information in the image to be effectively reflected in the segmentation process. It is shown that learning performance can be improved by combining the OCR module, in which pixel information and object information are mutually interacted in a multi-step process, and the channel attention module, which allows each channel of the network to be reflected as different weight values. In addition, a method of augmenting learning data is presented in order to provide robust performance against changes in the position, shape, and size of the subject in the X-ray image. The effectiveness of the proposed theory was evaluated through an experiment using 145 human chest X-ray images and 118 animal X-ray images.

Recovery of Silver from Nitrate Leaching Solution of Silicon Solar Cells (실리콘 태양전지 질산침출액에서 LIX63를 이용한 은(Ag) 회수)

  • Cho, Sung-Yong;Kim, Tae-Young;Sun, Pan-Pan
    • Resources Recycling
    • /
    • v.30 no.2
    • /
    • pp.39-45
    • /
    • 2021
  • Spent photovoltaic module is one of the important resource of silver, while related research concerning silver recovery remains limited. In our previous research, HNO3 was utilized to dissolve Ag(I) and Al(III) from the spent silicon solar cells. In order to recover Ag(I) from the leachate of a silicon solar cell, the present study made use of a nitrate solution containing Ag(I) and Al(III), which was subjected to a solvent extraction process with 5,8-diethyl-7-hydroxydodecan-6-oxime (LIX63). Ag(I) was selectively extracted with LIX63 over Al(III) from the nitrate leach solution. Subsequently, quantitative stripping of Ag(I) from the loaded LIX63 was performed by using 20% ammonia water. The McCabe-Thiele plots for the extraction and stripping isotherms of Ag(I) were also constructed. Extraction and stripping simulation tests confirmed an Ag(I) extraction and stripping efficiency of >99.99% and 98.9%, respectively with high purity Ag (99.998%) and Al (99.99%) solution. A process flow sheet for Ag(I) recovery from the nitrate leach solution was proposed.

A Study on an Extraction of the Geometric Characteristics of the Pyongchang River basin by Using Geographic Information System (GIS를 활용한 유역의 하천 형태학적 특성 추출에 관한 연구)

  • Hahm, Chang-Hahk
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.4 no.1 s.6
    • /
    • pp.115-119
    • /
    • 1996
  • odel). One of important tasks for hydrological analysis is the division of watershed. It can be an essential factor amThe main objective of this study is to extract of the geometric characteristics of the Pyongchang River basin, headwaters of the South Ran River. A GIS is capable of extracting various hydrological factors from DEM(digital elevation mong various geometric characteristics of watershed. In this study, watershed itself and other geometric factors of watershed are extracted from DEM by using a GIS technique. The manual process of tasks to obtain geometric characteristics of watershed is automated. by using the function of ARC/INFO software as a GIS package. Scanned data is used for this study and it is converted to DEM data Various forms of representation of spatial data are handled in main modules and a GRID module of ARC/INFO. A GRID module is used on a stream in order to define watershed boundary, so it would be possible to obtain the watersheds. Also, a flowdirection, stream networks and others are generated. The results show that GIS can aid watershed management and research and surveillance. Also the geometric characteristics as parameters of watershed can be quantified by a using GIS technique. Resonable results can be obtained as compared with conventional graphic methods.

  • PDF

Development of Human-machine Interface based on EMG and EOG (근전도와 안전도 기반의 인간-기계 인터페이스기술)

  • Gang, Gyeong Woo;Kim, Tae Seon
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.12
    • /
    • pp.129-137
    • /
    • 2013
  • As the usage of computer based systems continues to increase in our normal life, there are constant efforts to enhance the accessibility of information for handicapped people. For this, it is essential to develop new interface ways for physical disabled peoples by means of human-computer interface (HCI) or human-machine interface (HMI). In this paper, we developed HMI using electromyogram (EMG) and electrooculogram (EOG) for people with physical disabilities. Developed system is composed of two modules, hardware module for signal sensing and software module for feature extraction and pattern classification. To maximize ease of use, only two skin contact electrodes are attached on both ends of brow, and EOG and EMG are measured simultaneously through these two electrodes. From measured signal, nine kinds of command patterns are extracted and defined using signal processing and pattern classification method. Through Java based real-time monitoring program, developed system showed 92.52% of command recognition rate. In addition, to show the capability of the developed system on real applications, five different types of commands are used to control ER1 robot. The results show that developed system can be applied to disabled person with quadriplegia as a novel interface way.

Automatic Extraction of Focused Video Object from Low Depth-of-Field Image Sequences (낮은 피사계 심도의 동영상에서 포커스 된 비디오 객체의 자동 검출)

  • Park, Jung-Woo;Kim, Chang-Ick
    • Journal of KIISE:Software and Applications
    • /
    • v.33 no.10
    • /
    • pp.851-861
    • /
    • 2006
  • The paper proposes a novel unsupervised video object segmentation algorithm for image sequences with low depth-of-field (DOF), which is a popular photographic technique enabling to represent the intention of photographer by giving a clear focus only on an object-of-interest (OOI). The proposed algorithm largely consists of two modules. The first module automatically extracts OOIs from the first frame by separating sharply focused OOIs from other out-of-focused foreground or background objects. The second module tracks OOIs for the rest of the video sequence, aimed at running the system in real-time, or at least, semi-real-time. The experimental results indicate that the proposed algorithm provides an effective tool, which can be a basis of applications, such as video analysis for virtual reality, immersive video system, photo-realistic video scene generation and video indexing systems.

Hardware Architecture and Memory Bandwidth Analysis of AVM System (AVM 시스템의 하드웨어 구현에 따른 하드웨어 구조 및 메모리 대역폭 분석)

  • Nam, Kwnag-Min;Jung, Yong-Jin
    • Journal of IKEEE
    • /
    • v.20 no.3
    • /
    • pp.241-250
    • /
    • 2016
  • AVM(Around View Monitoring) is a function of ADAS(Advanced Driver Assistance Systems), which provides a bird's eye view of the surroundings of a vehicle to the user. AVM systems require large bandwidth since they are composed of four input images and require real-time processing for vehicle-embedded environments. Also, the memory bandwidth requirement increases greatly when the resolution of the input data is higher. In this paper, we propose four basic hardware models of AVM systems. The models are decided by whether or not there is a valid data extraction module and an image processing purpose LUT generation module. We analyze the required bandwidth and hardware resource for each model. For verification of the proposed models, we implemented an AVM system using XC7Z045 FPGA and DDR3 memory for VGA and FHD resolution. All four of the proposed hardware model is executed below 33ms, which shows that it can operate in real-time.

Multimodal approach for blocking obscene and violent contents (멀티미디어 유해 콘텐츠 차단을 위한 다중 기법)

  • Baek, Jin-heon;Lee, Da-kyeong;Hong, Chae-yeon;Ahn, Byeong-tae
    • Journal of Convergence for Information Technology
    • /
    • v.7 no.6
    • /
    • pp.113-121
    • /
    • 2017
  • Due to the development of IT technology, harmful multimedia contents are spreading out. In addition, obscene and violent contents have a negative impact on children. Therefore, in this paper, we propose a multimodal approach for blocking obscene and violent video contents. Within this approach, there are two modules each detects obsceneness and violence. In the obsceneness module, there is a model that detects obsceneness based on adult and racy score. In the violence module, there are two models for detecting violence: one is the blood detection model using RGB region and the other is motion extraction model for observation that violent actions have larger magnitude and direction change. Through result of these three models, this approach judges whether or not the content is harmful. This can contribute to the blocking obscene and violent contents that are distributed indiscriminately.