• Title/Summary/Keyword: Feature representation

Research on the Spatial Expression Characteristics of Illustration in Picture Books (그림책 속 일러스트레이션의 공간 표현 특징 연구)

  • Han, YongGang;Kim, KieSu
    • The Journal of the Korea Contents Association / v.21 no.3 / pp.131-142 / 2021
  • This research examines picture design in picture books, where the spatial representation of illustrations plays a significantly important role. Arranging the various texts, pictures, and spaces within a picture smoothly requires a range of editing skills from the creator. In this paper, the characteristics of spatial expression in picture book design are derived by analyzing several example pictures and studies on classic picture books. First, picture and text are fused: both convey spatial information together as elements of the screen. Second, the spatial design is coherent: the story and content must connect smoothly as the book is read. Third, when expressing a space, the creator should draw on the complementary strengths and weaknesses of abstract and conceived spatial expressions as needed. Fourth, spatial expression is symbolic: many symbolic expression techniques are applied according to semiotic principles, which greatly improves the cognitive efficiency of reading picture books. Fifth, the spatial expression of an excellent picture book is engaging, uses rich design means, and conveys screen content and screen format to readers in an interesting way. The research suggests that designers and artists should work within a spatial framework when designing picture books, which greatly improves the efficiency of the creation process while also providing a reader-centered, visually interesting experience.

A Thoracic Spine Segmentation Technique for Automatic Extraction of VHS and Cobb Angle from X-ray Images (X-ray 영상에서 VHS와 콥 각도 자동 추출을 위한 흉추 분할 기법)

  • Ye-Eun, Lee;Seung-Hwa, Han;Dong-Gyu, Lee;Ho-Joon, Kim
    • KIPS Transactions on Software and Data Engineering / v.12 no.1 / pp.51-58 / 2023
  • In this paper, we propose an organ segmentation technique for the automatic extraction of medical diagnostic indicators from X-ray images. To calculate diagnostic indicators of heart disease and spinal disease, such as VHS (vertebral heart scale) and the Cobb angle, it is necessary to accurately segment the thoracic spine, carina, and heart in a chest X-ray image. We adopt a deep neural network model in which the high-resolution representation of the image at each layer is connected in parallel with a structure that converts it into low-resolution feature maps. This structure enables the relative position information in the image to be reflected effectively in the segmentation process. We show that learning performance can be improved by combining the OCR module, in which pixel information and object information interact in a multi-step process, with a channel attention module, which allows each channel of the network to be weighted differently. In addition, a method of augmenting the training data is presented to provide robust performance against changes in the position, shape, and size of the subject in the X-ray image. The effectiveness of the proposed technique was evaluated through an experiment using 145 human chest X-ray images and 118 animal X-ray images.
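
The abstract does not say how the Cobb angle is derived once the vertebrae are segmented. As a minimal sketch under the usual clinical definition (the angle between the endplate lines of the two most tilted vertebrae), and assuming the endplate endpoints have already been extracted from the segmentation masks, the final step could look like this. The function names and the input format are hypothetical, not taken from the paper.

```python
import math


def line_angle(p, q):
    """Direction (radians) of the line through points p and q, each (x, y)."""
    return math.atan2(q[1] - p[1], q[0] - p[0])


def cobb_angle_deg(upper_endplate, lower_endplate):
    """Acute angle in degrees between two endplate lines.

    Each endplate is a pair of (x, y) points. How those points are obtained
    from a segmentation mask is outside this sketch (an assumption).
    """
    a = line_angle(*upper_endplate)
    b = line_angle(*lower_endplate)
    # Lines have no direction, so fold the difference into [0, pi/2].
    diff = abs(a - b) % math.pi
    return math.degrees(min(diff, math.pi - diff))
```

Because the difference is folded into the range 0 to 90 degrees, the result does not depend on the order in which the two endpoints of an endplate are given.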

Extending StarGAN-VC to Unseen Speakers Using RawNet3 Speaker Representation (RawNet3 화자 표현을 활용한 임의의 화자 간 음성 변환을 위한 StarGAN의 확장)

  • Bogyung Park;Somin Park;Hyunki Hong
    • KIPS Transactions on Software and Data Engineering / v.12 no.7 / pp.303-314 / 2023
  • Voice conversion, a technology that allows an individual's speech data to be regenerated with the acoustic properties (tone, cadence, gender) of another, has countless applications in education, communication, and entertainment. This paper proposes an approach based on the StarGAN-VC model that generates realistic-sounding speech without requiring parallel utterances. To overcome the constraints of the existing StarGAN-VC model, which uses one-hot vectors for source and target speaker information, this paper extracts feature vectors of target speakers using a pre-trained version of RawNet3. This results in a latent space where voice conversion can be performed without direct speaker-to-speaker mappings, enabling an any-to-any structure. In addition to the loss terms used in the original StarGAN-VC model, Wasserstein distance is used as a loss term to ensure that generated voice segments match the acoustic properties of the target voice. The Two Time-Scale Update Rule (TTUR) is also used to facilitate stable training. Experimental results show that the proposed method outperforms previous methods, including the StarGAN-VC network on which it was based.

Improvement of Face Recognition Algorithm for Residential Area Surveillance System Based on Graph Convolution Network (그래프 컨벌루션 네트워크 기반 주거지역 감시시스템의 얼굴인식 알고리즘 개선)

  • Tan Heyi;Byung-Won Min
    • Journal of Internet of Things and Convergence / v.10 no.2 / pp.1-15 / 2024
  • The construction of smart communities is a new method and an important measure for ensuring the security of residential areas. To solve the problem of low face recognition accuracy caused by facial features being distorted by monitoring camera angles and other external factors, this paper proposes the following optimization strategies in designing a face recognition network. First, a global graph convolution module is designed to encode facial features as graph nodes, and a multi-scale feature enhancement residual module is designed to extract facial keypoint features in conjunction with the global graph convolution module. Second, once the facial keypoints are obtained, they are organized into a directed graph structure, and graph attention mechanisms are used to enhance the representation power of the graph features. Finally, tensor computations are performed on the graph features of two faces, and the aggregated features are extracted and discriminated by a fully connected layer to determine whether the two individuals have the same identity. In experiments, the network designed in this paper achieves an AUC of 85.65% for facial keypoint localization on the 300W public dataset and 88.92% on a self-built dataset. For face recognition, the proposed network achieves an accuracy of 83.41% on the IBUG public dataset and 96.74% on a self-built dataset. The experimental results demonstrate that the network designed in this paper exhibits high detection and recognition accuracy for faces in surveillance videos.

The relation between Movement working as a Grouping clue in Moving Picture and Semantic structure forming (동영상에서 그룹핑(grouping) 단서로 작용하는 움직임(Movement)과 의미구조 형성의 관계)

  • Lee, Soo-Jin
    • Archives of design research / v.19 no.5 s.67 / pp.119-128 / 2006
  • The scale of visual expression has expanded from the still frame to the moving picture as media have developed. In moving pictures such as animation, movies, TV commercials, and GUIs, movement becomes a necessary formative element in a way it is not in the still frame, as the apparent-movement phenomenon and unit structures such as the shot and the scene appear. Therefore, among formative elements such as shape, color, space, size, and movement, movement stands out in the moving image. The expression and form of an image, in the relationship between the signified and the signifier explained by Saussure, are accepted as a sign through mutual complement even though they limit the content. This makes it possible to infer that the formal feature of movement participates in the message content. To verify this, a visual perception experiment on moving pictures based on the gestalt grouping principles was conducted; it shows that 70-80 percent of subjects regard 'movement' as an important grouping clue in perception. When the meaning structure of a moving picture is analyzed on the basis of its structural features, movement affects how the context of the message content is maintained in the communication process. First, identity can be maintained if there is movement with a similar direction, even if the color and shape of people, things, and background change. Second, the clarity of the content is raised because movement distinguishes an object as a figure. Third, movement acts as a knowledge representation that allows the similar movement in the next stage of information processing to be predicted. Fourth, movement gives the content consistency even when two or more scenes are switched quickly in a complicated editing structure such as cross-cutting. Movement thus becomes a clue for grouping the information received through visual perception. It also gives order to visual expression that might otherwise be used improperly, by forming a structural frame for the image message, and it raises the clarity of signification. A moving picture carries a discourse of several mixed unit structures because it fundamentally contains time, and both common and distinctive expression are needed in media-mix circumstances. Therefore, when the gestalt grouping principles are applied to the moving picture field, movement is more prominent than other formative elements and affects the formation of the meaning structure. This study proposes a viewpoint for developing structural formative beauty and new image expression in the media image field.

A Comparative Study on the Design Element in Traditional Palaces Korea, China and Japan (한 중 일 의장 문화 비교 연구 - 궁궐전출을 중심으로 -)

  • Lee, Hyun-Jung;Park, Young-Soon;Choi, Ji-Young;Hwang, Jung-Ah
    • Archives of design research / v.18 no.4 s.62 / pp.277-286 / 2005
  • The purpose of this study is to ascertain the design elements in the traditional palaces of Korea, China and Japan. The study proceeds in three steps. First, an analysis framework is established from the literature. Second, the design elements (form, material, pattern, and color) are collected and investigated through observation of actual traditional palaces: Changduckung, the Forbidden City, and Nijo Castle. Third, the results of the investigation from step two are analyzed. The similarities and differences among the design elements in the traditional palaces of Korea, China and Japan are summarized as follows. The main common characteristics of the artistic design are 'naturalism', 'harmonious ideas' and 'Confucianism'. However, the representation style of the design elements differs by country. The typical features of China are symmetry, glassy surfaces produced by artificial processes, meandering curves, magnificent patterns, and contrasting colors. In Japan, mathematical asymmetry, rough surfaces made up by artificial skill, abbreviated decorative patterns, and achromatic colors are the important features. The major features of Korean design elements are asymmetrical balance with nature, rough surfaces produced by natural processes, moderate patterns, and harmonious colors.

Study for making movie poster applied Augmented Reality (증강현실 영화포스터 제작연구)

  • Lee, Ki Ho
    • Cartoon and Animation Studies / s.48 / pp.359-383 / 2017
  • Since the first poster in human history appeared in Egypt 3,000 years ago, the invention of printing techniques and the development of civilization have accelerated poster production technology. In keeping with this, poster expression has developed from simple arrangements of characters into an attempt to express artistic sensibility, and it has now become an art form that is the domain of professional designers. However, technological development in poster expression remains two-dimensional and dependent on printing alone, so it is unrelated to changes in the ICT environment based on modern multimedia. In particular, among the many kinds of posters, movie posters, the only ones whose subject is video, are still printed on paper; although many attempts have been made, the movie industry still does not consider ICT integration at all. This study started from the fact that the subject of a movie poster is video, and attempted to introduce augmented reality to apply the dynamic images of the movie to the static poster. For the graduation works of the media design major at a university in Korea, posters promoting each student's visual work were designed and printed in the form of commercial film posters. Among them, six works considered suitable for augmented reality were selected, and augmented reality was introduced and exhibited. The content that appears matched to the poster through a mobile device reproduces a scene of the video on the poster, while the text information of the original poster is kept as it is, making it possible to build a moving poster like the wanted posters in the movie "Harry Potter". To produce these augmented reality posters, we applied augmented reality to posters of existing commercial films produced in two different formats, and found a way to enhance the characteristics of the AR content. Through this, we were able to understand poster design suitable for AR representation, and the technical expression needed for stable operation of augmented reality can be summarized in the matching process of augmented reality content production.

Facial Expression Control of 3D Avatar using Motion Data (모션 데이터를 이용한 3차원 아바타 얼굴 표정 제어)

  • Kim Sung-Ho;Jung Moon-Ryul
    • The KIPS Transactions:PartA / v.11A no.5 / pp.383-390 / 2004
  • This paper proposes a method that controls the facial expression of a 3D avatar by having the user select a sequence of facial expressions in the space of facial expressions, and we build a system that implements it. The space of expressions is created from about 2,400 frames of motion-captured facial expression data. To represent the state of each expression, we use the distance matrix that holds the distances between pairs of feature points on the face. The set of distance matrices is used as the space of expressions. However, this is not a space in which one state can move to another along the straight trajectory between them. We therefore derive trajectories between two states approximately from the captured set of expressions. First, two states are regarded as adjacent if the distance between their distance matrices is below a given threshold. Any two states are considered to have a trajectory between them if there is a sequence of adjacent states between them. It is assumed that one state moves to another via the shortest trajectory between them. The shortest trajectories are found by dynamic programming. The space of facial expressions, as a set of distance matrices, is multidimensional. The facial expression of the 3D avatar is controlled in real time as the user navigates the space. To help this process, we visualized the space of expressions in 2D using multidimensional scaling (MDS). To see how effective this system is, we had users control the facial expressions of a 3D avatar with it. The users judged the system to be very useful for controlling the facial expression of a 3D avatar in real time.
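
The trajectory construction described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract only says "dynamic programming", so Floyd-Warshall is used here as one dynamic-programming choice; the Frobenius-style distance between distance matrices is likewise assumed; and all function names are hypothetical.

```python
import math


def pairwise_distances(points):
    """Distance matrix between all pairs of feature points in one frame."""
    return [[math.dist(p, q) for q in points] for p in points]


def matrix_distance(a, b):
    """Frobenius-style distance between two distance matrices (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2
                         for row_a, row_b in zip(a, b)
                         for x, y in zip(row_a, row_b)))


def shortest_trajectories(states, threshold):
    """All-pairs shortest trajectories over states (one distance matrix each).

    Two states are adjacent when their matrix distance is below `threshold`.
    Returns (dist, nxt); nxt[i][j] is the next state on the path from i to j.
    """
    n = len(states)
    dist = [[math.inf] * n for _ in range(n)]
    nxt = [[None] * n for _ in range(n)]
    for i in range(n):
        dist[i][i] = 0.0
        nxt[i][i] = i
        for j in range(i + 1, n):
            d = matrix_distance(states[i], states[j])
            if d < threshold:
                dist[i][j] = dist[j][i] = d
                nxt[i][j], nxt[j][i] = j, i
    # Floyd-Warshall: allow each state k in turn as an intermediate stop.
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if dist[i][k] + dist[k][j] < dist[i][j]:
                    dist[i][j] = dist[i][k] + dist[k][j]
                    nxt[i][j] = nxt[i][k]
    return dist, nxt


def trajectory(nxt, start, goal):
    """Sequence of state indices from start to goal, or [] if unreachable."""
    if nxt[start][goal] is None:
        return []
    path = [start]
    while start != goal:
        start = nxt[start][goal]
        path.append(start)
    return path
```

The all-pairs form runs in cubic time in the number of frames. It fits the interactive setting the abstract describes because every trajectory can be precomputed once and then looked up as the user navigates.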

A Study on the Meaning and Mount Effect of Twelve Peaks of Musan in Yongho Garden, Jinju (진주 용호정원(龍虎庭園) 무산십이봉의 경관의미와 축산효과)

  • Lee, Hyun-Woo
    • Journal of the Korean Institute of Traditional Landscape Architecture / v.29 no.4 / pp.27-39 / 2011
  • The study of the Musan twelve peaks of Yongho garden in Jinju, Gyeongnam was expected to provide data and implications, in terms of design factors, for reproducing similar spaces and adapting them to modern use, since the garden is a prototype of the traditional mount built to overcome monotonous terrain and to introduce change and interest. The study analyzed and interpreted the symbolism of the twelve peaks, the principles of space composition, and the function and effect of the visual construction pursued by the builder in terms of landscape view; the results are as follows. Yonghoji(龍虎池), the center of Yongho garden, is a typical man-made pond serving as a supportive feng shui feature. It is a supporting device to complete the feng shui of the site, and that completion is strengthened through the connection with the dragon-related place name. The shape of the Musan twelve peaks resembles the oval form of Geumseongsan(金星山), 2~3.5m in height and 6~12m in diameter. The peaks are estimated at 1.5~3.7m (2.4m on average) in height, $35{\sim}138m^2$ ($73.4m^2$ on average) in area, and $30.7{\sim}115.0m^3$ ($62.5m^3$ on average) in volume. Given that Yonghojeong(龍虎亭), Soseon(小船), the site of the main building, and Yongsanjae(龍山齋) stand in a line, Yonghoji is presumed to represent the state of enlightenment by ascribing meaning to virtue and secularity. To realize the Musan twelve peaks, the builder probably built up twelve peaks forming the bodies of dragons that cross the point corresponding to the head of a tiger, and placed the Musan twelve peaks and Yonghojeong so as to represent dragons holding Cintamani and rising into the sky at the center. The middle area near the Musan twelve peaks, surrounded by peaks like Geumseongsan running north and south, shows a multilayered structure that maintains similarity centered on Yonghoji. This is considered an intention of the mount planned when the Musan twelve peaks were constructed, producing harmony of similar forms. The progressive realization through concealment and exposure, the enframement effect, and the spatial order akin to prospect-refuge theory in the mount of the Musan twelve peaks are considered to reflect an intention to increase the depth of the view and the sense of expectancy through the varying degrees of exposure and surroundings of each peak and the varied combination of open and blocked views. The "closed view" created by the Musan twelve peaks produces an interesting, vivid and attractive perception of the view, which is more effective in bringing depth and interest to the view in terms of the geographical design, particularly in the area around Yonghoji. Moreover, it was found that, when viewed from Yonghojeong, the combination of peaks produces a multilayer effect depending on the viewing location, revealing one island through another.

The Discourse on Girls and the Comics in the 1970s Magazine, Schoolgirl - A Forced Model and the Invented Cheerfulness (1970년대 잡지 『여학생』의 소녀 담론과 만화 -강요된 모범과 만들어진 명랑)

  • Kim, So-Won
    • Journal of Popular Narrative / v.27 no.3 / pp.13-51 / 2021
  • The aim of this essay is to shed light on the Sunjung Manhwa of the 1970s, which has been neglected in comics studies. This essay analyzes the articles and serial comics in Schoolgirl, a magazine of the 1970s, and examines the ideal representations of girls at that time. Sunjung Manhwa differs greatly between the 1960s and the 1970s. This gap cannot be explained by analyzing Sunjung Manhwa in book form alone. Even though censorship of comics hampered the development of comics as a whole, the slump of Sunjung Manhwa in the 1970s was severe compared with other comics genres. This article seeks the reasons for the changes in Sunjung Manhwa by studying the magazines, which, along with Sunjung Manhwa, were the main mass media aimed at girls. While the articles in magazines show the editorial direction and its characteristics, they also reflect the values and ideologies of the time. The same is true of the comics in the magazines. In particular, the comics in the magazines were relatively free from censorship. This essay examined how the articles and comics in a girls' magazine of the 1970s represented the images of girls at the time, focusing on the feature articles and comics in the magazine Schoolgirl. Among the full-length serial comics in Schoolgirl, this article explored Um, Hee-Ja's Blue Zone and Bang, Young-Jin's Mini March. Both Blue Zone and Mini March reveal the image of the ideal girl emphasized by the articles in Schoolgirl. Blue Zone depicts an earnest and obedient daughter, and Mini March represents a cheerful and bright girl. This study shows that the magazines of the 1970s praised girls who were obedient to the given society and served a harmonious family as ideal, and it suggests that the ideal images of girls ceaselessly characterized by the magazines served as the standard for the censorship of comics and their creativity and also had a huge impact on the content and expression of a great many works. The 1970s is a period whose importance has been lost in the history of comics studies because of the censorship of comics and the monopoly of the "Hapdong(합동) publisher." The limits on expression under censorship were very distinct, so few works of good quality resulted, and there are still many blanks in the study of 1970s comics. This study is meaningful in that it fills one of those blanks.