• Title/Summary/Keyword: Linguistic Text Analysis

Search Result 75, Processing Time 0.027 seconds

Analysis of Plant Species in Elementary School Textbooks in South Korea

  • Kwon, Min Hyeong
    • Journal of People, Plants, and Environment
    • /
    • v.24 no.5
    • /
    • pp.485-498
    • /
    • 2021
  • Background and objective: This study was conducted to find out the status of plant utilization in the current textbooks by analyzing the plants by grade and subject in the national textbooks for all elementary school grades in the 2015 revised curriculum in Korea. Methods: The data collected was analyzed using Microsoft Office Excel to obtain the frequency and ratio of collected plant data and SPSS for Windows 26.0 to determine learning content areas by grade and the R program was used to visualize the learning content areas. Results: A total of 232 species of plants were presented 1,047 times in the national textbooks. Based on an analysis of the plants presented by grade, the species that continued to increase in the lower grades tended to decrease in the fifth and sixth grades, the upper grades of elementary school. As for the number and frequency of plant species by subject, Korean Language had the highest number and frequency of plant species. The types of presentation of plants in textbooks were mainly text, followed by illustrations and photos of plants, which were largely used in first grade textbooks. In addition, as for the area of learning contents in which plants are used, in the lower grades, plants were used in the linguistic domain, and in the upper grades, in the botanical and environmental domains of the natural sciences. Herbaceous plants were presented more than woody plants, and according to an analysis of the plants based on the classification of crops, horticultural crops were presented the most, followed by food crops. Out of horticultural crops, flowering plants were found the most diversity with 63 species, but the plants that appeared most frequently were fruit trees that are commonly encountered in real life. Conclusion: As a result of this study, various plant species were included in elementary school textbooks, but most of them were horticultural crops encountered in real life depending on their use. Nevertheless, plant species with high frequency have continued a similar trend of frequency from the previous curriculums. Therefore, in the next curriculum, plant learning materials should be reflected according to social changes and students' preference for plants.

Privacy-Preserving Language Model Fine-Tuning Using Offsite Tuning (프라이버시 보호를 위한 오프사이트 튜닝 기반 언어모델 미세 조정 방법론)

  • Jinmyung Jeong;Namgyu Kim
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.4
    • /
    • pp.165-184
    • /
    • 2023
  • Recently, Deep learning analysis of unstructured text data using language models, such as Google's BERT and OpenAI's GPT has shown remarkable results in various applications. Most language models are used to learn generalized linguistic information from pre-training data and then update their weights for downstream tasks through a fine-tuning process. However, some concerns have been raised that privacy may be violated in the process of using these language models, i.e., data privacy may be violated when data owner provides large amounts of data to the model owner to perform fine-tuning of the language model. Conversely, when the model owner discloses the entire model to the data owner, the structure and weights of the model are disclosed, which may violate the privacy of the model. The concept of offsite tuning has been recently proposed to perform fine-tuning of language models while protecting privacy in such situations. But the study has a limitation that it does not provide a concrete way to apply the proposed methodology to text classification models. In this study, we propose a concrete method to apply offsite tuning with an additional classifier to protect the privacy of the model and data when performing multi-classification fine-tuning on Korean documents. To evaluate the performance of the proposed methodology, we conducted experiments on about 200,000 Korean documents from five major fields, ICT, electrical, electronic, mechanical, and medical, provided by AIHub, and found that the proposed plug-in model outperforms the zero-shot model and the offsite model in terms of classification accuracy.

A Study of the Design Characteristics of the Police Uniform As A Visual Language - Focused on the U.S., England, Italy, France and Korea - (시각언어로서의 교통경찰관복의 디자인특성 연구 - 미국, 영국, 이태리, 프랑스, 한국을 중심으로 -)

  • Lee, Jung-Won;Geum, Key-Sook
    • Journal of the Korean Society of Costume
    • /
    • v.58 no.7
    • /
    • pp.13-30
    • /
    • 2008
  • Visual language is 'a form of communication without text'. Visual language is one of the strongest methods to spread knowledge. Uniforms could be interpreted as a symbolic language that establishes order in this complicated modern society by placing identity and responsibility on each members of various different organizations. In light of the above, the purpose of this research paper will be to analyze police uniforms of U.S.A, Great Britain, Italy, France and Korea as a form of visual language and interpreting them in terms of visual design in order to understand the fundamental ideas behind the designs and the effective applications thereof. Upon analysis of traffic police uniforms of each individual county mentioned above by separating each uniform's distinctive design, pattern, color, material and decoration based on visual factor, three characteristics of authority, dynamic functionality and friendliness were derived from comparing and analyzing each country's distinctive uniform design. The traditional unique role of police in our society was to maintain social order as their nature inherently possesses characteristic of authority and preservation, but has since undergone transition in many countries to appeal to the broader public by incorporating friendliness and dynamic functionality. Analyzing police uniforms in terms of visual linguistic sense requires a much more profound process of understanding beyond simple interpretation of configurative shapes. In conclusion, the true purpose of uniforms is to include and portray images of mankind's desire toward expressing ideas like 'mankind's bias toward existence beyond theirselves and the exercise of force through authority' and materializing such ideas into a physical form.

Study on Fashion Illustration as Viewed from the Allegorical - Based on the theory of Craig Owens - (알레고리 관점의 패션 일러스트레이션에 관한 연구 - 크렉 오웬스의 이론을 중심으로 -)

  • Kim, Mi-Hyun
    • Journal of the Korean Society of Costume
    • /
    • v.62 no.4
    • /
    • pp.81-90
    • /
    • 2012
  • The contents of this study are as follows. First, an academic understanding has been achieved by exploring the theoretical concept "allegory", and a new theoretical approached methodology has been sought. Second, an analysis-index of fashion illustration cases has been suggested based on the allegory theory of Craig Owens. Third, in order to draw the characteristics of fashion illustration as viewed from the allegorical viewpoint and find out its feasibility, the case studies has been referred and the internal significance, external significance that combines different characteristics has been extracted. In regards to this study method, literature studies and case studies has been done in parallel with each other. This study was done in the following sequence: the establishment of the study system, the drawing of the allegory-associated concepts and the discovering the characteristics of aesthetic expressions. The results of this study on the expression characteristics of fashion illustration as viewed from the allegorical viewpoint of Craig Owens are as follows. First, the borrowing of image, which is a characteristic of allegory, contains the meaning of uncertainty in the fashion illustration as it expresses the image-synthesis and forms a completely different meaning as the fixed meaning is dissolved and it is utilized as a photo-montage technique. Second, the inference of pictogram is the mixture of linguistic medium and visual medium. Fashion illustration utilizes the characters and transmits the fashion information visually and immanently. It has the characteristic of making the information into pictograms and the internal significances of mutual-text with communication function. Third, the uniqueness of location in the fashion illustration has the special nature of utilized mediums as it is used for advertising or publicizing. The fashion illustration from the viewpoint of allegory has the impermanency of existing only for a limited time and reflects the coincidence that gives the meaning of utilized location according to the season trend. Fourth, the cross-breeding is expressed as the mixture of various materials in the fashion illustration. The expressions made by the mixture of media, such as the use of computer graphic programs mixed together with various materials showed the trend of diversity and genre dissolution.

The Stream of Uncertainty in Scientific Knowledge using Topic Modeling (토픽 모델링 기반 과학적 지식의 불확실성의 흐름에 관한 연구)

  • Heo, Go Eun
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.1
    • /
    • pp.191-213
    • /
    • 2019
  • The process of obtaining scientific knowledge is conducted through research. Researchers deal with the uncertainty of science and establish certainty of scientific knowledge. In other words, in order to obtain scientific knowledge, uncertainty is an essential step that must be performed. The existing studies were predominantly performed through a hedging study of linguistic approaches and constructed corpus with uncertainty word manually in computational linguistics. They have only been able to identify characteristics of uncertainty in a particular research field based on the simple frequency. Therefore, in this study, we examine pattern of scientific knowledge based on uncertainty word according to the passage of time in biomedical literature where biomedical claims in sentences play an important role. For this purpose, biomedical propositions are analyzed based on semantic predications provided by UMLS and DMR topic modeling which is useful method to identify patterns in disciplines is applied to understand the trend of entity based topic with uncertainty. As time goes by, the development of research has been confirmed that uncertainty in scientific knowledge is moving toward a decreasing pattern.

A Study on the Oral Characteristics in Personal Narrative Storytelling (체험 이야기하기의 구술적 특성에 대하여)

  • Kim, Kyung-Seop
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.143-150
    • /
    • 2022
  • The folk language that lives and breathes in modern works does not just come from old stories, but it is a personal narrative which is based on the experiences of the narrator. Like many genres in oral literature, most of these personal narratives occur from the impulse of communicating and reinventing rather than from the impulse of creating. Compared to traditional folktales, stories about an individual's experiences, such as personal narratives are often performed by adding the individual tendencies of the narrator. In so doing, the phenomenon of "processing the experience by estimating it and reinterpreting the memories roughly" occurs, and this is a significant factor in making the oral literature. However, the question that arises here is: How can we deal with these significant elements that are inevitably captured when performed orally? Text linguistics, the main methodology of this paper, implies the possibility of expressing the impromptu elements of oral literature. Also, textual linguistic analysis of personal narratives provides the possibility of discussing oral characteristics from various angles which have been difficult to analyze, such as on-site atmosphere, speaker mistakes, contradictions in stories, and audience reactions. Hence, it is possible to effectively discuss oral-poetics in oral literature which are based on the one-off of 'words', the 'roughness' of the on-site atmosphere, and the stackability of the 'wisdom of crowds'. Furthermore, it is expected to contribute to the study of personal narrative storytelling that plays an important part in Veabal art in community culture.

A Study on Analysis of Research Data Repository in Humanities and Social Sciences (re3data를 기반으로 한 인문사회 RDR 연구)

  • Cho, Jane;Park, Jong-Do
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.30 no.2
    • /
    • pp.69-87
    • /
    • 2019
  • As the discussions on sharing research data prevail by the chance of the inauguration of the International Open Data Charter, research support organizations in the United States, the United Kingdom, and Japan are encouraging researchers to deposit their findings in a credible repository. Humanities and social sciences field, in which research data sharing culture and storage infrastructure are immature compared to life science and natural science, also needs to establish and operate a reliable storage infrastructure to guarantee the continuous access and utilization of data. This study analyzed the overall operational status of 305 subject repositories registered in re3data for the humanities and social sciences and clustered them according to the operational level using 5 indicators. As a result, 70% of the population were identified as universal clusters, and 20% of the excellent cluster was found to have the largest number of linguistic fields and the German-operated. In addition, this study confirmed through correspondence analysis that there is a relation between the sub-theme fields of humanities and social sciences and the types of data to be archived. The history and art domians are related to images, and social studies are related to statistical data. Linguistics has also been analyzed to be related to audio, plain text, and code.

Target Word Selection Disambiguation using Untagged Text Data in English-Korean Machine Translation (영한 기계 번역에서 미가공 텍스트 데이터를 이용한 대역어 선택 중의성 해소)

  • Kim Yu-Seop;Chang Jeong-Ho
    • The KIPS Transactions:PartB
    • /
    • v.11B no.6
    • /
    • pp.749-758
    • /
    • 2004
  • In this paper, we propose a new method utilizing only raw corpus without additional human effort for disambiguation of target word selection in English-Korean machine translation. We use two data-driven techniques; one is the Latent Semantic Analysis(LSA) and the other the Probabilistic Latent Semantic Analysis(PLSA). These two techniques can represent complex semantic structures in given contexts like text passages. We construct linguistic semantic knowledge by using the two techniques and use the knowledge for target word selection in English-Korean machine translation. For target word selection, we utilize a grammatical relationship stored in a dictionary. We use k- nearest neighbor learning algorithm for the resolution of data sparseness Problem in target word selection and estimate the distance between instances based on these models. In experiments, we use TREC data of AP news for construction of latent semantic space and Wail Street Journal corpus for evaluation of target word selection. Through the Latent Semantic Analysis methods, the accuracy of target word selection has improved over 10% and PLSA has showed better accuracy than LSA method. finally we have showed the relatedness between the accuracy and two important factors ; one is dimensionality of latent space and k value of k-NT learning by using correlation calculation.

Study of Rhetorical Puns in Korean Comic Strips in Daily Newspaper (한국 신문만화의 언어유희적 기법 연구)

  • Kim, Eul-Ho
    • Cartoon and Animation Studies
    • /
    • s.10
    • /
    • pp.1-16
    • /
    • 2006
  • This thesis aims to recall the importance of language in comics by studying comic strips in Korean daily newspapers: the comic strips are analyzed for rhetorical puns in its language text as they representatively show the value and role of language in comics. Moreover, Korean comic strips, as they developed into current affairs comics, acquired a stronger media characteristic of communicating information compared to other genres of cartoons. As a result, comics strips have become a genre where language plays an important role and the words needing to be able to convey the meaning quickly and implicitly. Due to tight control of national authority, the language technique developed into an indirect expression rather than a stronger direct imaging technique. The political oppression of the comic strip paradoxically brought on the rhetorical development in the creative techniques. Based on this analysis, the writer studied the rhetorical puns of the texts Korean comic strips by implementing the classification techniques of rhetoric expressions. As a result, through quotes and analysis of actual comic strips, the writer confirmed that Korean comic strips do actually show tremendously vast rhetorical puns in its language application techniques. The writer was also able to conclude that the rhetorical puns in comics were the force entertaining and impressing the readers, and also acting as the creative principle. Concluding this study, the writer emphasizes that language, not only in comic strips, is a combination of words and images and is also an important factor in all cartoons in general. Thus the thesis proposes that the training of humanistic thoughts and linguistic sensitivity are as important as learning to draw in the creation of cartoons.

  • PDF

An Semiotic analysis on Spirited Away (애니메이션(센과 치히로의 행방불명)에 대한 기호학적분석)

  • Lee Yun-Hui
    • Broadcasting and Media Magazine
    • /
    • v.10 no.1
    • /
    • pp.99-112
    • /
    • 2005
  • Christian Metz, the precursor of cine-semiology, considered cinema as a language in the sense that it is a set of messages grounded in a given matter of expression, and a signifying practice characterized by specific codifications. According to Metz, film forms a structured network produced by the interweaving of cinematic codes, within which cinematic subcodes represent specific usages of the particular code. For Metz, cinematic language is a totality of cinematic codes and subcodes, and history of the cinema is the trace of the competition, incorporations and exclusions of the subcodes. He also suggested a filmic text is not just a list of codes in effect, but a process of constant displacement and deformation of codes. Following Metz' textual analysis methodology, I investigated the formal configuration of Hayao Miyazaki‘s animation, Spirited Away. It is interesting to trace the interweaving of cinematic codes in Spirited Away, i.e. codes of lighting, color, movement, and auteurism, across the animation. I focused on the first scene at the bridge to Yubaba's bathhouse, analyzing each cinematic code and its subcode applied. The first bridge scene is carefully constructed to stand out the confrontation of Chihiro (with Haku) and the bathhouse. The bathhouse is not just a building, it represents the powerful witch, Yubaba, yet to appear on the scene, and functions as an antipode to Chihiro. In each shot, every subcode within the codes of framing, direction, angle, color, lighting and movement is used to maximize the contrast between the dominant bathhouse and the feeble 10-year-old girl. In Spirited Away, the subcodes within each cinematic ode are constantly competing and displacing each other to augment the antithesis between the characters and develop the narrative. As Metz's argument that film constitutes a quasi-linguistic practice as a pluricodic medium, Spirited Away communicates with the spectators with the combination and displacement of these cinematic codes and subcodes.