1. INTRODUCTION
1.1 General Appearanc
This paper aims to epitomize the visualization of character networks on influences and influenced among Jungian character archetypes in developing character archetypes in film using the R statistical language and igraph package for R. In particular, this paper provides an easily reproducible tutorial using open source software for encouraging adoption of data science ibn digital humanities.
Recent years have seen the rise of the digital humanities and an increasing computational approach to analyzing literary works, owing to technological progress and its wide availability. Moreover, as the relevant fields are maturing, the digital humanities are increasingly adopting Social Networks Analysis (SNA) for gaining novel insights into literary works and films.
A character network analysis is a recently emerged research technique stemming from increased use of the SNA at the intersection of literary/film analysis, the SNA, and computational technologies. The character network analysis focusses on characters and their networks as narrative entities in novels and films borrowing methodologies of the SNA used in sociology and artificial intelligence. The SNA is defined as being a strategy rather than a formal theory, with the goal of investigating social structures through the use of network and graph theory in modern sociology [1].
For character analysis I have relied on the Jungian master archetypes discussed in Victoria Lynn Schmidt’s book 45 master characters: mythic models for creating original characters [2]. I have used the R statistical language for computational tools for analysis and visualization.
The outline of this paper is as follows. At the outset, this paper deals with a two-folded theoretical framework drawing on (1) character analysis from Jungian archetype and (2) network analysis in digital humanities. In the next section, this paper addresses data collection and the computational implementation of R scripts. Subsequently, this paper presents visualizations and discussions of analyzed character networks. Finally, this paper will conclude.
2. THEORETICAL BACKGROUND
2.1 Jungian archetype for film-character analysis
Carl Gustav Jung (1875-1961) was a Swiss psychologist known for his pioneering work and contribution to analytical psychology. Jung has three divisions in his personality theory which consists of (1) the ‘ego’ as the conscious mind, (2) the ‘personal unconscious’, not presently conscious but including hidden memories that are easily recalled, and (3)’ collective unconscious’.
Jung initially supported Sigmund Freud because of their shared interest in the unconscious. However, Freud and Jung diverged because of Jung’s disunity with Freud’s emphasis on the impacts of libido on behavior and personality. Alternately, Jung introduced ‘collective unconscious’, a term to represent a form of the unconscious consisting of the unconscious mind containing memories and impulses commonly shared by a group. It arises from the inherited structure of the collective brains distinguishing from the personal unconscious in Freudian psychoanalysis originating from the experience of the individual. Hence, ‘the collective unconscious’ is a unique feature in Jung’s theory.
The contents of Jung’s collective unconscious are defined as archetypes. According to Jung, the ‘collective unconscious’ contains numerous archetypes which is the model image of a person or role. Hence, the ‘archetype’ enables us to identify numerous character models to shape our personalities and aspirations. Jung believed that the archetypes were universal and mythic characters that inhabit the ‘collective unconscious’ of people. In Jung’s theory, the archetypes are mental fingerprints in disclosing the details of personalities and fundamental human motifs.
Jung’s ideas have not been as popular as Freud’s. However, the Jung's ideas exert significant influences on the field of practical usage. Jung’s personality topology contributed to the development of numerous personality tests. Although Jung did not attempt to deploy his theory for the quantitative measuring of personalities, Jung’s theory of psychological typology was implemented, during World War II, by two American women Isabel Briggs Myers (1897~1980) and Katharine Cook Briggs (1875~1968), who developed the Myers-Briggs Type Indicator (MBTI) based on Jung’s personality topology [3].
Furthermore, the discipline of film studies has recently seen the fledgling rise of Jungian film theory. Since the 1970s Freud-Lacanian framework has been cemented into one of the dominant tools in film studies while other psychoanalytic approaches have been marginalized. Currently, Jungian film studies is a fast-growing academic field. Film studies have recently begun to employ Jung’s concept of archetypes prototypical characters which play the role of blueprint in constructing clear-cut characters like that of protagonist, antagonist, deuteragonist, tritagonist and supporting characters in films. Bassil-Morozow and Hockley (2017) attempted to bridge the space between Jungian concepts and traditional film theory by covering a range of Jungian concepts. The authors argued that cinema can be regarded as a place where the unconscious and conscious meet [4].
In Jung’s writings, prevalent archetypes are great mother, father, child, devil, god, wise old man, wise old woman, the trickster, the hero [5]. Victoria Schmidt embraced Jung’s master characters modifying and extending them into 45 in her a monograph 45 master characters: mythic models for creating original characters (2012) that provides the interaction between protagonist characters in films [2]. She followed Jung’s theory to explore the most common male and female fictional archetypes drawing on examples of how such archetypes played their parts in novels, film and television.
In her book (2012), she extended Jung’s master characters in constructing 45 master characters which consist of main characters with the subdivisions of ‘heroes’ and ‘villains’ and supporting characters as shown in Table 1 and Table 2. In addition, she provided relations with whom an archetype is best paired in formation of an original archetype. For example, in the case of developing the ‘Aphrodite’ archetype, the following archetype characters have influences on Aphrodite as follows:
- The Woman's Man — can teach her to value herself for her mind and spirit as well as her body.
- The Messiah — can teach her how to channel her sexual energy to advance spiritually.
- The Recluse and Mystic — can teach her how to be alone without fear of abandonment and also how to know herself deep down inside.
- The Amazon — can teach her to set limits and accept discipline as a positive thing in life.
Interestingly, Schmidt’s information on pairing among archetypes in development of an archetype in films constructs data into a character network, which could save an empirical survey for this research. Hence, I have drawn on her description on pairing of characters in character-formation to conduct character network analysis.
Table 1. Main characters in Schmidt’s 45 master characters: mythic models for creating original characters (2012)
Table 2. Supporting characters in Schmidt’s 45 master characters
2.2 Network Analysis in Digital Humanities
The SNA is a study of social structures through mathematical theories like networks and graph theory. The SNA has initially emerged as one of the key analytic methods in modern sociology. Later, it also gained extensive ground to affect a wide range of academic disciplines and practical applications. Prior to the advent of computational technology and internet, the SNA relied on limited data from interviews and surveys. Currently, both researchers in humanities and computer science have embarked on analyzing networks from various literary and historical texts using computational software. While the former tends to perform qualitative studies examining the intricacies of plot structure and character interactions in literary works, computer scientists concentrate more on frequencies, statistics and algorithms [6]
The network analysis generally deploys ‘vertex’ and ‘edge’ for representing relations between actors borrowing from graphic theory in mathematics. It characterizes a social network as a social structure made up of ‘nodes’ (actors/ vertices / points) and ‘edges’ (ties / arcs / lines / links, or connections) connecting nodes within the networks. In general, the visualization of the SNA is displayed through sociograms in which nodes are represented as points or circles and edges are depicted as lines.
Arnold and Tilton (2015) presented examples of visual representation of a family tree and citation network in their recently published technical textbook, Humanities Data in R: Exploring Networks, Geospatial Data, Images, and Text (2015) as shown Fig. 1 and Fig. 2 [7].
Fig. 1. Family tree
Arnold and Tilton (2015) implemented R scripts using the igraph package to represent parent-child relation of the Royal family in forms of family trees shown in Fig. 1.
Fig. 2. A citation network of US Supreme Court cases which dealt primarily with the topic of segregation [6]
Citation network in the SNA can be divides in two: (1) academic citation networks and (2) legal citation networks. The academic citation networks contain sources and the bibliographic references in academic papers. The legal system of Common law known as judicial precedent features a body of law referencing to precedent decisions of courts and similar tribunals. Hence, the exploration of the citation network from massively accumulated legal opinions can be of benefit. Fig. 2 represents an example of visualization of a citation network built from United States Supreme Court case opinions relying on Supreme Court Citation Network Data.
This paper reviews recent works relevant to character networks in film. Tran, Quang Dieu, Dosam Hwang, and Jason J. Jung (2015) proposed a co-occurrence character network analysis for movie summarization based on discovery and analysis movie storytelling. They conducted experiments on 17 movies including the Star War series, the Lord of the Ring series and Harry Porter series with more than 2000 minutes of movies play time along with evaluation of results compared to IMDb and IMSDb database. Their experiment outperformed the conventional approaches in terms of the movie summarization rate [8].
Tran, Quang Dieu, Dosam Hwang, and Jason J. Jung (2017) coped with poor performances of content-based indexing. They considered image processing techniques for semi-automatically character-based indexing. Moreover, they created a movie ontological model is created for connecting character appearances and character’s roles in the movie. Their experimental results showed that their proposed method assists user in consuming index time and providing a method for automatic indexing, searching and browsing based on semantic queries [9].
Quang Dieu Tran, Dosam Hwang, O-Joun Lee, Jai E. Jung (2017) proposed a novel method to summarize a movie based on character network analysis and the appearance of protagonist and main characters in the movie by carrying out experiment on two movies Titanic (1997) and Frozen (2013). They showed that their method outperforms conventional approaches in terms of the movie summarization rate [10].
Finally, O-Joun Lee, Nayoung Jo, and Jason J. Jung (2018) measured similarity among movies’ stories by clustering them with the character network and the genre distribution. As to methods, they used two kinds of features: (1) proximity among movie characters and (2) genres of the movies to construct the story-based taxonomy by clustering the movies [11].
3. EXPERIMENTAL DESIGN AND METHOD
My method for extracting character network data from films depends on ‘Developing the Character Arc’ in 16 chapters of Schmidt’s 45 master characters: mythic models for creating original characters (2012). Throughout her analysis of characters in film, Schmidt provided data on relations of ‘influences’ and ‘influenced’ between main characters in developing character archetypes as shown in Table 3 (Developing the female character archetype) and Table 4 (Developing the male character archetype). In Table 3 and Table 4, ‘Female Heroes’ and ‘Male Heroes’ in the left columns of each table are influenced in their growth from characters in the column of ‘Developing the Character Archetype’. I have cited Schmidt’s description on roles of influencing characters in the right column of the Table 3 and Table 4.
Table 3. Developing the female character archetype
Table 4. Developing the male character archetype
For creating and assessing basic properties of network graphs, I have used igraph for R package. R is an open source programming language for statistical computing and graphics. R is now available for addressing tasks ranging from standard manipulation, visualization, and characterization of network data. (e.g., igraph, network, and SNA), to modeling of networks, to network topology inference.
The igraph is a library collection for covering basics of network visualization of ‘graphic theory’. It was developed in C language by Gábor Csárdi and Tamás Nepusz. (http://igraph.org/r/) [12]. Currently it is freely available for both Python and R under GNU General Public License Version 2 and is widely used in academic research in network science and related fields. I have imported the csv file into R software and igraph for R package to produce visualization with R script as follows:
install.packages("igraph")
library (igraph)
dat <- read.csv(header=TRUE,row.names=1,check.names=FALSE)m=as.matrix(dat)
net=graph.adjacency(m,mode="directed",weighted=TRUE,diag=FALSE)
plot.igraph(net,vertex.label=V(net)$name,layout=layout.fruchterman.reingold,
vertex.label.color="black",edge.color="black",edge.width=E(net)$weight/3, edge.arrow.size=0.1)
have implemented the R scripts using R version 3.4.4 and Rstudio Desktop 1.1.442 on Macbook Pro installed with macOS High Sierra. The Rstudio is a free and open-source integrated development environment (IDE) for R, a programming language for statistical computing and graphics. It is available for Windows, macOS, and Linux.
Table 5. Matrix of the character networks in films from Schmidt’s 45 master characters: mythic models for creating original characters (2012)
4. RESULTS
Fig. 3. Visualized character network in developing archetype in films
Fig. 3 displays visualization from implementation of R scripts using igraph packages. In the diagram of the networks was represented in the layout of the Fruchterman-Reingold Algorithm which is a force-directed layout algorithm. Every node signifies a Jungian archetype character in mythic forms and edges with arrow directedness denote relation from the ‘influences’ and the ‘influenced’. For example, F1 (Aphrodite) node has three influences with three edges with directed arrows going to M2, M3, and F6 and six edges directed to F1 coming from F2, F6, F7, M4, M5, and M7. F1 has influences on M3, M3, and F5 and is influenced by F2, F6, F7, M4, M5, and M7 in development of character archetype.
The results of my visualization of character network from Schmidt’s 45 characters provides more comprehensive overview of character network in film than her description in Schmidt’s book. She did not deal with impacts of characters in growing other characters with visual diagrams. Moreover, she provided information on the relations at each end of every chapter without considering character network. Hence, the character relation in her book is difficult be grasp a whole outline and even seems to be fragmentary.
My visualization of character network in developing character improves the limits of Schmidt’s book. Moreover, it is easy for scholars in humanities to reproduce and to understand tangible benefits of adopting data science. It could inspire humanities to extend topics of the SNA like frequencies of conversations, mentoring, sexual intercourses, and conflicts among characters in literary works and films.
5. CONCLUSION
I have manifested a clear visual outline of character influence-relations in making up Jungian character archetypes in films by using R computational technology. My research has contributed to the integration of Jungian analytical psychology into film studies by revealing character network relations in film. Furthermore, my research has provided an epitomized way of performing the visualization of character networks by using computational technology, and by doing so, my research galvanizes the further relevant research based on massive and empirical data.
However, my analysis of characters is limited because of the small-size data from Schmidt’s single monograph and from the cultural bias of Western culture and history. In spite of my reliance on small data, this research contributes to galvanizing the academic communities in humanities into more active adoption of data science and embarking on building up massive data.
In humanities, massive data collection might be far different from those in business activities. Scholars and students should manually build up data from texts along with critical reviews. Moreover, up to now, many scholars in humanities are still reluctant to adopt data science in their research. Against this backdrop, my research provides an easily reproducible tutorial and could be a pioneer in removing fears among scholars and students in humanities on technical barriers. If given an opportunity, I hope to perform further character network analysis using massive and empirical data and produce localized renditions of analysis from data sources outside western cultures.
References
- E. Otte and R. Rousseau, "Social network analysis: a powerful strategy, also for the information sciences," Journal of Information Science, vol. 28, issue. 6, 2002, pp. 441-453. https://doi.org/10.1177/016555150202800601
- L. Schmidt, 45 master characters: mythic models for creating original characters, Writer's Digest Books, 2012.
- I. Myers and P. Myers, Gifts differing: understanding personality type, Davies-Black Publishing, 1995.
- H. Bassil-Morozo and L. Hockley, Jungian film studies: the essential guide, Routledge, 2017.
- C. Jung, The archetypes and the collective unconsciousness (Collective Works of C. G. Jung), vol. 9, pt. 1, Princeton University Press, 1981.
- B. Chen and Y. Wang, "Character interaction network analysis of Chinese literary work‐a preliminary study," Proceedings of the Association for Information Science and Technology, 2016, pp. 1-6.
- T. Arnold and L. Tilton, Humanities data in R: exploring networks, geospatial data, images, and text, Springer, 2015.
- Q. Tran, H. Dosam, and J. Jason, "Movie summarization using characters network analysis, Computational Collective Intelligence, Springer, Cham, 2015, pp. 390-399.
- Q. Tran, H. Dosam, and J. Jason, "Character-based indexing and browsing with movie ontology," Journal of Intelligent & Fuzzy Systems, vol. 32, issue. 2, 2017, pp. 1229-1240. https://doi.org/10.3233/JIFS-169122
- Q. Tran, H. Dosam, L. O. Joun Lee, and J. Jai "Exploiting character networks for movie summarization," Multimedia Tools and Applications, vol. 97, issue. 8, 2017, pp. 10357-10369.
- L. O. Joun, J. Nayoung, and J. Jason, "Measuring character-based story similarity by analyzing movie scripts," The 1st Workshop on Narrative Extraction From Text European Conference on Information Retrieval, Grenoble, France; Mar. 2018.
- E. Kolaczyk and G. Csardi, Statistical analysis of network data with R, Springer, 2014.