• Title/Summary/Keyword: 데이터 유사도

Search Result 3,346, Processing Time 0.027 seconds

Spawning patterns of three bitterling fish species (Pisces: Acheilognathinae) in host mussels and the first report of their spawning in Asian clam(Corbicula fluminae) from Korea (납자루아과(Pisces: Acheilognathinae) 어류 3종의 숙주조개에 대한 산란양상 및 재첩(Corbicula fluminae) 내 산란 국내 최초 보고)

  • Jin Kyu Seo;Hee-kyu Choi;Hyuk Je Lee
    • Korean Journal of Environmental Biology
    • /
    • v.41 no.3
    • /
    • pp.229-246
    • /
    • 2023
  • The bitterling (Cyprinidae, Acheilongnathinae) is a temperate freshwater fish with a unique spawning symbiosis with host mussels. Female bitterlings use their extended ovipositors to lay eggs on the gills of mussels through the mussel's exhalant siphon. In the present study, in April of 2020, we investigated spawning frequencies and patterns of three bitterling fish species in host mussel species in the Nakdong River basin (Hoecheon). During field surveys, a total of four bitterling and three mussel species were found. We observed bitterling's spawning eggs/larvae in the three mussel species: Anodonta arcaeformis(proportion spawned: 45.5%), Corbicula fluminea(12.1%), and Nodularia douglasiae (45.2%). The number of bitterlings' eggs/larvae per mussel ranged from 1 to 58. Using our developed genetic markers, we identified the eggs/larvae of each bitterling species in each mussel species (except for A. macropterus): A. arcaeformis (spawned by Acheilognathus yamatsutae), C. fluminea (A. yamatsutae and Tanakia latimarginata), and N. douglasiae (A. yamatsutae, Rhodeus uyekii, and T. latimarginata). Approximately 57.6% of N. douglasiae mussel individuals had eggs/larvae of more than one bitterling species, suggesting that interspecific competition for occupying spawning grounds is intense. This is the first report on bitterling's spawning events in the Asian clam C. fluminea from Korea; however, it should be ascertained whether bitterling's embryo undergoes successful development inside the small mussel and leaves as a free-swimming juvenile. In addition, the importance of its conservation as a new host mussel species for bitterling fishes needs to be studied further.

Korean Sentence Generation Using Phoneme-Level LSTM Language Model (한국어 음소 단위 LSTM 언어모델을 이용한 문장 생성)

  • Ahn, SungMahn;Chung, Yeojin;Lee, Jaejoon;Yang, Jiheon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.71-88
    • /
    • 2017
  • Language models were originally developed for speech recognition and language processing. Using a set of example sentences, a language model predicts the next word or character based on sequential input data. N-gram models have been widely used but this model cannot model the correlation between the input units efficiently since it is a probabilistic model which are based on the frequency of each unit in the training set. Recently, as the deep learning algorithm has been developed, a recurrent neural network (RNN) model and a long short-term memory (LSTM) model have been widely used for the neural language model (Ahn, 2016; Kim et al., 2016; Lee et al., 2016). These models can reflect dependency between the objects that are entered sequentially into the model (Gers and Schmidhuber, 2001; Mikolov et al., 2010; Sundermeyer et al., 2012). In order to learning the neural language model, texts need to be decomposed into words or morphemes. Since, however, a training set of sentences includes a huge number of words or morphemes in general, the size of dictionary is very large and so it increases model complexity. In addition, word-level or morpheme-level models are able to generate vocabularies only which are contained in the training set. Furthermore, with highly morphological languages such as Turkish, Hungarian, Russian, Finnish or Korean, morpheme analyzers have more chance to cause errors in decomposition process (Lankinen et al., 2016). Therefore, this paper proposes a phoneme-level language model for Korean language based on LSTM models. A phoneme such as a vowel or a consonant is the smallest unit that comprises Korean texts. We construct the language model using three or four LSTM layers. Each model was trained using Stochastic Gradient Algorithm and more advanced optimization algorithms such as Adagrad, RMSprop, Adadelta, Adam, Adamax, and Nadam. Simulation study was done with Old Testament texts using a deep learning package Keras based the Theano. After pre-processing the texts, the dataset included 74 of unique characters including vowels, consonants, and punctuation marks. Then we constructed an input vector with 20 consecutive characters and an output with a following 21st character. Finally, total 1,023,411 sets of input-output vectors were included in the dataset and we divided them into training, validation, testsets with proportion 70:15:15. All the simulation were conducted on a system equipped with an Intel Xeon CPU (16 cores) and a NVIDIA GeForce GTX 1080 GPU. We compared the loss function evaluated for the validation set, the perplexity evaluated for the test set, and the time to be taken for training each model. As a result, all the optimization algorithms but the stochastic gradient algorithm showed similar validation loss and perplexity, which are clearly superior to those of the stochastic gradient algorithm. The stochastic gradient algorithm took the longest time to be trained for both 3- and 4-LSTM models. On average, the 4-LSTM layer model took 69% longer training time than the 3-LSTM layer model. However, the validation loss and perplexity were not improved significantly or became even worse for specific conditions. On the other hand, when comparing the automatically generated sentences, the 4-LSTM layer model tended to generate the sentences which are closer to the natural language than the 3-LSTM model. Although there were slight differences in the completeness of the generated sentences between the models, the sentence generation performance was quite satisfactory in any simulation conditions: they generated only legitimate Korean letters and the use of postposition and the conjugation of verbs were almost perfect in the sense of grammar. The results of this study are expected to be widely used for the processing of Korean language in the field of language processing and speech recognition, which are the basis of artificial intelligence systems.

The Effect of Brand Extension of Private Label on Consumer Attitude - a focus on the moderating effect of the perceived fit difference between parent brands and an extended brand - (PL의 브랜드확장이 소비자태도에 미치는 영향에 관한 연구 : 모브랜드 적합도 인식 차이의 조절효과를 중심으로)

  • Kim, Jong-Keun;Kim, Hyang-Mi;Lee, Jong-Ho
    • Journal of Distribution Research
    • /
    • v.16 no.4
    • /
    • pp.1-27
    • /
    • 2011
  • Introduction: Sales of private labels(PU have been growing m recent years. Globally, PLs have already achieved 20% share, although between 25 and 50% share in most of the European markets(AC. Nielson, 2005). These products are aimed to have comparable quality and prices as national brand(NB) products and have been continuously eroding manufacturer's national brand market share. Stores have also started introducing premium PLs that are of higher-quality and more reasonably priced compared to NBs. Worldwide, many retailers already have a multiple-tier private label architecture. Consumers as a consequence are now able to have a more diverse brand choice in store than ever before. Since premium PLs are priced higher than regular PLs and even, in some cases, above NBs, stores can expect to generate higher profits. Brand extensions and private label have been extensively studied in the marketing field. However, less attention has been paid to the private label extension. Therefore, this research focuses on private label extension using the Multi-Attribute Attitude Model(Fishbein and Ajzen, 1975). Especially there are few studies that consider the hierarchical effect of the PL's two parent brands: store brand and the original PL. We assume that the attitude toward each of the two parent brands affects the attitude towards the extended PL. The influence from each parent brand toward extended PL will vary according to the perceived fit between each parent brand and the extended PL. This research focuses on how these two parent brands act as reference points to one another in the consumers' choice consideration. Specifically we seek to understand how store image and attitude towards original PL affect consumer perceptions of extended premium PL. How consumers perceive extended premium PLs could provide strategic suggestions for retailer managers with specific suggestions on whether it is more effective: to position extended premium PL similarly or dissimilarly to original PL especially on the quality dimension and congruency with store image. There is an extensive body of research on branding and brand extensions (e.g. Aaker and Keller, 1990) and more recently on PLs(e.g. Kumar and Steenkamp, 2007). However there are no studies to date that look at the upgrading and influence of original PLs and attitude towards store on the premium PL extension. This research wishes to make a contribution to this gap using the perceived fit difference between parent brands and extended premium PL as the context. In order to meet the above objectives, we investigate which factors heighten consumers' positive attitude toward premium PL extension. Research Model and Hypotheses: When considering the attitude towards the premium PL extension, we expect four factors to have an influence: attitude towards store; attitude towards original PL; perceived congruity between the store image and the premium PL; perceived similarity between the original PL and the premium PL. We expect that all these factors have an influence on consumer attitude towards premium PL extension. Figure 1 gives the research model and hypotheses. Method: Data were collected by an intercept survey conducted on consumers at discount stores. 403 survey responses were attained (total 59.8% female, across all age ranges). Respondents were asked to respond to a series of Questions measured on 7 point likert-type scales. The survey consisted of Questions that measured: the trust towards store and the original PL; the satisfaction towards store and the original PL; the attitudes towards store, the original PL, and the extended premium PL; the perceived similarity of the original PL and the extended premium PL; the perceived congruity between the store image and the extended premium PL. Product images with specific explanations of the features of premium PL, regular PL and NB we reused as the stimuli for the Question response. We developed scales to measure the research constructs. Cronbach's alphaw as measured each construct with the reliability for all constructs exceeding the .70 standard(Nunnally, 1978). Results: To test the hypotheses, path analysis was conducted using LISREL 8.30. The path analysis for verification of the model produced satisfactory results. The validity index shows acceptable results(${\chi}^2=427.00$(P=0.00), GFI= .90, AGFI= .87, NFI= .91, RMSEA= .062, RMR= .047). With the increasing retailer use of premium PLBs, the intention of this research was to examine how consumers use original PL and store image as reference points as to the attitude towards premium PL extension. Results(see table 1 & 2) show that the attitude of each parent brand (attitudes toward store and original pL) influences the attitude towards extended PL and their perceived fit moderates these influences. Attitude toward the extended PL was influenced by the relative level of perceived fit. Discussion of results and future direction: These results suggest that the future strategy for the PL extension needs to consider that positive parent brand attitude is more strongly associated with the attitude toward PL extensions. Specifically, to improve attitude towards PL extension, building and maintaining positive attitude towards original PL is necessary. Positioning premium PL congruently to store image is also important for positive attitude. In order to improve this research, the following alternatives should also be considered. To improve the research model's predictive power, more diverse products should be included in study. Other attributes of product should also be included such as design, brand name since we only considered trust and satisfaction as factors to build consumer attitudes.

  • PDF

Change Acceptable In-Depth Searching in LOD Cloud for Efficient Knowledge Expansion (효과적인 지식확장을 위한 LOD 클라우드에서의 변화수용적 심층검색)

  • Kim, Kwangmin;Sohn, Yonglak
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.171-193
    • /
    • 2018
  • LOD(Linked Open Data) cloud is a practical implementation of semantic web. We suggested a new method that provides identity links conveniently in LOD cloud. It also allows changes in LOD to be reflected to searching results without any omissions. LOD provides detail descriptions of entities to public in RDF triple form. RDF triple is composed of subject, predicates, and objects and presents detail description for an entity. Links in LOD cloud, named identity links, are realized by asserting entities of different RDF triples to be identical. Currently, the identity link is provided with creating a link triple explicitly in which associates its subject and object with source and target entities. Link triples are appended to LOD. With identity links, a knowledge achieves from an LOD can be expanded with different knowledge from different LODs. The goal of LOD cloud is providing opportunity of knowledge expansion to users. Appending link triples to LOD, however, has serious difficulties in discovering identity links between entities one by one notwithstanding the enormous scale of LOD. Newly added entities cannot be reflected to searching results until identity links heading for them are serialized and published to LOD cloud. Instead of creating enormous identity links, we propose LOD to prepare its own link policy. The link policy specifies a set of target LODs to link and constraints necessary to discover identity links to entities on target LODs. On searching, it becomes possible to access newly added entities and reflect them to searching results without any omissions by referencing the link policies. Link policy specifies a set of predicate pairs for discovering identity between associated entities in source and target LODs. For the link policy specification, we have suggested a set of vocabularies that conform to RDFS and OWL. Identity between entities is evaluated in accordance with a similarity of the source and the target entities' objects which have been associated with the predicates' pair in the link policy. We implemented a system "Change Acceptable In-Depth Searching System(CAIDS)". With CAIDS, user's searching request starts from depth_0 LOD, i.e. surface searching. Referencing the link policies of LODs, CAIDS proceeds in-depth searching, next LODs of next depths. To supplement identity links derived from the link policies, CAIDS uses explicit link triples as well. Following the identity links, CAIDS's in-depth searching progresses. Content of an entity obtained from depth_0 LOD expands with the contents of entities of other LODs which have been discovered to be identical to depth_0 LOD entity. Expanding content of depth_0 LOD entity without user's cognition of such other LODs is the implementation of knowledge expansion. It is the goal of LOD cloud. The more identity links in LOD cloud, the wider content expansions in LOD cloud. We have suggested a new way to create identity links abundantly and supply them to LOD cloud. Experiments on CAIDS performed against DBpedia LODs of Korea, France, Italy, Spain, and Portugal. They present that CAIDS provides appropriate expansion ratio and inclusion ratio as long as degree of similarity between source and target objects is 0.8 ~ 0.9. Expansion ratio, for each depth, depicts the ratio of the entities discovered at the depth to the entities of depth_0 LOD. For each depth, inclusion ratio illustrates the ratio of the entities discovered only with explicit links to the entities discovered only with link policies. In cases of similarity degrees with under 0.8, expansion becomes excessive and thus contents become distorted. Similarity degree of 0.8 ~ 0.9 provides appropriate amount of RDF triples searched as well. Experiments have evaluated confidence degree of contents which have been expanded in accordance with in-depth searching. Confidence degree of content is directly coupled with identity ratio of an entity, which means the degree of identity to the entity of depth_0 LOD. Identity ratio of an entity is obtained by multiplying source LOD's confidence and source entity's identity ratio. By tracing the identity links in advance, LOD's confidence is evaluated in accordance with the amount of identity links incoming to the entities in the LOD. While evaluating the identity ratio, concept of identity agreement, which means that multiple identity links head to a common entity, has been considered. With the identity agreement concept, experimental results present that identity ratio decreases as depth deepens, but rebounds as the depth deepens more. For each entity, as the number of identity links increases, identity ratio rebounds early and reaches at 1 finally. We found out that more than 8 identity links for each entity would lead users to give their confidence to the contents expanded. Link policy based in-depth searching method, we proposed, is expected to contribute to abundant identity links provisions to LOD cloud.

Derivation of Digital Music's Ranking Change Through Time Series Clustering (시계열 군집분석을 통한 디지털 음원의 순위 변화 패턴 분류)

  • Yoo, In-Jin;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.171-191
    • /
    • 2020
  • This study focused on digital music, which is the most valuable cultural asset in the modern society and occupies a particularly important position in the flow of the Korean Wave. Digital music was collected based on the "Gaon Chart," a well-established music chart in Korea. Through this, the changes in the ranking of the music that entered the chart for 73 weeks were collected. Afterwards, patterns with similar characteristics were derived through time series cluster analysis. Then, a descriptive analysis was performed on the notable features of each pattern. The research process suggested by this study is as follows. First, in the data collection process, time series data was collected to check the ranking change of digital music. Subsequently, in the data processing stage, the collected data was matched with the rankings over time, and the music title and artist name were processed. Each analysis is then sequentially performed in two stages consisting of exploratory analysis and explanatory analysis. First, the data collection period was limited to the period before 'the music bulk buying phenomenon', a reliability issue related to music ranking in Korea. Specifically, it is 73 weeks starting from December 31, 2017 to January 06, 2018 as the first week, and from May 19, 2019 to May 25, 2019. And the analysis targets were limited to digital music released in Korea. In particular, digital music was collected based on the "Gaon Chart", a well-known music chart in Korea. Unlike private music charts that are being serviced in Korea, Gaon Charts are charts approved by government agencies and have basic reliability. Therefore, it can be considered that it has more public confidence than the ranking information provided by other services. The contents of the collected data are as follows. Data on the period and ranking, the name of the music, the name of the artist, the name of the album, the Gaon index, the production company, and the distribution company were collected for the music that entered the top 100 on the music chart within the collection period. Through data collection, 7,300 music, which were included in the top 100 on the music chart, were identified for a total of 73 weeks. On the other hand, in the case of digital music, since the cases included in the music chart for more than two weeks are frequent, the duplication of music is removed through the pre-processing process. For duplicate music, the number and location of the duplicated music were checked through the duplicate check function, and then deleted to form data for analysis. Through this, a list of 742 unique music for analysis among the 7,300-music data in advance was secured. A total of 742 songs were secured through previous data collection and pre-processing. In addition, a total of 16 patterns were derived through time series cluster analysis on the ranking change. Based on the patterns derived after that, two representative patterns were identified: 'Steady Seller' and 'One-Hit Wonder'. Furthermore, the two patterns were subdivided into five patterns in consideration of the survival period of the music and the music ranking. The important characteristics of each pattern are as follows. First, the artist's superstar effect and bandwagon effect were strong in the one-hit wonder-type pattern. Therefore, when consumers choose a digital music, they are strongly influenced by the superstar effect and the bandwagon effect. Second, through the Steady Seller pattern, we confirmed the music that have been chosen by consumers for a very long time. In addition, we checked the patterns of the most selected music through consumer needs. Contrary to popular belief, the steady seller: mid-term pattern, not the one-hit wonder pattern, received the most choices from consumers. Particularly noteworthy is that the 'Climbing the Chart' phenomenon, which is contrary to the existing pattern, was confirmed through the steady-seller pattern. This study focuses on the change in the ranking of music over time, a field that has been relatively alienated centering on digital music. In addition, a new approach to music research was attempted by subdividing the pattern of ranking change rather than predicting the success and ranking of music.

A Study on the Characteristics of Enterprise R&D Capabilities Using Data Mining (데이터마이닝을 활용한 기업 R&D역량 특성에 관한 탐색 연구)

  • Kim, Sang-Gook;Lim, Jung-Sun;Park, Wan
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.1-21
    • /
    • 2021
  • As the global business environment changes, uncertainties in technology development and market needs increase, and competition among companies intensifies, interests and demands for R&D activities of individual companies are increasing. In order to cope with these environmental changes, R&D companies are strengthening R&D investment as one of the means to enhance the qualitative competitiveness of R&D while paying more attention to facility investment. As a result, facilities or R&D investment elements are inevitably a burden for R&D companies to bear future uncertainties. It is true that the management strategy of increasing investment in R&D as a means of enhancing R&D capability is highly uncertain in terms of corporate performance. In this study, the structural factors that influence the R&D capabilities of companies are explored in terms of technology management capabilities, R&D capabilities, and corporate classification attributes by utilizing data mining techniques, and the characteristics these individual factors present according to the level of R&D capabilities are analyzed. This study also showed cluster analysis and experimental results based on evidence data for all domestic R&D companies, and is expected to provide important implications for corporate management strategies to enhance R&D capabilities of individual companies. For each of the three viewpoints, detailed evaluation indexes were composed of 7, 2, and 4, respectively, to quantitatively measure individual levels in the corresponding area. In the case of technology management capability and R&D capability, the sub-item evaluation indexes that are being used by current domestic technology evaluation agencies were referenced, and the final detailed evaluation index was newly constructed in consideration of whether data could be obtained quantitatively. In the case of corporate classification attributes, the most basic corporate classification profile information is considered. In particular, in order to grasp the homogeneity of the R&D competency level, a comprehensive score for each company was given using detailed evaluation indicators of technology management capability and R&D capability, and the competency level was classified into five grades and compared with the cluster analysis results. In order to give the meaning according to the comparative evaluation between the analyzed cluster and the competency level grade, the clusters with high and low trends in R&D competency level were searched for each cluster. Afterwards, characteristics according to detailed evaluation indicators were analyzed in the cluster. Through this method of conducting research, two groups with high R&D competency and one with low level of R&D competency were analyzed, and the remaining two clusters were similar with almost high incidence. As a result, in this study, individual characteristics according to detailed evaluation indexes were analyzed for two clusters with high competency level and one cluster with low competency level. The implications of the results of this study are that the faster the replacement cycle of professional managers who can effectively respond to changes in technology and market demand, the more likely they will contribute to enhancing R&D capabilities. In the case of a private company, it is necessary to increase the intensity of input of R&D capabilities by enhancing the sense of belonging of R&D personnel to the company through conversion to a corporate company, and to provide the accuracy of responsibility and authority through the organization of the team unit. Since the number of technical commercialization achievements and technology certifications are occurring both in the case of contributing to capacity improvement and in case of not, it was confirmed that there is a limit in reviewing it as an important factor for enhancing R&D capacity from the perspective of management. Lastly, the experience of utility model filing was identified as a factor that has an important influence on R&D capability, and it was confirmed the need to provide motivation to encourage utility model filings in order to enhance R&D capability. As such, the results of this study are expected to provide important implications for corporate management strategies to enhance individual companies' R&D capabilities.