• Title/Summary/Keyword: Text data

Search Result 2,953, Processing Time 0.028 seconds

Improved Statistical Language Model for Context-sensitive Spelling Error Candidates (문맥의존 철자오류 후보 생성을 위한 통계적 언어모형 개선)

  • Lee, Jung-Hun;Kim, Minho;Kwon, Hyuk-Chul
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.2
    • /
    • pp.371-381
    • /
    • 2017
  • The performance of the statistical context-sensitive spelling error correction depends on the quality and quantity of the data for statistical language model. In general, the size and quality of data in a statistical language model are proportional. However, as the amount of data increases, the processing speed becomes slower and storage space also takes up a lot. We suggest the improved statistical language model to solve this problem. And we propose an effective spelling error candidate generation method based on a new statistical language model. The proposed statistical model and the correction method based on it improve the performance of the spelling error correction and processing speed.

Association Rules and Application Study in The Digital Library

  • Yu, Jian-Kun;Zeng, Zhi-Yong;Zhang, Wen-Bin
    • Proceedings of the Korea Society for Industrial Systems Conference
    • /
    • 2007.02a
    • /
    • pp.61-71
    • /
    • 2007
  • The Association Rules is the most important method in technology of the data mining. This text further study The Association Rules, has analyzed and commented to Apriori algorithm of The Association Rules. Have realized Apriori algorithm base on Visual Basic 6.0, probe into Apriori algorithm application among the digital library, show with experimental data of application of Association Rules in borrow in the data analysis in readers finally.

  • PDF

Age and Gender in Reddit Commenting and Success

  • Finlay, S. Craig
    • Journal of Information Science Theory and Practice
    • /
    • v.2 no.3
    • /
    • pp.18-28
    • /
    • 2014
  • Reddit is a large user generated content (USG) website in which users form common interest groups and submit links to external content or text posts of user-created content. The web site operates on a voting system whereby registered users can assign positive or negative ratings to both submitted content and comments made to submitted content. While Reddit is a pseudonymous site, with users creating usernames but providing no biographical data, an informal survey posted to a large shared interest community yielded 734 responses including age and gender of users. This provided a large amount of contextual biographical data with which to analyse user profiles at the first level of Computer Mediated Discourse Analysis (CMDA), articulated by Susan Herring. The results indicate that older Reddit users both formulate more complex writing and enjoy more success when rated by other users. Gender data was incomplete and as such only tentative results could be proposed in that regard.

A Study on State Synthesis Algorithm for ICSC(InCheon Silicon Compiler) (ICSC(InCheon Silicon Compiler)를 위한 상태 합성알고리즘에 대한 연구)

  • Cho, Joong-Hwee
    • Proceedings of the KIEE Conference
    • /
    • 1988.07a
    • /
    • pp.521-524
    • /
    • 1988
  • This paper describes BSDL(Behavioral/Structural Description Language), CDTF(Control Data Text File) and state synthesizer built for use in ICSC(InCheon Silicon Compiler). BSDL describes structral and behaviral specifications of an ASIC(Application Specific IC) for digital system design. ICSC's paser generates CDTF consists of if-then-else, arithmetic and data transfer statement according to each BSDL statement. State synthesizer generates CCG(Control Constraint Graph) in consideration of execution of statement and generates VCG (Variable Constraint Graph) in consideration use of variable generation and use of variable. Also, it involves allocating algorithm operation nodes in the data path and the control path to machine states with minimum state number and as small area as possible.

  • PDF

Mobile Information Sharing System Based-on Android Mobile Platform (안드로이드 기반 모바일 정보공유시스템)

  • Bae, Sung-Ho;Kim, Woo-Saeng
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.46 no.2
    • /
    • pp.58-64
    • /
    • 2009
  • The existing note on mobile can store only text data and cannot share the data, which means that the notes stored on mobile are just seasonal or temporary memo. Therefore, this research designs an improved note on mobile and gives a chance for sharing by importing a concept of Mindmap and backup server through the internet. The mobile application is developed based on Android Platform and the server applications are developed based on Linux. These can communicate each other throughout the internet to upload and download some mindmap data.

Analysis of Success Factors for Mobile Commerce using Text Mining and PLS Regression

  • Kim, Yong-Hwan;Kim, Ja-Hee;Park, Ji hoon;Lee, Seung-Jun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.11
    • /
    • pp.127-134
    • /
    • 2016
  • In this paper, we propose factors that influence on the mobile commerce satisfaction conducted by data mining and a PLS regression analysis. We extracted the most frequent words from mobile application reviews in which there are a large number of user's requests. We employed the content analysis to condense the large number of texts. We took a survey with the categories by which data are condensed and specified as factors that influence on the mobile commerce satisfaction. To avoid multicollinearity, we employed a PLS regression analysis instead of using a multiple regression analysis. Discovered factors that are potential consequences of customer satisfaction from direct requests by customers, the result may be an appropriate indicator for the mobile commerce market to improve its services.

CALS oriented design/fabrication information system for steel bridges

  • Isohata, Hiroshi;Fukuda, Masahiko;Watanabe, Sueo
    • Steel and Composite Structures
    • /
    • v.3 no.1
    • /
    • pp.13-32
    • /
    • 2003
  • In this paper design and fabrication information system for steel bridge construction is studied and proposed according to the progress of Construction CALS/EC in the construction industry in Japan. The data exchange in this system bases on the text file as well as CAD data with simplified drawings. The concept of this system is discussed following the analysis on the issues of the conventional system. The application of the product model is also discussed including effects and issues on the inspection system. This paper is based on the study carried out by Special Committee on Construction CALS of JASBC to which author belong.

A Dataset of Online Handwritten Assamese Characters

  • Baruah, Udayan;Hazarika, Shyamanta M.
    • Journal of Information Processing Systems
    • /
    • v.11 no.3
    • /
    • pp.325-341
    • /
    • 2015
  • This paper describes the Tezpur University dataset of online handwritten Assamese characters. The online data acquisition process involves the capturing of data as the text is written on a digitizer with an electronic pen. A sensor picks up the pen-tip movements, as well as pen-up/pen-down switching. The dataset contains 8,235 isolated online handwritten Assamese characters. Preliminary results on the classification of online handwritten Assamese characters using the above dataset are presented in this paper. The use of the support vector machine classifier and the classification accuracy for three different feature vectors are explored in our research.

Automatic Conversion of Machining Data by the Feature Recognition of Press Mold (프레스 금형의 특징형상 인식에 의한 가공데이타 자동변환)

  • Choi, Hong-Tae;Bahn, Kab-Soo;Lee, Seok-Hee
    • IE interfaces
    • /
    • v.7 no.3
    • /
    • pp.181-191
    • /
    • 1994
  • This paper presents an automatic conversion of machining data from the orthographic views of press mold by feature recognition rule. The system includes following 6 modules : separation of views, function support, dimension text check and feature processing modules. The characteristic of this system is that with minimum user intervention, it recognizes basic features such as holes, slots, pockets and clamping parts and thus automatically converts CAD drawing details of press mold into machining data using 2D CAD system instead of using an expensive 3D Modeler. The system is developed by using IBM-PC in the environment of AutoCAD R12, AutoLISP and MetaWare High C. Performance of the system is verified as a good interfacing of CAD and CAM when applied to a lot of sample drawing.

  • PDF

Implementation of CAN Communication using LabVIEW (LabVIEW를 이용한 CAN 통신 구현)

  • Kim, Jueun;Choi, Nam-Sup;Han, Byung-Moon;Lee, Jun-Young
    • Proceedings of the KIPE Conference
    • /
    • 2012.07a
    • /
    • pp.441-442
    • /
    • 2012
  • LabVIEW is faster than text language based program regarding development time and can monitor the output of data fast without the separate compiling work as the graphic-based graphical programming language. And, its coding is fast because it is designed by connecting the function with the wire and its has the merit of relatively intuitive UI. In this paper, data transmission and receiving between the program that is implemented in C language as CAN communication method that is strong against noise and used in power electronics application field variously and LabVIEW based program are explained. And, the design of LabVIEW based CAN communication program, data analysis and GUI screen composition that is convenient for monitoring are shown.

  • PDF