• Title/Summary/Keyword: evaluation framework

Search Result 1,570, Processing Time 0.031 seconds

Evaluating Conversational AI Systems for Responsible Integration in Education: A Comprehensive Framework

  • Utkarch Mittal;Namjae Cho;Giseob Yu
    • Journal of Information Technology Applications and Management
    • /
    • v.31 no.3
    • /
    • pp.149-163
    • /
    • 2024
  • As conversational AI systems such as ChatGPT have become more advanced, researchers are exploring ways to use them in education. However, we need effective ways to evaluate these systems before allowing them to help teach students. This study proposes a detailed framework for testing conversational AI across three important criteria as follow. First, specialized benchmarks that measure skills include giving clear explanations, adapting to context during long dialogues, and maintaining a consistent teaching personality. Second, adaptive standards check whether the systems meet the ethical requirements of privacy, fairness, and transparency. These standards are regularly updated to match societal expectations. Lastly, evaluations were conducted from three perspectives: technical accuracy on test datasets, performance during simulations with groups of virtual students, and feedback from real students and teachers using the system. This framework provides a robust methodology for identifying strengths and weaknesses of conversational AI before its deployment in schools. It emphasizes assessments tailored to the critical qualities of dialogic intelligence, user-centric metrics capturing real-world impact, and ethical alignment through participatory design. Responsible innovation by AI assistants requires evidence that they can enhance accessible, engaging, and personalized education without disrupting teaching effectiveness or student agency.

Development and Application of an Evaluation Model for Ubiquitous City Project (U-City 사업평가모델 개발 및 활용방안)

  • Kim, Byoung-Gun;Kim, Jung-Hun;Lee, Choon-Seong
    • The Journal of Society for e-Business Studies
    • /
    • v.17 no.2
    • /
    • pp.87-104
    • /
    • 2012
  • Ubiquitous City is emerging as a new paradigm in future city development. U-City is nationwide project for future strategy to implement sustainable city environment and solve several issues in urban area. And as worldwide leading role on future city research, there are lots of U-City related researches in Government and Industry sector. However, it has raised unsustainable development concerns that indiscriminate promotion and visibility for long-term effects because it is not conducted an assessment. Thus, to overcome these problems and in order to develop a more stable U-City project, need to a fundamental consideration about U-City evaluation. This study is to provide the evaluation framework for Ubiquitous City(U-City). The framework is consisted of evaluation dimensions derived from characteristics of U-City development project. From this research, we expect it helps U-City development to be inspected and managed.

Enhancing the Efficiency and Reliability for M&S based Test and Evaluation System Development (M&S 기반 시험평가 장비 개발의 효율성 및 신뢰성 강화 방안)

  • Cho, Kyu-Tae;Lee, Seung-Young;Lee, Han-Min;Kim, Sae-Hwan;Jeong, Ha-Min
    • Journal of the Korea Society for Simulation
    • /
    • v.21 no.1
    • /
    • pp.89-96
    • /
    • 2012
  • Recent modeling and simulation technologies are being used in various fields, especially in the field of military simulation-based acquisition (Simulation Based Acquisition) is recognized as an essential policy. In test and evaluation phase of the SBA process, to build a simulation-based T&E(test and evaluation) environment is needed when T&E cannot be carried by real weapon system. To improve efficiency and reliability for T&E, interoperability, reusability and reliability for T&E equipments and systems are important. In this study, we propose applying simulation framework for efficienct test and applying VV&A process for reliable evaluation. We describes the characteristics of the development process, the actual test cases and the results of evaluation. Finally utilization plan and the future direction of research is described.

Domestic and Foreign Trends in the Study of the Landscape Evaluation (경관평가연구의 국내외 동향)

  • 주신하;임승빈
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.28 no.2
    • /
    • pp.49-60
    • /
    • 2000
  • The purpose of this study is to overview domestic and foreign trends in the study of the landscape evaluation through new framework of landscape evaluation studies. 108 studies on the landscape evaluation are summarized and categorized into theoretical studies, verification of theories, development of evaluation methods and applications in physical planning. Major theories in the landscape evaluation came from the psycho-physics, the evolutionary theory ann the cultural-learning theory, and were verified and applied into physical planning. Early experimental researches on landscape evaluation, based on psycho-physics, were focused on relatively simple responses to landscapes. But many studies have been gradually related to the evolutionary theory and the cultural learning theory, emphasizing biological and cultural effects on landscape evaluation. Especially, Appleton's Prospect-Refuge theory' and Kaplans' 'Information Processing model' have very strong influence in landscape evaluation. Relatively there have been many application researches in Korea, which tells there have been strong needs to solve pending practical problems caused by the rapid economic and social growth for several decades. Almost of applications in physical planning are focused on physical features of landscapes, but for more comprehensive landscape evaluation, many other factors such as cognitive and sociocultural variables should be integrated into the whole evaluation system. As a result of reviewing of landscape evaluation studies, I found the overall domestic and foreign trends and the necessity of more research on the applications in physical planning. Because this study mainly focused on academic researches, for more appropriate landscape evaluation and management there should be more practical researches including various approaches.

  • PDF

A Study on How to Set up a Standard Framework for AI Ethics and Regulation (AI 윤리와 규제에 관한 표준 프레임워크 설정 방안 연구)

  • Nam, Mun-Hee
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.4
    • /
    • pp.7-15
    • /
    • 2022
  • With the aim of an intelligent world in the age of individual customization through decentralization of information and technology, sharing/opening, and connection, we often see a tendency to cross expectations and concerns in the technological discourse and interest in artificial intelligence more than ever. Recently, it is easy to find claims by futurists that AI singularity will appear before and after 2045. Now, as part of preparations to create a paradigm of coexistence that coexists and prosper with AI in the coming age of artificial intelligence, a standard framework for setting up more correct AI ethics and regulations is required. This is because excluding the risk of omission of setting major guidelines and methods for evaluating reasonable and more reasonable guideline items and evaluation standards are increasingly becoming major research issues. In order to solve these research problems and at the same time to develop continuous experiences and learning effects on AI ethics and regulation setting, we collect guideline data on AI ethics and regulation of international organizations / countries / companies, and research and suggest ways to set up a standard framework (SF: Standard Framework) through a setting research model and text mining exploratory analysis. The results of this study can be contributed as basic prior research data for more advanced AI ethics and regulatory guidelines item setting and evaluation methods in the future.

Client Profile Framework for Providing Adapted Content to Context (상황에 적응화된 콘텐츠 제공을 위한 클라이언트 프로파일 프레임워크)

  • Kim, Kyung-Sik;Lee, Jae-Dong
    • The KIPS Transactions:PartC
    • /
    • v.14C no.3 s.113
    • /
    • pp.293-304
    • /
    • 2007
  • In this paper, a client-side framework for processing of the profile that is necessary for providing adapted content to user's context in the client is designed and implemented. The profile must be constituted context information and various user's information for providing the adapted content to user's context. The client device also provides functionalities such as the creation, the management, and the transmission of the profile. The profile which is used in the proposed profile framework consists of various related information of a user for content adaptation. The technology such as creation, transmission and manage of the profile for effective processing is proposed and apply this technologies to client profile framework during the design are applied. As the result of evaluation, techniques of the proposed framework for processing profiles is more effective than previous techniques.

A Study on Comparison of Development Productivity of Spring Framework 2.0 and 2.5 with Lightweight Container Architecture (동일한 경량 컨테이너 구조 환경에서 스프링 프레임워크 2.0과 2.5의 개발 생산성 비교 연구)

  • Lee, Myeong-Ho
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.6
    • /
    • pp.1265-1274
    • /
    • 2009
  • This paper proposes an object-oriented software development guidance and an evaluation index for the productivity related to Spring Framework 2.0 and 2.5. Spring Framework is a known successful open source standard model for lightweight container architecture. However, there is no comparison research about the performance of Spring Framework 2.0 and 2.5 with same identical platform. Quantitative analysis is supported as a part of LoC(Line of Code) analysis. There is a limit to develop the updated software with no the specific evaluating index for the productivity of the software. This work proposes an specific index for evaluating the productivity of new version Spring Framework on a platform. Base on the result, the specific guidance of the developing software is obtained.

Open API Software Framework for Information Processing of RCS-e Presence Feature (RCS-e 프레즌스 정보 처리를 위한 오픈 API 소프트웨어 프레임워크)

  • Lee, Dongcheul
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.16 no.5
    • /
    • pp.77-82
    • /
    • 2016
  • Web developers have had difficulties in using Rich Communication Service-e(RCS-e) on their applications because of complicated protocols and closed interfaces. In order to vitalize the use of RCS-e, we need a RCS Application Program Interface(API) which has simple protocols and can be accessed easily. This paper presents the web-based Open API Framework for the RCS-e presence feature. A system architecture for the framework is defined. Call flows for the presence feature between the framework and other nodes are defined. Also, one of the call flows is illustrated to explain how to convert web-based requests to RCS-e requests. Finally, performance evaluation proves that the framework does not add any loads to the existing network infrastructure.

Evaluation of marginal and internal gaps in single and three-unit metal frameworks made by micro-stereolithography

  • Kim, Dong-Yeon;Lee, Ha-Na;Kim, Ji-Hwan;Kim, Hae-Young;Kim, Woong-Chul
    • The Journal of Advanced Prosthodontics
    • /
    • v.9 no.4
    • /
    • pp.239-243
    • /
    • 2017
  • PURPOSE. The purpose of this study is to compare single and three-unit metal frameworks that are produced by micro-stereolithography. MATERIALS AND METHODS. Silicone impressions of a selected molar and a premolar were used to make master abutments that were scanned into a stereolithography file. The file was processed with computer aided design software to create single and three-unit designs from which resin frameworks were created using micro-stereolithography. These resin frameworks were subjected to investment, burnout, and casting to fabricate single and three-unit metal ones that were measured under a digital microscope by using the silicone replica technique. The measurements were verified by means of the Mann-Whitney U test (${\alpha}=.05$). RESULTS. The marginal gap was $101.9{\pm}53.4{\mu}m$ for SM group and $104.3{\pm}62.9{\mu}m$ for TUM group. The measurement of non-pontics in a single metal framework was $93.6{\pm}43.9{\mu}m$, and that of non-pontics in a three-unit metal framework was $64.9{\pm}46.5{\mu}m$. The dimension of pontics in a single metal framework was $110.2{\pm}61.4{\mu}m$, and that of pontics in a three-unit metal framework was $143.7{\pm}51.8{\mu}m$. CONCLUSION. The marginal gap was smaller for the single metal framework than for the three-unit one, which requires further improvement before it can be used for clinical purposes.

A Study on Comparison of Development Productivity of Hibernate 3.2 and iBatis 2.3 Based Lightweight Container Architecture (경량 컨테이너 구조 환경에서 하이버네이트 3.2와 아이바티스 2.3의 개발 생산성 비교 연구)

  • Lee, Myeong-Ho
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.12 no.4
    • /
    • pp.1919-1926
    • /
    • 2011
  • This paper proposes an object-oriented software development guidance and an evaluation index for the productivity related to Hibernate 3.2 and iBatis 2.3 in same platform of Spring framework 2.5. Currently in production until the lightweight container architecture, known most commonly used architecture framework is Spring framework. Also intended to increase the productivity of database techniques are ORM. Hibernate and iBatis is an ORM tool is currently being used. In this study, Spring framework 2.5 is based on the framework of the same Hibernate 3.2 and iBatis 2.3 to design and implement the pilot system. In addition, comparison and standardization of software development productivity assessment is to provide guidance.