From Turing Test to Chinese Room Argument: how to apply artificial intelligence in aviation

Date published

2025

Free to read from

2025-07-03

Supervisor/s

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier

Department

Course name

Type

Article

ISSN

2352-1465

Format

Citation

Saunders D, Li W-C, Wang T. (2025) From Turing Test to Chinese Room Argument: how to apply artificial intelligence in aviation. Transportation Research Procedia, Volume 88, 2025, pp. 270-277

Abstract

The emergence of artificial intelligence (AI) with advanced large language model (LLM) offers promising approaches for enhancing the capacity of textual analysis. Both the Turing Test and Chinese Room Argument explore AI’s understanding of human language, although both methodologies have dissimilar interpretation on AI’s ‘intelligence’. Current AI systems have demonstrated the capacity for achieving defined test goals for ‘intelligence’. The aviation industry is increasingly interested in adopting AI to improve efficiency, safety, and cost efficiency; with the Generative Pre-trained Transformers’ (GPT) capability to reduce resource-intensive analytics in accident causation classification. This study investigates the potential and challenges of using AI to analyze human factors involved in aviation accidents based on the Human Factors Analysis and Classification System (HFACS). Six subject-matter experts in aviation human factors and AI domain participated in this research. All participants were familiar with the HFACS framework to analyze aviation accident reports and the output of GPT which were based on the prompt engineering developed by the research team. This research creates a framework to perform its initial generation and training using GE 235 accident investigation report from Taiwan Transportation Safety Board (TTSB). Initial discoveries demonstrated that the AI model could populate the sub-dimensions of Level 1 HFACS framework with moderate accuracy, although there remains a high presence of hallucinations in generated outputs, with a lack of reproducibility in consecutive outputs with consistent input data. There are still different opinions on AI applications in real-world operations with ethics and safety concerns. While there is clear potential for GPT models to supplement accident analysis within the HFACS framework, there is still more work to integrate HFACS framework into GPT modelling for effective generation to accident data.

Description

European Association for Aviation Psychology Conference EAAP 35

Software Description

Software Language

Github

Keywords

3505 Human Resources and Industrial Relations, 35 Commerce, Management, Tourism and Services, Networking and Information Technology R&D (NITRD), Machine Learning and Artificial Intelligence, 3509 Transportation, logistics and supply chains, Accident Analysis, Artifical Intelligence, Chinese Room Argument, Human Factors Analysis, Classification System, Large Language Model

DOI

Rights

Attribution 4.0 International

Funder/s

Relationships

Relationships

Resources