Integrating an incident dataset with a question and answering language model to assist hazard identification: comparison of an extractive and generative model

dc.contributor.authorRicketts, Jon
dc.contributor.authorGuo, Weisi
dc.contributor.authorPelham, Jonathan
dc.contributor.authorBarry, David
dc.date.accessioned2025-02-27T14:54:42Z
dc.date.available2025-02-27T14:54:42Z
dc.date.freetoread2025-02-27
dc.date.issued2024
dc.date.pubOnline2024-09-29
dc.description.abstractRobust hazard identification (HAZID) relies upon extensive knowledge of the system being analysed, the technical aspects, and how it will be used operationally. Typically, this knowledge is held by human participants who can draw out answers in natural language to hazard related questions based upon their own experience. However, several threats exist to this, such as high staff turnover, a poor learning from incidents capability or even insufficient Information Technology resources. Alternatively, incident databases hold vast amounts of hazard information that can be transformed into a source of knowledge. As mitigation to the aforementioned issues, this paper presents a Question and Answering (Q&A) Bidirectional Encoder Representations from Transformers (BERT) language model trained upon aviation incidents and a unique Q&A dataset. The model can extract answers to typical HAZID questions, based upon factual incident reports. Alongside this extractive approach, the paper also explores the use of a generative Large Language Model combined with an incident dataset. Both models proved a useful addition to HAZID activities based upon the Structured What If Technique (SWIFT), answering safety-themed questions based upon a retrieved context of incident reports that semantically matched the query. For the purposes of HAZID, it was suggested that the generative option is preferable based upon its ease of implementation, lower resource requirements and quality of responses. Additionally, it is shown that it is possible for organisations to train and create their own custom models for HAZID purposes. Future work may wish to consider the application of models that can hypothesize scenarios based upon incident reports, building further understanding to the relationships between causes, hazards and consequences.
dc.description.journalNameProceedings of the Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability
dc.identifier.citationRicketts J, Guo W, Pelham J, Barry D. (2024) Integrating an incident dataset with a question and answering language model to assist hazard identification: comparison of an extractive and generative model. Proceedings of the Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability, Available online 29 September 2024en_UK
dc.identifier.eissn1748-0078
dc.identifier.elementsID553955
dc.identifier.issn1748-006X
dc.identifier.issueNoahead-of-print
dc.identifier.urihttps://doi.org/10.1177/1748006x241272831
dc.identifier.urihttps://dspace.lib.cranfield.ac.uk/handle/1826/23526
dc.identifier.volumeNoahead-of-print
dc.languageEnglish
dc.language.isoen
dc.publisherSAGEen_UK
dc.publisher.urihttps://journals.sagepub.com/doi/10.1177/1748006X241272831
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectNatural language processingen_UK
dc.subjecthazard analysisen_UK
dc.subjectinformation retrievalen_UK
dc.subjectincident reportingen_UK
dc.subjectsafety analysisen_UK
dc.subject4005 Civil Engineeringen_UK
dc.subject4015 Maritime Engineeringen_UK
dc.subject40 Engineeringen_UK
dc.subject4017 Mechanical Engineeringen_UK
dc.titleIntegrating an incident dataset with a question and answering language model to assist hazard identification: comparison of an extractive and generative modelen_UK
dc.typeArticle
dc.type.subtypeJournal Article
dcterms.dateAccepted2024-07-13

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Integrating_an_incident_dataset-2024.pdf
Size:
575.38 KB
Format:
Adobe Portable Document Format
Description:
Publisher version

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.63 KB
Format:
Plain Text
Description: