Close menu

SURE

Sunderland Repository records the research produced by the University of Sunderland including practice-based research and theses.

Cognitive based detection of anomalous sequences using Bayesian surprise

McGarry, Kenneth and Nelson, David (2025) Cognitive based detection of anomalous sequences using Bayesian surprise. Expert Systems. ISSN 0266-4720 (In Press)

Item Type: Article

Abstract

In this work we implement Bayesian surprise as a method to sift through sequences of discrete patterns and identify any unusual or interesting patterns that deviate from known sequences. Surprise is a biological trait inherent in humans and animals and is essential for many creative acts and efforts of discovery. Numerous technical domains are comprised of discrete elements in sequences such as e-commerce transactions, genome data searching, online financial transactions of many types, criminal cyber-attacks and life-course data from sociology. In addition to the complexity and computational burden of this type of problem is the issue of their rarity. Many anomalies are infrequent and may defy categorization therefore not suited to classification solutions. We test our methods on four discrete datasets (Hospital Sepsis patients, Chess Moves, the Wisconsin Card Sorting Task and BioFamilies) consisting of discrete sequences. Probabilistic Suffix Trees are trained on this data which maintain each discrete symbols location and position in a given sequence. The trained models are exposed to ``new'' data where any deviations from learned patterns either in location on the sequence or frequency of occurrence will denote patterns that are unusual compared with the original training data. To assist in the identification of new patterns and to avoid confusing old patterns as new or novel we use Bayesian surprise to detect the discrepancies between what we are expecting and actual results. We can assign the degree of surprise or unexpectedness to any new pattern and provides an indication of why certain patterns are deemed novel or surprising and why others are not.

[img] PDF (Author Accepted Manuscript on Publisher template/ LaTex)
EXSY-Jan-23-027-REV3.pdf
Restricted to Repository staff only until 10 August 2026.
Available under License Creative Commons Attribution Non-commercial.

Download (1MB) | Request a copy

More Information

Additional Information: "This is the peer reviewed version of the following article: [FULL CITE], which has been published in final form at [Link to final article using the DOI]. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Use of Self-Archived Versions. This article may not be enhanced, enriched or otherwise transformed into a derivative work, without express permission from Wiley or by statutory rights under applicable legislation. Copyright notices must not be removed, obscured or modified. The article must be linked to Wiley’s version of record on Wiley Online Library and any embedding, framing or otherwise making available the article or pages thereof by third parties from platforms, services and websites other than Wiley Online Library must be prohibited."
Uncontrolled Keywords: Bayesian Surprise, Probabilistic Suffix Tree, Sequences, Anomaly, interestingness measure, sequence, entropy
Depositing User: Kenneth McGarry

Identifiers

Item ID: 19259
ISSN: 0266-4720
URI: http://sure.sunderland.ac.uk/id/eprint/19259
Official URL: https://onlinelibrary.wiley.com/journal/14680394

Users with ORCIDS

ORCID for Kenneth McGarry: ORCID iD orcid.org/0000-0002-9329-9835
ORCID for David Nelson: ORCID iD orcid.org/0000-0002-0868-9100

Catalogue record

Date Deposited: 18 Jul 2025 14:13
Last Modified: 18 Jul 2025 14:13

Contributors

Author: Kenneth McGarry ORCID iD
Author: David Nelson ORCID iD

University Divisions

Faculty of Business and Technology

Subjects

Computing > Artificial Intelligence
Computing

Actions (login required)

View Item (Repository Staff Only) View Item (Repository Staff Only)