[ACE-24] Relevant alarms detection system Created: 22/May/24  Updated: 24/May/24  Due: 13/May/24

Status: To Do
Project: ACE
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Story Priority: Normal
Reporter: Ahmed Osman Assignee: Unassigned
Resolution: Unresolved Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Attachments: PDF File Relevant Alarms Detection v1.pdf     PDF File Relevant Alarms Detection v2.pdf    
Customer:
NEP_3.X
Planned Start:
Planned End:

 Description   

The objective is to use AI/Machine Learning to detect the relevant alarms from the irrelevant (transient) ones.



 Comments   
Comment by Ahmed Osman [ 22/May/24 ]

The first version "Relevant Alarms Detection v1" contains the data exploration and analysis of the alarms, and feature engineering.

We will label an alarm as irrelevant if it is cleared within a short period of time, denoted as "n". The value of "n" should ideally be chosen by a domain expert. For the purpose of this study, we will use "n = 7 minutes".

Comment by Ahmed Osman [ 24/May/24 ]
  • Modeling:
  • Split the data into 80% training and 20% testing sets.
  • Trained a baseline Random Forest Classifier and evaluated its performance using precision, recall, f1-score, and accuracy metrics.
  • Applied Stratified Shuffle Split to handle class imbalance and re-evaluated the model.
  • Fine-tuned the model using Bayesian optimization to improve performance.
  • Addressed class imbalance using SMOTE to oversample the minority class (Relevant alarms).
  • Re-trained and evaluated the model post-oversampling, achieving significant performance improvements.
  • Feature Importance Analysis:
  • Analyzed feature importance from the Random Forest model, highlighting the key contributors: Severity, Technical ID, FM Receive Time, and First Occurrence.
  • Documented insights on the impact of each feature on model predictions.
  • Results and Conclusions:
  • Achieved an F1-score, recall, precision, and accuracy of 96% with the oversampled model.
  • Recommended future improvements, including the collection of more labeled data from domain experts to enhance model training.
Generated at Mon Jun 15 14:23:05 EEST 2026 using JIRA 6.1.4#6159-sha1:44eaedef2e4a625c6c7183698b2468d4719c20dc.