[ACE-24] Relevant alarms detection system - Mobinets-JIRA

Details

Type: Story
Status: To Do
Priority: Normal
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels: None

Customer: NEP_3.X

Description

The objective is to use AI/Machine Learning to distinguish relevant alarms from irrelevant (transient) ones.

Attachments

Relevant Alarms Detection v1.pdf - 22/May/24 8:28 AM - 916 kB - Ahmed Osman
Relevant Alarms Detection v2.pdf - 24/May/24 12:42 PM - 1.51 MB - Ahmed Osman

Activity

Ahmed Osman added a comment - 22/May/24 8:37 AM

The first version, "Relevant Alarms Detection v1", covers data exploration and analysis of the alarms, as well as feature engineering.

We will label an alarm as irrelevant if it is cleared within a short period of time, denoted as "n". The value of "n" should ideally be chosen by a domain expert. For the purpose of this study, we will use "n = 7 minutes".
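
The labeling rule above could be sketched roughly as follows. This is only an illustration: the function name `label_alarm` and the arguments `raised_at` / `cleared_at` are hypothetical and not taken from the attached study, and the handling of alarms that never clear, or that clear exactly at "n", is an assumption.

```python
from datetime import datetime, timedelta
from typing import Optional

# Threshold "n" from the comment above; a domain expert may choose another value.
CLEAR_THRESHOLD = timedelta(minutes=7)


def label_alarm(raised_at: datetime, cleared_at: Optional[datetime],
                threshold: timedelta = CLEAR_THRESHOLD) -> str:
    """Return "irrelevant" if the alarm cleared within the threshold, else "relevant"."""
    # Assumption: an alarm that has not cleared yet is treated as relevant.
    if cleared_at is None:
        return "relevant"
    # Assumption: a clear exactly at the threshold still counts as "within" it.
    return "irrelevant" if cleared_at - raised_at <= threshold else "relevant"


raised = datetime(2024, 5, 22, 8, 0)
print(label_alarm(raised, raised + timedelta(minutes=3)))   # irrelevant
print(label_alarm(raised, raised + timedelta(minutes=45)))  # relevant
print(label_alarm(raised, None))                            # relevant
```

Keeping the threshold as a parameter makes it easy to re-label the data once a domain expert settles on a different "n".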

Ahmed Osman added a comment - 24/May/24 12:42 PM

Modeling:
Split the data into 80% training and 20% testing sets.
Trained a baseline Random Forest Classifier and evaluated its performance using precision, recall, F1-score, and accuracy.
Applied Stratified Shuffle Split so that the training and testing sets preserve the class proportions of the imbalanced data, and re-evaluated the model.
Fine-tuned the model using Bayesian optimization to improve performance.
Addressed class imbalance using SMOTE to oversample the minority class (Relevant alarms).
Re-trained and evaluated the model post-oversampling, achieving significant performance improvements.
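
For readers unfamiliar with the two imbalance-related steps above, here is a dependency-free sketch of the ideas behind a stratified 80/20 split and SMOTE-style oversampling. It is not the code used in the study, which would normally rely on library implementations. The helpers `stratified_split` and `smote`, the toy data, and the choice to oversample only the training portion are all assumptions made for illustration.

```python
import math
import random
from collections import defaultdict


def stratified_split(y, test_size=0.2, seed=0):
    """Return (train_idx, test_idx) so each class sends ~test_size of its rows to test."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, label in enumerate(y):
        by_class[label].append(i)
    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_test = max(1, round(len(indices) * test_size))
        test_idx += indices[:n_test]
        train_idx += indices[n_test:]
    return train_idx, test_idx


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points, each interpolated between a minority
    sample and one of its k nearest minority neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted((p for p in minority if p is not base),
                            key=lambda p: math.dist(base, p))[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment from base to other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


# Toy data (not from the study): 90 irrelevant (0) and 10 relevant (1) alarms,
# each described by two made-up numeric features.
data_rng = random.Random(42)
X = [[data_rng.gauss(0, 1), data_rng.gauss(0, 1)] for _ in range(90)] + \
    [[data_rng.gauss(3, 1), data_rng.gauss(3, 1)] for _ in range(10)]
y = [0] * 90 + [1] * 10

train_idx, test_idx = stratified_split(y)
X_train = [X[i] for i in train_idx]
y_train = [y[i] for i in train_idx]

# Assumption: oversample the training portion only, so the test set keeps
# the original class balance.
minority = [x for x, label in zip(X_train, y_train) if label == 1]
n_new = y_train.count(0) - len(minority)
X_train += smote(minority, n_new)
y_train += [1] * n_new

print(len(test_idx), sum(y[i] for i in test_idx))  # 20 2
print(y_train.count(0), y_train.count(1))          # 72 72
```

Oversampling after the split, rather than before it, keeps synthetic points out of the test set, which is one common way to avoid overly optimistic scores.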

Feature Importance Analysis:
Analyzed feature importance from the Random Forest model, highlighting the key contributors: Severity, Technical ID, FM Receive Time, and First Occurrence.
Documented insights on the impact of each feature on model predictions.

Results and Conclusions:
Achieved 96% on each of F1-score, recall, precision, and accuracy with the oversampled model.
Recommended future improvements, including the collection of more labeled data from domain experts to enhance model training.
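
As a reminder of how the four reported metrics relate to one another, the sketch below computes them from true and predicted labels for a single positive class. The helper `binary_metrics` and the toy labels are hypothetical; the numbers are not the study's results, and the comment does not state whether the reported 96% is per class or an average across classes.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Precision, recall, F1-score and accuracy for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)

    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = (tp + tn) / len(pairs)
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}


# Toy labels: 1 = relevant alarm, 0 = irrelevant alarm.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
print(binary_metrics(y_true, y_pred))
```

On imbalanced data, accuracy alone can look good while recall on the minority class is poor, which is why reporting all four together is useful.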

People

Assignee: Unassigned
Reporter: Ahmed Osman
Votes: 0
Watchers: 1

Dates

Due: 13/May/24
Created: 22/May/24 8:28 AM
Updated: 24/May/24 12:42 PM
Planned Start: 13/May/24 12:00 AM
Planned End: 13/May/24 12:00 AM
