Machine learning / Final project

Azure VM Criticality Prediction

A full ML pipeline for predicting whether a new Azure VM request is critical using configuration, CPU behavior, and tenant history.

Interactive ML demo

Azure Criticality Console

Gradient boosting baseline

0.488

Precision

Recall

Sequential tree boosting gave the strongest ranking quality across thresholds, making it the clean baseline for threshold tradeoffs.

Evaluation split Test

Tree ensemble Threshold 0.488

Decision Critical if score clears threshold

Best AUC model in the final report; useful when threshold can be tuned for the scheduling policy.

Top signals LightGBM validation permutation importance

}

The modeling task uses request-time features while the criticality label depends on later VM behavior.

The pipeline builds request-level tables, tenant history features, time-based splits, and model notebooks for classical and neural methods.

The repository documents a 70+ column request-level dataset and multiple model families for evaluating critical VM prediction.