Dual self-attention is what you need for model drift detection in 6G networks