Explanation-Aware Experience Replay in Rule-Dense Environments