Reinforcement Learning-Based Spectrum Management for Cognitive Radio Networks: A Literature Review and Case Study