Abstract: Deep reinforcement learning (DRL) has recently found comprehensive application in communication systems to address complex resource allocation problems. However, conventional DRL algorithms ...