DEEP REINFORCEMENT LEARNING-BASED TRAJECTORY PLANNING FOR MANIPULATOR OBSTACLE AVOIDANCE