Reinforcement Learning (RL)
- Core: The robot learns a policy autonomously through trial and error, guided by reward signals from the environment
- Typical algorithms:
  - DQN (value-based method)
  - PPO (policy gradient method)
  - SAC (off-policy actor-critic method that combines value and policy learning under a maximum-entropy objective)
- Applications: Robotic arm manipulation, mobile robot navigation, quadruped robot walking, drone flight
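
To make the core idea concrete, here is a minimal sketch of learning from reward signals. It uses tabular Q-learning, a simpler value-based relative of DQN, rather than any of the algorithms listed above. The one-dimensional corridor "navigation" task, the function names (`step`, `train`, `greedy_policy`), and the hyperparameter values are all assumptions made for illustration, not part of the original notes.

```python
import random

# Hypothetical toy task: a corridor of 5 cells; the robot starts in cell 0
# and receives a reward only when it reaches the goal in cell 4.
N_STATES = 5
ACTIONS = (-1, +1)                      # move left, move right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.1   # illustrative values, not tuned


def step(state, action):
    """Toy environment: returns (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]   # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability EPSILON (or on ties), else act greedily.
            if rng.random() < EPSILON or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])
            # Bootstrapped target: reward plus discounted best next value.
            target = reward + (0.0 if done else GAMMA * max(q[next_state]))
            q[state][a] += ALPHA * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best action for every non-goal cell according to the learned table."""
    return [ACTIONS[0 if row[0] > row[1] else 1] for row in q[:-1]]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

After training, the greedy policy should choose "move right" in every non-goal cell, which is the behavior the reward signal encourages. DQN replaces the table `q` with a neural network so the same idea scales to large or continuous state spaces; PPO and SAC instead optimize a parameterized policy directly.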