Reinforcement Learning (RL)

  • Core: Robots autonomously learn strategies through environmental reward signals
  • Typical algorithms:
    • DQN (value function method)
    • PPO (policy gradient method)
    • SAC (hybrid method)
  • Applications: Robotic arm manipulation, mobile robot navigation, quadruped robot walking, drone flight

Imitation Learning (IL)