Basic Data Collection Methods (10 Types)

  1. Manual entry
  2. Sensor collection
  3. Web crawlers
  4. Database export
  5. Log collection
  6. API calls
  7. File import
  8. Image/video collection
  9. Voice collection
  10. RFID/NFC

Teleoperation

Game Controllers

  • Xbox / PlayStation

Professional Equipment

  • 3Dconnexion
  • Force Dimension

VR Devices

  • HTC Vive
  • Manus VR

Stanford ALOHA: Uses a leader-follower setup in which the operator moves two low-cost leader arms and the follower arms mirror their joint positions, enabling fine bimanual manipulation.

Simulation Collection

Uses simulators such as Unity or Gazebo, together with their physics engines, to generate training data.

OpenAI Dactyl: Trained on roughly 100 years of simulated experience, using domain randomization so that the learned policy transfers to the physical hand.
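
As a rough illustration of domain randomization, the sketch below draws a fresh set of simulator parameters for each episode. The parameter names and ranges are hypothetical, chosen only for the example; they are not taken from Dactyl or from any particular simulator's API.

```python
import random
from dataclasses import dataclass


@dataclass
class EpisodeParams:
    friction: float
    object_mass_kg: float
    light_intensity: float


# Hypothetical ranges, for illustration only.
RANGES = {
    "friction": (0.5, 1.5),
    "object_mass_kg": (0.05, 0.5),
    "light_intensity": (0.3, 1.0),
}


def sample_episode_params(rng: random.Random) -> EpisodeParams:
    """Draw one set of simulator parameters uniformly from RANGES."""
    values = {name: rng.uniform(lo, hi) for name, (lo, hi) in RANGES.items()}
    return EpisodeParams(**values)


# A seeded generator keeps the randomized episodes reproducible.
rng = random.Random(0)
episodes = [sample_episode_params(rng) for _ in range(5)]
```

In a real pipeline each sampled parameter set would be applied to the simulator before the episode is reset, so that no two episodes share exactly the same physics or appearance.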

Human Demonstration

Wearable devices (IMU, force sensors).

Industrial hand guiding (lead-through teaching): The operator physically moves the robot arm while its poses are recorded at a sampling rate of 100-1000 Hz.
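
Fixed-rate pose recording can be sketched as a loop scheduled against absolute deadlines. This is a minimal sketch: `read_pose` is a hypothetical callback standing in for whatever the robot driver exposes, and the 6-value pose layout is an assumption.

```python
import time
from typing import Callable, List, Tuple

Pose = Tuple[float, ...]  # e.g. six joint angles; the layout is an assumption


def record_poses(
    read_pose: Callable[[], Pose], rate_hz: float, n_samples: int
) -> List[Tuple[float, Pose]]:
    """Call read_pose() n_samples times at rate_hz; return (elapsed_s, pose) pairs."""
    period = 1.0 / rate_hz
    start = time.monotonic()
    samples = []
    for i in range(n_samples):
        # Sleep until the i-th absolute deadline so timing error does not accumulate.
        delay = start + i * period - time.monotonic()
        if delay > 0:
            time.sleep(delay)
        samples.append((time.monotonic() - start, read_pose()))
    return samples


def fake_read_pose() -> Pose:
    """Stand-in for a real driver call; always returns a zero pose."""
    return (0.0, 0.0, 0.0, 0.0, 0.0, 0.0)


trajectory = record_poses(fake_read_pose, rate_hz=100.0, n_samples=20)
```

A loop like this may hold 100 Hz on a desktop OS, but rates near 1000 Hz usually call for a real-time controller or driver-side buffering, since `time.sleep` gives no hard timing guarantee.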

Internet Data Utilization

Scraping data from YouTube, forums, and social media for multimodal AI training.

Data Collection Guidelines

Task Complexity    Sample Size
Simple tasks       50-200 demos
Medium tasks       500-2000 demos
Complex tasks      5000+ demos

Key: Diversity (environment, objects, operations) is more important than quantity.
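
One way to make the diversity guideline measurable is to count how many distinct (environment, object, operation) combinations a dataset covers. The sketch below assumes each demonstration is tagged with those three labels. The tag names and the report fields are illustrative, not a standard metric.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

Demo = Tuple[str, str, str]  # (environment, object, operation)


def diversity_report(demos: Iterable[Demo]) -> Dict[str, float]:
    """Summarize how widely a set of demonstrations covers its tag space."""
    demos = list(demos)
    combos = Counter(demos)
    return {
        "total_demos": len(demos),
        "unique_combinations": len(combos),
        "environments": len({d[0] for d in demos}),
        "objects": len({d[1] for d in demos}),
        "operations": len({d[2] for d in demos}),
        # Share of the dataset taken up by the single most common combination.
        "largest_combo_share": max(combos.values()) / len(demos) if demos else 0.0,
    }


demos = [
    ("kitchen", "mug", "pick"),
    ("kitchen", "mug", "pick"),
    ("kitchen", "mug", "pick"),
    ("lab", "block", "stack"),
]
report = diversity_report(demos)
```

Here three of the four demonstrations repeat a single combination, so `largest_combo_share` is 0.75. A high share like this suggests that further demos of that combination will add less than demos from a new environment or object.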