For simple grippers

  1. Obtain a Point Cloud or Depth image of the objects of interest
  2. Pre-process the data to remove extraneous information
  3. Segmentation: Cluster points that are close enough to identify individual objects
  4. Filter clusters by size, geometry or other features to down-select objects of interest
  5. Planning: Compute center-of-mass and principal axes of relevant clusters
  6. Collision-checking: Generate possible grasps and check for collisions with point clouds
  7. Execution: Physically test a grasp by monitoring jaw distances, as well as forces and torques at the wrist

Depending on the task, these steps will be easier or harder or even skippable.

For multi-fingered Hands

  • do a very similar thing but utilize random sampling to find best options