Introduction To Robot Vision: Ziv Yaniv Computer Aided Interventions and Medical Robotics, Georgetown University
Introduction To Robot Vision: Ziv Yaniv Computer Aided Interventions and Medical Robotics, Georgetown University
Introduction To Robot Vision: Ziv Yaniv Computer Aided Interventions and Medical Robotics, Georgetown University
Ziv Yaniv
Computer Aided Interventions and Medical Robotics,
Georgetown University
Vision
endoscope
webcam
C-arm X-ray
The Sensor
x
y Im
ag
e p la
pri ne
nc
iple
po
int
is l
ax tica
y
op
x z
focal point
Machine Vision
Goal:
Obtain useful information about the 3D world from 2D
images.
Model:
Regions 3D Geometry
Textures Object identification
Corners Activity detection
Lines …
images … actions
Machine Vision
Goal:
Obtain useful information about the 3D world from 2D
images.
•Low level (image processing)
• image filtering (smoothing, histogram modification…),
• feature extraction (corner detection, edge detection,…)
• stereo vision
• shape from X (shading, motion,…)
•…
• High level (machine learning/pattern recognition)
• object detection
• object recognition
• clustering
•…
Machine Vision
2. Visual Servoing.
Robot Vision
NASA stereo vision image processing, as used by the MER Mars rovers
Robot Vision
y x
z
x y y z
y y
z x x
x x
z z
z z
System Characteristics
world
Camera 1
Camera 2
2
T
1
Commercial Stereo Vision
Camera 1 Camera 2
Camera 2 Camera 2
Camera Model
• Points P, p, and O, given in the camera coordinate system, are collinear.
P = p
X Y
= f/Z , therefore x f y f
Z Z
P=[X,Y,Z]
p=[x,y,f]
y X
x u f 0 0 0
v 0 Y
z f 0 0
O
f
Z
w 0 0 1 0
1
Camera Model
Transform the pixel coordinates from the camera coordinate system to the image
coordinate system:
• Image origin (principle point) is at [x0,y0] relative to the camera coordinate
system.
• Need to change from metric units to pixels, scaling factors kx, ky.
X
y u '
x fk 0 x 0 0
[x’,y’] v' 0 fk y 0 Y
y 0 Z
x w' 0 0 1 0
principle point
1
• As our original assumption was that points are given in the camera coordinate
system, a complete projection matrix is of the form:
R RC
M 34 K 33[I 33 | 031 ] KR[I | C]
0 1 fk x s x0
K 0 fk y y0
0 0 1
C – camera origin in the
world coordinate system.
• How many degrees of freedom does M have?
•As the points are in homogenous coordinates the vectors p and MP are not
necessarily equal, they have the same direction but may differ by a non-zero scale
factor.
p MP 0
Camera Calibration
0T wi PiT yi PiT M1
wi Pi
T
0T xi PiT M 2 0
yi PiT xi PiT 0 T M 3
Am 0
xi y
• The three equations are linearly dependent: A1 i A 2 A 3
wi wi
• Each point pair contributes two equations.
• These two points define our ray in the world coordinate system.
r1 (t1 ) a1 t1n1
r2 (t 2 ) a 2 t 2n 2 a1 n1
n2
a2
T
((a 2 a1 ) n 2 ) (n1 n 2 ) ((a 2 a1 ) n1 ) T (n1 n 2 )
t1 t2 2
n1 n 2
2 n1 n 2
[r1 (t1 ) r2 (t 2 )]
2
World vs. Model
• Actual cameras most often don’t follow the ideal pin-hole model, usually exhibit
some form of distortion (barrel, pin-cushion, S).
• Machine Vision:
– “Multiple View Geometry in Computer Vision”, Hartley and Zisserman,
Cambridge University Press.
– "Machine Vision", Jain, Kasturi, Schunck, McGraw-Hill.
• Robot Vision:
– “Simultaneous Localization and Mapping: Part I”, H. Durant-Whyte, T. Bailey,
IEEE Robotics and Automation Magazine, Vol. 13(2), pp. 99-110, 2006.
– “Simultaneous Localization and Mapping (SLAM) : Part II”,T. Bailey, H. Durant-
Whyte, IEEE Robotics and Automation Magazine, Vol. 13(3), pp. 108-117, 2006.
– “Visual Servo Control Part I: Basic Approaches”, IEEE Robotics and Automation
Magazine, Vol. 13(4), 82-90, 2006.
– Visual Servo Control Part II: Advanced Approaches”, IEEE Robotics and
Automation Magazine, Vol. 14(1), 109-118, 2007.