logging in or signing up CS223B L9 StructureFromMotion craig Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINTLite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 329 Category: News & Reports.. License: All Rights Reserved Like it (0) Dislike it (0) Added: October 03, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript Stanford CS223B Computer Vision, Winter 2007Lecture 8 Structure From Motion: Stanford CS223B Computer Vision, Winter 2007 Lecture 8 Structure From Motion Professors Sebastian Thrun and Jana Košecká CAs: Vaibhav Vaish and David Stavens Slide credit: Gary Bradski, Stanford SAIL Summary SFM: Summary SFM Problem Determine feature locations (=structure) Determine camera extrinsic (=motion) Two Principal Solutions Bundle adjustment (nonlinear least squares, local minima) SVD (through orthographic approximation, affine geometry) Correspondence (RANSAC) Expectation Maximization Structure From Motion: Structure From Motion Recover: structure (feature locations), motion (camera extrinsics)SFM = Holy Grail of 3D Reconstruction: SFM = Holy Grail of 3D Reconstruction Take movie of object Reconstruct 3D model Would be commercially highly viable live.comStructure From Motion (1): Structure From Motion (1) [Tomasi & Kanade 92]Structure From Motion (2): Structure From Motion (2) [Tomasi & Kanade 92]Structure From Motion (3): Structure From Motion (3) [Tomasi & Kanade 92]Structure From Motion (4a): Images: Structure From Motion (4a): Images Marc PollefeysStructure From Motion (4b): Structure From Motion (4b) Marc PollefeysStructure From Motion (5): Structure From Motion (5) http://www.cs.unc.edu/Research/urbanscapeStructure From Motion: Structure From Motion Problem 1: Given n points pij =(xij, yij) in m images Reconstruct structure: 3-D locations Pj =(xj, yj, zj) Reconstruct camera positions (extrinsics) Mi=(Aj, bj) Problem 2: Establish correspondence: c(pij)Structure From Motion: Structure From Motion Recover: structure (feature locations), motion (camera extrinsics)Recovery Problems: Recovery ProblemsSFM: General Formulation: SFM: General FormulationSFM: Bundle Adjustment: SFM: Bundle AdjustmentBundle Adjustment: Bundle Adjustment SFM = Nonlinear Least Squares problem Minimize through Gradient Descent Conjugate Gradient Gauss-Newton Levenberg Marquardt common method Prone to local minimaCount # Constraints vs #Unknowns: Count # Constraints vs #Unknowns m camera poses n points 2mn point constraints 6m+3n unknowns Suggests: need 2mn 6m + 3n But: Can we really recover all parameters??? How Many Parameters Can’t We Recover?: How Many Parameters Can’t We Recover? Place Your Bet! We can recover all but… m = #camera poses n = # feature pointsCount # Constraints vs #Unknowns: Count # Constraints vs #Unknowns m camera poses n points 2mn point constraints 6m+3n unknowns Suggests: need 2mn 6m + 3n But: Can we really recover all parameters??? Can’t recover origin, orientation (6 params) Can’t recover scale (1 param) Thus, we need 2mn 6m + 3n - 7Are we done?: Are we done? No, bundle adjustment has many local minima.The “Trick Of The Day”: The “Trick Of The Day” Replace Perspective by Orthographic Geometry Replace Euclidean Geometry by Affine Geometry Solve SFM linearly via PCA (“closed” form, globally optimal) Post-Process to make solution Euclidean Post-Process to make solution perspective By Tomasi and Kanade, 1992 Orthographic Camera Model: Orthographic Camera Model Orthographic = Limit of Pinhole Model: Orthographic Projection: Orthographic Projection Limit of Pinhole Model: Orthographic ProjectionThe Orthographic SFM Problem: The Orthographic SFM Problem subject toThe Affine SFM Problem: The Affine SFM Problem subject toCount # Constraints vs #Unknowns: Count # Constraints vs #Unknowns m camera poses n points 2mn point constraints 8m+3n unknowns Suggests: need 2mn 8m + 3n But: Can we really recover all parameters???How Many Parameters Can’t We Recover?: How Many Parameters Can’t We Recover? Place Your Bet! We can recover all but…The Answer is (at least): 12: The Answer is (at least): 12Points for Solving Affine SFM Problem: Points for Solving Affine SFM Problem m camera poses n points Need to have: 2mn 8m + 3n-12Affine SFM: Affine SFMThe Rank Theorem: The Rank Theorem n elements 2m elementsSingular Value Decomposition: Singular Value DecompositionAffine Solution to Orthographic SFM: Affine Solution to Orthographic SFM Gives also the optimal affine reconstruction under noiseBack To Orthographic Projection: Back To Orthographic Projection Find C for which constraints are met Search in 9-dim space (instead of 8m + 3n-12)Back To Projective Geometry: Back To Projective Geometry Orthographic (in the limit) ProjectiveBack To Projective Geometry: Back To Projective Geometry Optimize Using orthographic solution as starting pointThe “Trick Of The Day”: The “Trick Of The Day” Replace Perspective by Orthographic Geometry Replace Euclidean Geometry by Affine Geometry Solve SFM linearly via PCA (“closed” form, globally optimal) Post-Process to make solution Euclidean Post-Process to make solution perspective By Tomasi and Kanade, 1992 Structure From Motion: Structure From Motion Problem 1: Given n points pij =(xij, yij) in m images Reconstruct structure: 3-D locations Pj =(xj, yj, zj) Reconstruct camera positions (extrinsics) Mi=(Aj, bj) Problem 2: Establish correspondence: c(pij)The Correspondence Problem: The Correspondence Problem View 1 View 3 View 2Correspondence: Solution 1: Correspondence: Solution 1 Track features (e.g., optical flow) …but fails when images taken from widely different posesCorrespondence: Solution 2: Correspondence: Solution 2 Start with random solution A, b, P Compute soft correspondence: p(c|A,b,P) Plug soft correspondence into SFM Reiterate See Dellaert/Seitz/Thorpe/Thrun, Machine Learning Journal, 2003Example: ExampleResults: Cube: Results: Cube Animation: AnimationTomasi’s Benchmark Problem: Tomasi’s Benchmark ProblemReconstruction with EM: Reconstruction with EM3-D Structure: 3-D StructureCorrespondence: Alternative Approach: Correspondence: Alternative Approach Ransac [Fisher/Bolles] = Random sampling and consensus Will be discussed Wednesday Summary SFM: Summary SFM Problem Determine feature locations (=structure) Determine camera extrinsic (=motion) Two Principal Solutions Bundle adjustment (nonlinear least squares, local minima) SVD (through orthographic approximation, affine geometry) Correspondence (RANSAC) Expectation Maximization You do not have the permission to view this presentation. In order to view it, please contact the author of the presentation.
CS223B L9 StructureFromMotion craig Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINTLite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 329 Category: News & Reports.. License: All Rights Reserved Like it (0) Dislike it (0) Added: October 03, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript Stanford CS223B Computer Vision, Winter 2007Lecture 8 Structure From Motion: Stanford CS223B Computer Vision, Winter 2007 Lecture 8 Structure From Motion Professors Sebastian Thrun and Jana Košecká CAs: Vaibhav Vaish and David Stavens Slide credit: Gary Bradski, Stanford SAIL Summary SFM: Summary SFM Problem Determine feature locations (=structure) Determine camera extrinsic (=motion) Two Principal Solutions Bundle adjustment (nonlinear least squares, local minima) SVD (through orthographic approximation, affine geometry) Correspondence (RANSAC) Expectation Maximization Structure From Motion: Structure From Motion Recover: structure (feature locations), motion (camera extrinsics)SFM = Holy Grail of 3D Reconstruction: SFM = Holy Grail of 3D Reconstruction Take movie of object Reconstruct 3D model Would be commercially highly viable live.comStructure From Motion (1): Structure From Motion (1) [Tomasi & Kanade 92]Structure From Motion (2): Structure From Motion (2) [Tomasi & Kanade 92]Structure From Motion (3): Structure From Motion (3) [Tomasi & Kanade 92]Structure From Motion (4a): Images: Structure From Motion (4a): Images Marc PollefeysStructure From Motion (4b): Structure From Motion (4b) Marc PollefeysStructure From Motion (5): Structure From Motion (5) http://www.cs.unc.edu/Research/urbanscapeStructure From Motion: Structure From Motion Problem 1: Given n points pij =(xij, yij) in m images Reconstruct structure: 3-D locations Pj =(xj, yj, zj) Reconstruct camera positions (extrinsics) Mi=(Aj, bj) Problem 2: Establish correspondence: c(pij)Structure From Motion: Structure From Motion Recover: structure (feature locations), motion (camera extrinsics)Recovery Problems: Recovery ProblemsSFM: General Formulation: SFM: General FormulationSFM: Bundle Adjustment: SFM: Bundle AdjustmentBundle Adjustment: Bundle Adjustment SFM = Nonlinear Least Squares problem Minimize through Gradient Descent Conjugate Gradient Gauss-Newton Levenberg Marquardt common method Prone to local minimaCount # Constraints vs #Unknowns: Count # Constraints vs #Unknowns m camera poses n points 2mn point constraints 6m+3n unknowns Suggests: need 2mn 6m + 3n But: Can we really recover all parameters??? How Many Parameters Can’t We Recover?: How Many Parameters Can’t We Recover? Place Your Bet! We can recover all but… m = #camera poses n = # feature pointsCount # Constraints vs #Unknowns: Count # Constraints vs #Unknowns m camera poses n points 2mn point constraints 6m+3n unknowns Suggests: need 2mn 6m + 3n But: Can we really recover all parameters??? Can’t recover origin, orientation (6 params) Can’t recover scale (1 param) Thus, we need 2mn 6m + 3n - 7Are we done?: Are we done? No, bundle adjustment has many local minima.The “Trick Of The Day”: The “Trick Of The Day” Replace Perspective by Orthographic Geometry Replace Euclidean Geometry by Affine Geometry Solve SFM linearly via PCA (“closed” form, globally optimal) Post-Process to make solution Euclidean Post-Process to make solution perspective By Tomasi and Kanade, 1992 Orthographic Camera Model: Orthographic Camera Model Orthographic = Limit of Pinhole Model: Orthographic Projection: Orthographic Projection Limit of Pinhole Model: Orthographic ProjectionThe Orthographic SFM Problem: The Orthographic SFM Problem subject toThe Affine SFM Problem: The Affine SFM Problem subject toCount # Constraints vs #Unknowns: Count # Constraints vs #Unknowns m camera poses n points 2mn point constraints 8m+3n unknowns Suggests: need 2mn 8m + 3n But: Can we really recover all parameters???How Many Parameters Can’t We Recover?: How Many Parameters Can’t We Recover? Place Your Bet! We can recover all but…The Answer is (at least): 12: The Answer is (at least): 12Points for Solving Affine SFM Problem: Points for Solving Affine SFM Problem m camera poses n points Need to have: 2mn 8m + 3n-12Affine SFM: Affine SFMThe Rank Theorem: The Rank Theorem n elements 2m elementsSingular Value Decomposition: Singular Value DecompositionAffine Solution to Orthographic SFM: Affine Solution to Orthographic SFM Gives also the optimal affine reconstruction under noiseBack To Orthographic Projection: Back To Orthographic Projection Find C for which constraints are met Search in 9-dim space (instead of 8m + 3n-12)Back To Projective Geometry: Back To Projective Geometry Orthographic (in the limit) ProjectiveBack To Projective Geometry: Back To Projective Geometry Optimize Using orthographic solution as starting pointThe “Trick Of The Day”: The “Trick Of The Day” Replace Perspective by Orthographic Geometry Replace Euclidean Geometry by Affine Geometry Solve SFM linearly via PCA (“closed” form, globally optimal) Post-Process to make solution Euclidean Post-Process to make solution perspective By Tomasi and Kanade, 1992 Structure From Motion: Structure From Motion Problem 1: Given n points pij =(xij, yij) in m images Reconstruct structure: 3-D locations Pj =(xj, yj, zj) Reconstruct camera positions (extrinsics) Mi=(Aj, bj) Problem 2: Establish correspondence: c(pij)The Correspondence Problem: The Correspondence Problem View 1 View 3 View 2Correspondence: Solution 1: Correspondence: Solution 1 Track features (e.g., optical flow) …but fails when images taken from widely different posesCorrespondence: Solution 2: Correspondence: Solution 2 Start with random solution A, b, P Compute soft correspondence: p(c|A,b,P) Plug soft correspondence into SFM Reiterate See Dellaert/Seitz/Thorpe/Thrun, Machine Learning Journal, 2003Example: ExampleResults: Cube: Results: Cube Animation: AnimationTomasi’s Benchmark Problem: Tomasi’s Benchmark ProblemReconstruction with EM: Reconstruction with EM3-D Structure: 3-D StructureCorrespondence: Alternative Approach: Correspondence: Alternative Approach Ransac [Fisher/Bolles] = Random sampling and consensus Will be discussed Wednesday Summary SFM: Summary SFM Problem Determine feature locations (=structure) Determine camera extrinsic (=motion) Two Principal Solutions Bundle adjustment (nonlinear least squares, local minima) SVD (through orthographic approximation, affine geometry) Correspondence (RANSAC) Expectation Maximization