Jake's CS Notes - Computer Vision

//****************************************************************************//
//*********** Midterm Review & Fear-Mongering - October 7th, 2019 ***********//
//**************************************************************************//

- Alright, here's where my knowledge debt hopefully gets called in (i.e. at the practice test, i.e. BEFORE the actual exam)

- On Wednesday, the actual exam will be on paper; it should be 8 questions (some of which may have multiple parts)
- In the meantime, take the practice midterm on Canvas! You're free to use reference materials (since, y'know, it doesn't count for a grade), but I think that'd rob you of a good reflection of your actual knowledge

--------------------------------------------------------------------------------

- "Alright, the TAs told me this was a medium quiz - did you think it was hard?"
- Collective class: YES.
- The questions should all be from the slides; some're based on recall, others are based off of reasoning and thinking through stuff

- Okay, so here're the questions from the practice midterm:
1. How do you compute a line joining 2 points "p" and "q" in homogenous coordinates?
- ANSWER: The cross product! p X q
- I got this right!
2. Give the 3x3 matrix form of a 2D similarity transform using the symbols s, R, and t (and 0/1 as constants)
- ANSWER:

[s*R t] OR [R t]
[0 1] [0 1/s]

- I got this right!
- Remember, non-uniform scaling is an AFFINE transformation!
3. How many degrees of freedom does this 2D similarity transform have?Explain why.
- ANSWER: 4 degrees - 1 from scaling, 1 from 2D rotation, and 2 from translation
4. The phong shading model augments the Lambertian model with...
- ANSWER: This adds an ambient term AND a specular term!
- I forgot the ambient term :(
5. What's the benefit of max-pooling in convolutional networks? How does it affect the receptive field of neurons in successive layers?
- ANSWER: The pooling itself helps by making the analysis more robust and saying "hey, as long as I saw a feature within this 5x5 window, we're good," rather than depending on pixel-exact locations for each feature. This grows the receptive field as we go down the pipeline (approximately doubling it) and leading to looking at higher-level features.
- In general, layers get smaller as we go through the pipeline, but the receptive field gets bigger
- I...kinda got this right (got the )
6. A bilateral filter is a nonlinear filter; explain what it does and why it's useful.
- ANSWER: This filter figures out where edges are and gets rid of noises only on one side of the edge, making it useful for not destroying edges.
- I got this wrong :(
7. Give another nonlinear filter and explain why it's useful.
- ANSWER: Median filter, etc.
8. Given the 2nd moment matrix formula in the Harris detector, explain what it means and how it's used in the Harris corner detection
- ANSWER: Here, "M" - the 2nd moment matrix - is the 2nd derivative of how much intensity values have changed around our pixel, letting us get a quadratic view of how these values change to figure out how "corner-like" they are
- x/y are coordinates of the pixel in our windows (I think?), w is our windowing function giving weights, and then I_x/I_y are the image gradients in the x/y directions
9. What determines the dimensions of the Jacobian J associated with a 2D point match in least-squares fitting? What do the columns mean?
- "Okay, here I probably should've given you the least-squares algorithm formula to help you"
- ANSWER: so, the idea here is we're mapping a point "p" in one image to a point "p'" in another image; thre size of this point determines the size of the Jacobian. If |p| = 6, then, "J" would be 2x6 (I believe because it's 2D, so x/y coordinates?); the columns of the Jacobian would be the partial derivative of the output point with respect to the derivative (missed this, but I think how much the output image should change for that coordinate in p?). for the Hessian, then, the Jacobian would be 2x6, while the Hessian "A" would be 6x6 (I think since it's the derivative of the Jacobian?).
- ...I was not particularly close (I need to go through the slides again)
- "View P as a set of 6 knobs, where if you move one of those knobs, everything in the image moves; J tells us how sensitive to be to each of those knobs."
10. Which of the following statements is true about estimating a Euclidean transform between two 3D coordinate frames (i.e. 3D point -> 3D points) (basically, degrees of freedom, linear/non-linear, )?
- ANSWER: 6 degrees of freedom, non-linear least squares (because rotation involves sines/cosines), 3 matches needed (6 degrees of freedom, each point gives 3 degrees - HOWEVER, could spin it on the axis between the 2 points)
- I got the correct DOF, forgot the sine/cosine non-linearity, screwed up the matches needed
11. Draw the epipolar lines/points for this diagram
- I can't draw here, but I did actually get it right! Things to watch out for:
- e/L line are on the left image, e'/l' are on the right image

- *Professor Delleart tries to keep us for learning, realizes it's 10 minutes past the end of class, frantically apologizes*
- Okay, be here for the exam on Wednesday! Study hard!