The document summarizes a workshop on computer vision held at MIT in August 2011. It discusses fundamental techniques in computer vision, such as image processing, feature extraction, structure from motion, and statistical learning methods. It highlights impressive applications and open challenges, including 3D modeling from images and videos, large-scale object detection and classification, and event modeling from video. It also notes hurdles, such as the need for a stronger focus on applications and for large datasets that are representative of the real world.