【CS231n】-学习笔记-1-Intro to Computer Vision, historical context.
来源:互联网 发布:ddos网络层攻击 编辑:程序博客网 时间:2024/06/15 11:30
Class: http://cs231n.stanford.edu
Schedule: http://cs231n.stanford.edu/syllabus.html
Slides: http://vision.stanford.edu/teaching/cs231n/slides/winter1516_lecture1.pdf
Video: https://www.youtube.com/watch?v=NfnWJUyUJYU&feature=youtu.be
Explosion of Data
Sensors enable the explosion
Visual Data is hard to grasp the contents
Help to search the content of data needs visual technology
Problems facing today: massive amount of data and the challenges of the dark matter
To know the problems help you go on
Neuroscience
神经科学
Cognitive sciences
认知科学
optics
光学
Image processing , Speech, NLP,
Big Bang of Evolution:543million years, B.C. :
Camera Obscura
相机 暗盒
the beginning of visual processing:simple structure of the world
oriented edges
experiments: awake but anaesthetized cats
little needle electrode to push electrons through to the skull
primary visual cortex: do a log of visual processing
early: tons and tons of new orleans
1st stage: back of the brain, the furthest of the eyes, not ear the eyes
the edges define the shape:
Birthday of CV: 1966, MIT Standford, AI lab,
the beginning of deep learning: David Marr, 1970s Stages of Visual Representation
Goal is to reconstruct 3D model: so we can recognize objects
the first wave of visual recognition algorithms went after the 3D model:
the world is composed of simple shapes like blocks
David Lowe, 1987
Normalized Cut (Shi & Malik, 1997)
Face Detection, Viola & Jones, 2001
the first successful high-level visual recognition algorithms being used by consumer product
the first digital camera that has a face detector Fujifilm 2006
deep learning algorithms try to learn simple features
focus on features: “SIFT” & Object Recognition, David Lowe, 1999
since hard to describe the whole thing
ML tools like SVM to recognize scene: Spatial Pyramid Matching, Lazebnik, Schmid & Ponce, 2006
Deformable Part Model:Felzenswalb, McAllester, Ramanan, 2009
PASCAL Visual Object Challenge (20 object categories), [Everingham et al. 2006-2012]
www.image-net.org 22K categories and 14M images,
Deng, Dong, Socher, Li, Li, & Fei-Fei, 2009
The Image Classification Challenge: 1,000 object classes 1,431,167 images
the beginning of deep learning evolution
cool problems:
labeling of the entire scene with perceptual grouping
combining recognition with 3D
CS231n focuses on one of the most important problems of visual recognition – image classification
There is a number of visual recognition problems that are related to image classification, such as object detection, image captioning
Convolutional Neural Network (CNN) has become an important tool for object recognition
Convolutional Neural Network (CNN) is not invented overnight
Pre-requisite
• Proficiency in Python, some high-level familiarity with C/C++
– All class assignments will be in Python (and use numpy), but some of the deep learning libraries we may look at later in the class are written in C++.
– A Python tutorial available on course website
• CollegeCalculus,LinearAlgebra
• Equivalent knowledge of CS229 (Machine Learning)
– We will be formulating cost functions, taking derivatives and performing optimization with gradient descent.
- 【CS231n】-学习笔记-1-Intro to Computer Vision, historical context.
- Computer Vision(CS131,CS231n)学习笔记(1)
- Computer Vision 学习笔记1 - Fundamentals of image formation
- Computer Vision ---- Introduction to Computer Vision
- CS231N-11-Other Computer Vision Tasks
- introduction to computer vision
- computer vision笔记
- 【Matlab Computer Vision System ToolBox】学习笔记-1-点云配准流程 | 特征匹配
- Computer Vision: Algorithms and Applications(学习笔记一)--introduction
- Programming Computer Vision with Python (学习笔记一)
- Programming Computer Vision with Python (学习笔记二)
- Programming Computer Vision with Python (学习笔记三)
- Programming Computer Vision with Python (学习笔记四)
- Programming Computer Vision with Python (学习笔记五)
- Programming Computer Vision with Python (学习笔记六)
- Programming Computer Vision with Python (学习笔记七)
- Programming Computer Vision with Python (学习笔记八)
- Programming Computer Vision with Python (学习笔记九)
- 自定义圆形头像
- 【总结】消息服务中间件(ActvieMQ)
- 苹果开发那些事儿-D-U-N-S 号申请
- iOS怎么实现不进appstore的增量更新?(类似各种游戏,12306)
- 【总结】MySQL数据库
- 【CS231n】-学习笔记-1-Intro to Computer Vision, historical context.
- Codeforces Beta Round #91 (Div. 2 Only)深度优先
- 网站开发进阶(二十七)导航栏高亮显示
- 【总结】MySQL性能优化
- [leetcode] 319. Bulb Switcher 解题报告
- 0814-应用程序管理(笔记)
- caffe框架翻译-理解(转载)
- 自学ios——基础篇
- 禁止屏幕旋转