Machine Learning Foundations

来源:互联网 发布:马其顿 知乎 编辑:程序博客网 时间:2024/06/05 07:14

1.  when can machines learn?

1.1 The Learning Problem

1.1.1 what is machine learning


learning: acquiring skill

  with experience accumulated from observations

  observations --> learning --> skill

machine learning: acquiring skill <-> improving some performance measure

  with experience accumulated/computed from data

  data --> machine learning --> skill (improved performance measure)


ML: an alternative route to build complicated systems

Some Use Scenarios
• when human cannot program the system manually
  —navigating on Mars
• when human cannot ‘define the solution’ easily
  —speech/visual recognition
• when needing rapid decisions that humans cannot do
  —high-frequency trading
• when needing to be user-oriented in a massive scale
  —consumer-targeted marketing


Key Essence of Machine Learning

1 exists some ‘underlying pattern’ to be learned
  —so ‘performance measure’ can be improved
2 but no programmable (easy) definition
  —so ‘ML’ is needed
3 somehow there is data about the pattern
  —so ML has some ‘inputs’ to learn from


1.1.2 Components of Machine Learning
A takes D and H to get g


1.1.3 Machine Learning and Other Fields


difficult to distinguish ML and DM in reality

Machine Learning: use data to compute hypothesis g that approximates target f
Data Mining: use (huge) data to find property that is interesting

• if ‘interesting property’ same as ‘hypothesis that approximate target’
  —ML = DM (usually what KDDCup does)
• if ‘interesting property’ related to ‘hypothesis that approximate target’
  —DM can help ML, and vice versa (often, but not always)
• traditional DM also focuses on efficient computation in large database


ML is one possible route to realize AI

Machine Learning: use data to compute hypothesis g that approximates target f
Artificial Intelligence: compute something that shows intelligent behavior

• g ≈ f is something that shows intelligent behavior
  —ML can realize AI, among other routes
• e.g. chess playing
  • traditional AI: game tree
  • ML for AI: ‘learning from board data’


statistics: many useful tools for ML

Machine Learning: use data to compute hypothesis g that approximates target f
Statistics: use data to make inference about an unknown process
• g is an inference outcome; f is something unknown
  —statistics can be used to achieve ML
• traditional statistics also focus on provable results with math assumptions, and care less about computation