2007年1月30日 星期二

Summary of Reading: Introduction to Machine Learning Chapter 1

這本書第一章是Machine Learning的基本介紹

1. 這是對machine learning的定義,讓我可以從更廣義的角度來看何謂ML
Machine Learning is programming computers to optimize a performance criterion using example data or past experience. We have a model defined up to some parameters and learning is the execution of a computer program to optimize the parameters of the model using the training data or past experience. The model may be predictive to make predictions in the future, or descriptive to gain knowledge from data, or both

ML使用統計的理論來建立數學模型,並可以應用於不同領域問題,包含Data Mining、AI等

2. 一直以來我們幾乎都講求predictive accuracy,但我覺得這是應用他人的演算法才如此
如果當你自行設計新的演算法時,還要考慮space and time complexity

3. ML主要可分為下面五類
  • Association
  • Classification (supervised learning)
  • Regression (supervised learning)
  • Unsupervised Learning
  • Reinforcement Learning

4. ML的應用如下…

  • Association Rule:Basket Analysis
  • Pattern Recognition:optical character recognition、face recognition、medical diagnosis、speech recognition
  • Knowledge Extraction: Learning a rule from data
  • Outlier detection
  • Robotics
  • Image Compression
  • …太多了

5. Supervised Learning:the task is to learn the mapping from the input to the output
Unsupervised Learning:find the regularities in the input. There is a structure to the input space such that certain patterns occur more often than others, and we want to see what generally happens and what does not. In statistics, this is called density estimation. One method for density estimation is clustering

6. Reinforcement Learning: find policy that is the sequence of correct actions to reach the goal
one factor that makes reinforcement learning harder is when the system has unreliable and partial sensory information. we can not decide in a partially observable state → using multiple agents that interact and cooperate to accomplish a common goal.For example, robots playing soccer

7. In statistics, going from particular observations to general descriptions is called inference and learning is called estimation. Classification is called discriminant analysis in statistics. In engineering, classification is called pattern recognition and the approach is nonparametric and much more empirical
(終於解答我之前一直confuse的地方- 「pattern recognition」的意義)

8. Relevant Resource

Journal

  • Machine Learning
  • Journal of Machine Learning Research
  • Neural Computation
  • Neural Networks
  • IEEE Transactions on Neural Networks
  • IEEE Transactions on Pattern Analysis
  • Machine Intelligence
  • Artificial Intelligence
  • Pattern Recognition
  • Fuzzy Logic
  • Data Mining and Knowledge Discovery
  • IEEE Transactions on Knowledge and Data Engineering
  • ACM SIGKDD

Conference

  • Neural Information Processing Systems (NIPS)
  • Uncertainty in Artificial Intelligence (UAI)
  • International Conference on Machine Learning (ICML)
  • European Conference on Machine Learning (ECML)
  • Computational Learning Theory (COLT)
  • Internatioal Joint Conference on Artificial Intelligence (IJCAI)
  • ......................

Dataset (DataBase)

-------------------------------------------------
下一章 Supervised Learning

沒有留言: