Flyman的知識庫: Summary of Reading: Introduction to Machine Learning Chapter 1

這本書第一章是Machine Learning的基本介紹

1. 這是對machine learning的定義，讓我可以從更廣義的角度來看何謂ML
Machine Learning is programming computers to optimize a performance criterion using example data or past experience. We have a model defined up to some parameters and learning is the execution of a computer program to optimize the parameters of the model using the training data or past experience. The model may be predictive to make predictions in the future, or descriptive to gain knowledge from data, or both

ML使用統計的理論來建立數學模型，並可以應用於不同領域問題，包含Data Mining、AI等

2. 一直以來我們幾乎都講求predictive accuracy，但我覺得這是應用他人的演算法才如此
如果當你自行設計新的演算法時，還要考慮space and time complexity

3. ML主要可分為下面五類

Association
Classification (supervised learning)
Regression (supervised learning)
Unsupervised Learning
Reinforcement Learning

4. ML的應用如下…

Association Rule：Basket Analysis
Pattern Recognition：optical character recognition、face recognition、medical diagnosis、speech recognition
Knowledge Extraction: Learning a rule from data
Outlier detection
Robotics
Image Compression
…太多了

5. Supervised Learning：the task is to learn the mapping from the input to the output
Unsupervised Learning：find the regularities in the input. There is a structure to the input space such that certain patterns occur more often than others, and we want to see what generally happens and what does not. In statistics, this is called density estimation. One method for density estimation is clustering

6. Reinforcement Learning： find policy that is the sequence of correct actions to reach the goal
one factor that makes reinforcement learning harder is when the system has unreliable and partial sensory information. we can not decide in a partially observable state → using multiple agents that interact and cooperate to accomplish a common goal.For example, robots playing soccer

7. In statistics, going from particular observations to general descriptions is called inference and learning is called estimation. Classification is called discriminant analysis in statistics. In engineering, classification is called pattern recognition and the approach is nonparametric and much more empirical
(終於解答我之前一直confuse的地方－「pattern recognition」的意義)

8. Relevant Resource

Journal

Machine Learning
Journal of Machine Learning Research
Neural Computation
Neural Networks
IEEE Transactions on Neural Networks
IEEE Transactions on Pattern Analysis
Machine Intelligence
Artificial Intelligence
Pattern Recognition
Fuzzy Logic
Data Mining and Knowledge Discovery
IEEE Transactions on Knowledge and Data Engineering
ACM SIGKDD

Conference

Neural Information Processing Systems (NIPS)
Uncertainty in Artificial Intelligence (UAI)
International Conference on Machine Learning (ICML)
European Conference on Machine Learning (ECML)
Computational Learning Theory (COLT)
Internatioal Joint Conference on Artificial Intelligence (IJCAI)
......................

Dataset (DataBase)

UCI Repository : http://www.ics.uci.edu/~mlearn/MLRepository.html (超多dataset)
UCI KDD Archive：http://kdd.ics.uci.edu/summary.data.application.html (這裡也有一些，而且已分類好了，其實跟上面那一個為同個網站)
Delve：http://www.cs.toronto.edu/~delve/ (有一些dataset及software可以使用)

-------------------------------------------------
下一章 Supervised Learning

Flyman的知識庫

2007年1月30日星期二

Summary of Reading: Introduction to Machine Learning Chapter 1

沒有留言:

2007年1月30日 星期二

Summary of Reading: Introduction to Machine Learning Chapter 1

沒有留言:

2007年1月30日星期二