Tom M. Mitchell, "Does
Machine Learning Really Work? AI Magazine, Fall 1997 v18 n3 p11(10)
![]() |
![]() |
Fredkin Professor
of AI and Learning
Director, Center for Automated Learning and Discovery School of Computer Science Carnegie Mellon University |
"Although we do not yet know how to make computers learn nearly as well as people learn, in recent years many succesful machine-learning applications have been developed"
THE NICHE FOR MACHINE LEARNING
Data Mining
Difficult-to-Program Applications
Customized Software
Applications
Use historical databases to improve subsequent decision making
Examples:
Banks: which future loan applicants are likely to be credit worthy?
Hosptials: Which new patients are likely to respond best to new treatments?
Credit Card Companies: Which transactions are likely to be fraudlent?
Businesses: Which new customers are likely to buy new products?
Colleges: Which new students are likely to graduate?
For more on data mining, see http://www.kdnuggets.com/
Example: Predicting Medical Outcomes from Historical Data
Data: records on 9714 pregnant women
215 attributes over time: health history, measurements, type of delivery, final health of mother and baby.
General Problem: Given time-series data, we want to learn to predict features that occur late in the time series based on features known earlier.
Which future patients
are at exceptionally high risk of requiring an emergency caesarean section?
Patient
103 time =1
Age: 23 FirstPregnancy: no Anemia: No Diabetes: No PreviousPrematureBirth? No Ultrasound: ? Elective C-Section: ? Emergency C-Section: ? |
Patient
103 time =2
Age: 23 FirstPregnancy: no Anemia: No Diabetes: YES PreviousPrematureBirth? No Ultrasound: abnormal Elective C-Section: no Emergency C-Section: ? |
Patient
103 time =n
Age: 23 FirstPregnancy: no Anemia: No Diabetes: No PreviousPrematureBirth? No Ultrasound: ? Elective C-Section: no Emergency C-Section: Yes |
Learned Rule:
If No previous vaginal delivery and
Abnormal 2nd Trimester Ultrasound and
Malpresentation at admissions,
Then
Probability of Emergency C-Section is 0.6
Training set accuracy: 26/41 = .63
Test set accuracy: 12/20 = .60
THE NICHE FOR MACHINE LEARNING
Data Mining
Difficult-to-Program Applications
Customized Software
Applications
Some applications have proved too difficult for traditional manual programming:
Face Recognition
Speech Understanding
http://www.cs.cmu.edu/~tom/faces.html
THE NICHE FOR MACHINE LEARNING
Data Mining
Difficult-to-Program Applications
Customized
Software Applications
Example: News Weeder: Learns the reading interests of its users.
News Weeder develops a general model of a reader's interest from a collection of specific text documents the user has rated for level of interest.
It then automatically examines new articles to produce a "top 20" list for the user.
K. Lang. News Weeder: "Learning to Filter
Netnews". In Proc. of the 12th International Conference on Machine Learning
ICML95,
1995
True
Rating |
Predicted Rating | ||||||||||
1 | 2 | 3 | 4 | 5 | skip | Total | |||||
1 | 0 | 1 | 0 | 0 | 0 | 1 | 2 | ||||
2 | 1 | 15 | 6 | 4 | 0 | 15 | 41 | ||||
3 | 0 | 6 | 31 | 20 | 0 | 15 | 72 | ||||
4 | 0 | 6 | 8 | 42 | 0 | 20 | 76 | ||||
5 | 0 | 0 | 0 | 4 | 0 | 1 | 5 | ||||
skip | 0 | 8 | 4 | 5 | 1 | 141 | 159 |