Material Detail

Generalization to Unseen Cases: (No) Free Lunches and Good-Turing estimation

This video was recorded at Workshop on Modelling in Classification and Statistical Learning, Eindhoven 2004. The present workshop addresses the problem of predicting a - binary - label Y from given the feature X. A procedure for classification is to be learned from a training set (X1, Y1) , ... , (Xn , Yn ). In the statistical literature on classification, the training set is traditionally seen as an i.i.d. sample from the distribution P of (X,Y), but one otherwise does not assume any a priori knowledge on P. Theoretical results have been derived that hold no matter what P is, which typically means that such results concentrate on worst cases. There are various reasons to step aside from this so-called black box approach. For example, the by now generally accepted rule ``regression is harder that classification" has led to a bad name for certain "plug in" methods, although under distributional assumptions the latter are at least competitive with ``direct" methods. Moreover, theoretical results for a case where P is assumed to be within a small class, can give benchmarks on what one may hope for. Also, procedures which adapt to properties of P need further exploration. These procedures are designed to work well in case one is "lucky", and are as such also inspired by having certain distributional assumptions in the back of ones mind. It moreover is often quite reasonable to assume some knowledge of the marginal distribution of X.

Keywords:: videolectures, ocwc, oec

Disciplines:

Science and Technology / Computer Science / Programming & Programming Languages

Go to Material

Bookmark / Add to Course ePortfolio

Create a Learning Exercise

Add Accessibility Information

Rate

Add a Comment

Quality

User Rating
Comments
Learning Exercises
Bookmark Collections
Course ePortfolios
Accessibility Info

Report Broken Link
Report as Inappropriate

More about this material

Material Type:: Presentation
Date Added to MERLOT:: February 10, 2015
Date Modified in MERLOT:: February 10, 2015
Author:: Teemu Roos, Helsinki Institute for Information Technology
Submitter:: The Open Education Consortium
Primary Audience:: College General Ed, College Lower Division, College Upper Division
Technical Format:: Video

Mobile Compatibility:: Not specified at this time
Language:: English
Cost Involved:: No
Source Code Available:: No
Creative Commons:: This work is licensed under a Attribution-NonCommercial-NoDerivs 3.0 United States