October 25, 2013 – Robert Stine, Professor of Statistics at the Wharton School of the University of Pennsylvania, presented his seminar, “Featurizing Text: Converting Text into Predictors for Regression Analysis.” Professor Stine’s work has appeared in numerous journals, including the Journal of the American Statistical Association, the Journal of the Royal Statistical Society, and the Annals of Statistics.
In his presentation, Professor Stine introduced three ways of converting text into numerical values. He guided the audience through counting words, principal components analysis of word counts, and forming eigenwords from sequences of words. These new predictors can then be used to build regression models.
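For readers who want a concrete picture of the first two ideas, here is a minimal sketch in Python. It is not taken from the seminar: the four toy documents, the response values, and the choice of two components are all made up for illustration, and the eigenword construction is left out because it depends on details not given here.

```python
import numpy as np

# Hypothetical toy data (not from the seminar): four short texts
# and an invented numeric response for each.
docs = [
    "large house with garden",
    "small house no garden",
    "large apartment with balcony",
    "small apartment no balcony",
]
y = np.array([300.0, 200.0, 250.0, 150.0])

# Step 1: count words. Each row is a document, each column a vocabulary word.
vocab = sorted({word for doc in docs for word in doc.split()})
counts = np.array(
    [[doc.split().count(word) for word in vocab] for doc in docs],
    dtype=float,
)

# Step 2: principal components of the word counts, computed from the
# singular value decomposition of the column-centered count matrix.
centered = counts - counts.mean(axis=0)
U, s, Vt = np.linalg.svd(centered, full_matrices=False)
k = 2  # number of components to keep (an arbitrary choice for this sketch)
scores = U[:, :k] * s[:k]

# Step 3: use the component scores as predictors in a least-squares regression.
X = np.column_stack([np.ones(len(docs)), scores])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
fitted = X @ coef
```

With real text, the count matrix would be far wider and sparser than this one, so keeping only a handful of components is what makes the regression manageable.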
If you were unable to attend, be sure to watch the recording. You can find Professor Stine’s seminar here: http://videostreaming.gc.cuny.edu/videos/video/1079/
Want to learn more about Professor Stine’s work, read his draft manuscripts, or look through his presentation slides? You can find them on his website here: www-stat.wharton.upenn.edu/~stine