
DSI Workshop: Topic Modeling with LDA
June 1, 2018 @ 9:30 am - 12:00 pm
Topic Modeling with Latent Dirichlet Allocation (LDA)
This DSI workshop, led by Associate Director Dr. Carl Stahmer, takes an in-depth look at hyperparameters: the math behind the algorithms and the effects of the tuning parameters.
Prerequisites: beginner R skills and a working R environment with the following packages installed: tm, topicmodels, ggplot2. If you have a corpus you want to work on, bring it in ASCII form. If not, a practice corpus will be available for you to use.
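To check the package prerequisite ahead of time, something like the following sketch may help. It assumes the packages come from CRAN under the names tm, topicmodels, and ggplot2; `missing_packages` is a hypothetical helper written for this example, not part of any package.

```r
# Packages named in the prerequisites (R package names are case-sensitive).
required <- c("tm", "topicmodels", "ggplot2")

# Hypothetical helper: return the subset of `pkgs` that cannot be loaded
# from the current R library.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

to_install <- missing_packages(required)

if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
  # Install only in an interactive session, so that running this file as a
  # script never starts a download unexpectedly.
  if (interactive()) install.packages(to_install)
} else {
  message("All workshop packages are installed.")
}
```

Run interactively, this offers to install whatever is missing. Run as a script, it only reports what is missing.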
Resources
“You shall know a word by the company it keeps.” Firth, J. R. (1957:11)
Recording
GitHub Repo
Blei et al. (2003), “Latent Dirichlet Allocation,” Journal of Machine Learning Research