My vignette is about text mining and analysis. It uses the tm and topicmodels packages in R, together with Latent Dirichlet Allocation (LDA), to work out what a set of documents is about without having to read them all!
The vignette shows you how to create a Document-Term Matrix, then uses LDA to work out what key themes (topics) are present in a body of documents (called a corpus). Rather than placing each document in a single topic, it gives each document a probability for every topic.
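To give a rough idea of that workflow, here is a minimal sketch. It is not the code from the vignette: the four toy documents, the cleaning steps, the choice of `k = 2` topics and the seed are all assumptions made purely for illustration.

```r
# Minimal sketch of the DTM -> LDA workflow.
# The toy documents, k = 2 and the seed are illustrative assumptions.
library(tm)
library(topicmodels)

docs <- c(
  "The cat sat on the mat and the dog chased the cat",
  "Dogs and cats are popular pets and pets need food",
  "The stock market fell as investors sold shares",
  "Investors buy shares when the market rises"
)

# Build a corpus and apply some basic cleaning
corpus <- VCorpus(VectorSource(docs))
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Document-Term Matrix: one row per document, one column per term,
# cells hold term counts
dtm <- DocumentTermMatrix(corpus)

# Fit LDA with k topics (k has to be chosen by the user)
lda_model <- LDA(dtm, k = 2, control = list(seed = 1234))

# Most probable terms for each topic
terms(lda_model, 5)

# Documents x topics matrix of probabilities; each row sums to 1
doc_topics <- posterior(lda_model)$topics
round(doc_topics, 3)

# Single most likely topic for each document
topics(lda_model)
```

With so few documents the topics will not be very meaningful, and the topic numbering is arbitrary. On a real corpus you would also need to experiment with the number of topics.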
This tool can help a user find a relevant document without having to search for it by name, or even know exactly what it was written about!
Anyway, here is the link to my vignette:
I hope you find it useful.