On 21-22 April, the London School of Economics hosted the Text Analysis Package Developers' Workshop, a two-day event held in London that brought together developers of R packages for working with text and text-related data. This included a wide range of applications, including string handling (stringi
) and tokenization (the rOpenSci-onboarded tokenizers
, KoNLP
), corpus and text processing (readtext
, tm
, quanteda
, and qdap
), natural language processing (NLP) such as part of speech and dependency tagging (cleanNLP
, spacyr
), and the statistical analysis of textual data (stm
, text2vec
, and koRpus
) -- although this list is hardly complete. The main objective was to bring together experts working on various aspects of text processing and text analysis using R, to discuss common challenges and identify collaborative solutions.
This is a companion discussion topic for the original entry at https://ropensci.org/blog/blog/2017/05/03/textworkshop17