pubchunks: extract parts of scholarly XML articles

text-mining
fulltext
pubchunks
r
Tags: #<Tag:0x00007fbc5683c100> #<Tag:0x00007fbc56843ef0> #<Tag:0x00007fbc56843d88> #<Tag:0x00007fbc56843c48>

#1

Author: Scott Chamberlain

pubchunks is a package grown out of the fulltext package. fulltext
provides a single interface to many sources of full text scholarly articles. As
part of the user flow in fulltext there is an extraction step where fulltext::chunks()
pulls parts of articles out of XML format article files.

As part of making fulltext more maintainable and focused on simply fetching articles,
and realizing that pulling out bits of structured XML files is a more general problem,
we broke out pubchunks into a separate package.

Read the full post here: https://ropensci.org/technotes/2018/10/16/pubchunks/