Generate pdf from xml

andreifoldes · December 11, 2017, 3:17pm

Dear forum,

I’d like to text-mine Science Direct papers with an R package called statcheck, but I am only able to reliable get xml through the API.

Is there any way to generate PDF file from the XML? Statcheck only works on HTML and PDF formats.

Thank you,
Andrei

sckott · December 11, 2017, 3:37pm

Curious, how are you using the API?

will get back to you on the xml to pdf thing

andreifoldes · December 11, 2017, 5:32pm

Thank you for the prompt reply.

With respect to the unsuccessfull pdf download thing, I should add, I’m having the issue that I only get the first page using the URL that they provide, so I didn’t try via the “fulltext” package.

I am having some trouble understanding how the pdf download works in “fulltext” - can you please explain how I supposed to get the pdf exactly? I ran some of the dummy examples, like:

res ← ft_get(x=‘10.1101/012476’)
res$biorxiv

but I don’t understand what I am to do with:

res$biorxiv$data$path
[[1]]
[1] “~/.fulltext/10.1101_012476.pdf”

thank you

sckott · December 11, 2017, 6:57pm

“~/.fulltext/10.1101_012476.pdf” is a path to the pdf file. more later -i’m in a conference rightnow

Topic		Replies	Views
Transform article XML into ft_data Package Use Questions r , package	1	682	June 15, 2018
pdftools for parsing .pdf from a URL - public data mining UseCases package , pdftools	0	1622	February 15, 2020
Feedback on text mining in rcrossref package Package Use Questions	0	1318	January 16, 2015
pdftools + map to download & read multiple pdfs UseCases pdftools , purrr	0	1672	July 15, 2021
New package: fulltext Package Use Questions literature , openaccess	1	1968	August 7, 2015

Generate pdf from xml

Related topics