Here I am again with a nebulous question.
Does anyone know of an established method for automatically extracting citations to data sets from published articles (in R or elsewhere)?
For example, the Water Survey of Canada requests that users cite their data with:
“Extracted from the Environment and Climate Change Canada Real-time Hydrometric Data web site (https://wateroffice.ec.gc.ca/mainmenu/real_time_data_index_e.html) on [DATE]”
Obviously I could do this by searching for a fragment of the above citation (in Google Scholar or wherever), downloading the full text of each hit, and then parsing it to see if the citation phrase occurs in the text or bibliography, but that seems pretty excessive…
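
For what it's worth, the last step (checking whether the citation phrase occurs in an article's text) could be sketched roughly like this in Python. This is only a rough sketch, assuming the full text has already been obtained as a plain string; it does not cover the search or download steps. The names `find_data_citations`, `PHRASE`, `DATE` and `URL_FRAGMENT` are made up for illustration, and the date formats it recognises are a guess at how authors might fill in [DATE].

```python
import re

# Core wording of the requested citation, tolerant of case, line breaks and
# "real-time" / "real time" / "web site" / "website" variants.
PHRASE = re.compile(
    r"extracted\s+from\s+the\s+environment\s+and\s+climate\s+change\s+canada\s+"
    r"real-?\s*time\s+hydrometric\s+data\s+web\s*site",
    re.IGNORECASE,
)

# Assumed date formats: 2020-03-12, 12 March 2020, March 12, 2020.
DATE = re.compile(
    r"\bon\s+(\d{4}-\d{2}-\d{2}"
    r"|\d{1,2}\s+[A-Za-z]+\s+\d{4}"
    r"|[A-Za-z]+\s+\d{1,2},\s+\d{4})"
)

URL_FRAGMENT = "wateroffice.ec.gc.ca"


def find_data_citations(full_text, window=300):
    """Return one dict per occurrence of the citation phrase in full_text."""
    # Collapse line breaks and repeated spaces left over from PDF extraction.
    text = re.sub(r"\s+", " ", full_text)
    hits = []
    for match in PHRASE.finditer(text):
        # Look for the URL and the access date shortly after the phrase.
        tail = text[match.end(): match.end() + window]
        date = DATE.search(tail)
        hits.append({
            "phrase": match.group(0),
            "has_url": URL_FRAGMENT in tail.lower(),
            "date": date.group(1) if date else None,
        })
    return hits


if __name__ == "__main__":
    sample = (
        "Flow data were\nExtracted from the Environment and Climate Change "
        "Canada Real-time\nHydrometric Data web site "
        "(https://wateroffice.ec.gc.ca/mainmenu/real_time_data_index_e.html) "
        "on 12 March 2020."
    )
    print(find_data_citations(sample))
```

Matching on a loose regular expression rather than the exact string is deliberate, since authors rarely copy the requested wording character for character. A paper that paraphrases the citation entirely would still be missed.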