Package covering two different APIs

frequena · May 25, 2020, 9:05am

Dear rOpenSci community,

I am planning to develop an R package and submit it to rOpenSci.

The package would give access to two different APIs built on a very similar idea: the user provides a text and gets back a data frame of the identified entities and their respective categories.
Since one of the two APIs offers many other functions, I plan to cover only one of its many endpoints.

Although both APIs serve the same purpose, using the two together is very convenient because they identify different categories of entities. The two APIs therefore complement each other.

Looking through the list of packages on rOpenSci, I believe each of them is based on a single API.

Therefore, my question is whether using two APIs in one package should be avoided even when it is justified (as I pointed out above).

Thanks in advance!

Francisco

maelle · May 25, 2020, 10:44am

Hello @frequena, welcome!

@sckott has developed spocc that wraps several of his API wrappers for species occurrence data, which might be a good model to follow. It’d mean writing three packages (one for each API, a further one to wrap them and provide a consistent user interface).
Now, I’m not sure how it’d work with software review (I’m an editor but can’t speak for everyone). It doesn’t sound optimal for anyone involved to submit three packages in one go. (What has happened a few times is packages getting split as a result of reviews.)

Furthermore, if I follow correctly, your tools have some overlap with tokenizers? I’m asking both in terms of scope (i.e. do the APIs extend this functionality significantly) and of interface (if the APIs work with the same input and output as tokenizers, the user might expect a similar interface). I might be completely off, not knowing what the APIs are.

Feel free to open a pre-submission inquiry so we can discuss.

frequena · May 26, 2020, 7:37am

Hello @maelle,

First of all, thank you for your fast reply! These are my two APIs:

NCBI Text Mining web service (Text Mining Web APIs - NCBI - NLM)
biolink (BioLink API)

At first, my idea was to develop a package that takes a text (or a PMID) as user input and returns the words and their categories as a data frame. The scope of the package is therefore an interface to named-entity recognition APIs for biomedical terms.

The NCBI Text Mining web service API fits that task pretty well.

The second API (biolink) has many interesting functionalities. One of them is the nlp/annotate endpoint, which identifies words based on phenotypic information, a functionality not covered by the NCBI Text Mining API but, in my opinion, one with great potential.

Therefore, I thought it would be a good idea to integrate both of them into a single package. But as I said in my previous message, I’m not sure how this approach would be viewed at the software review step.
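
To make the design question concrete, here is a minimal sketch of the layering discussed above: one small client per API, plus a thin function that merges their results into one consistent table of entities and categories. It is written in Python and runs offline. Everything in it is an assumption for illustration: the names `Entity`, `annotate_entities`, `annotate_phenotypes` and `annotate` are hypothetical, and the two backends are toy lexicon lookups standing in for the real web services, not calls to them.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass(frozen=True)
class Entity:
    text: str      # the term found in the input text
    category: str  # e.g. "Gene", "Disease", "Phenotype"
    source: str    # which backend produced this row


def _lookup(text: str, lexicon: Dict[str, str], source: str) -> List[Entity]:
    """Toy matcher: report every lexicon term that occurs in the text."""
    lowered = text.lower()
    return [
        Entity(term, category, source)
        for term, category in lexicon.items()
        if term.lower() in lowered
    ]


# Hypothetical stand-in for the first API client (general biomedical entities).
# A real client would send the text to the web service and parse the response.
def annotate_entities(text: str) -> List[Entity]:
    toy_lexicon = {"BRCA1": "Gene", "breast cancer": "Disease"}
    return _lookup(text, toy_lexicon, "entity_api")


# Hypothetical stand-in for the second API client (phenotype terms).
def annotate_phenotypes(text: str) -> List[Entity]:
    toy_lexicon = {"short stature": "Phenotype"}
    return _lookup(text, toy_lexicon, "phenotype_api")


# Registry of backends; the unified function only knows this mapping.
BACKENDS: Dict[str, Callable[[str], List[Entity]]] = {
    "entity_api": annotate_entities,
    "phenotype_api": annotate_phenotypes,
}


def annotate(
    text: str, sources: Sequence[str] = ("entity_api", "phenotype_api")
) -> List[Entity]:
    """Run the requested backends and return their rows in one list."""
    rows: List[Entity] = []
    for name in sources:
        if name not in BACKENDS:
            raise ValueError(f"unknown source: {name!r}")
        rows.extend(BACKENDS[name](text))
    return rows


if __name__ == "__main__":
    for row in annotate("BRCA1 variants were found in a patient with short stature."):
        print(row.text, row.category, row.source, sep="\t")
```

Because each row records its `source`, the user always sees which API an annotation came from, and each backend could later move into its own package without changing what `annotate` returns.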
