Taxize: Get rank of lowest common taxon

sckott · February 26, 2016, 1:09am

Thanks much! Sorry haven’t gotten to the pull request yet, will do so very soon

Thanks for adding the rank name test

Will have a look at that warning vs. stop thing

jimmyodonnell · February 26, 2016, 5:22pm

No rush from my end. Holler if I can help.

I didn’t submit a pull request for the rank name validation; it’s pushed to my fork, but I didn’t know if piling multiple pull requests on top of one another was a good idea.

sckott · February 26, 2016, 9:04pm

makes sense to add commits to the PR you sent if related. If not related, definitely send as a different PR

sckott · February 26, 2016, 9:13pm

#1

that makes sense to allow calling classification() outside of the function first, then passing the results in to lowest_common() - I’ll think about how to make that as smooth as possible.

#2

where is that code in your PR?

#3

Not off the top, but I’ll think about it.

We can make this work, just need to tweak the internals a bit

Thanks for pointing that out, will have a look

jimmyodonnell · February 26, 2016, 10:47pm

Regarding adding a check commit to the pull request: Looks like it’s already there? https://github.com/ropensci/taxize/pull/509/commits

#2: line 39 (low_rank = NULL); line 50:55.

sckott · February 27, 2016, 12:04am

@jimmyodonnell Seems like the results that spit out should include the taxon name itself as well as the rank of that name (if not known then that replacement you’ve put in), Agree?

jimmyodonnell · February 27, 2016, 12:26am

I’m confused – Do you mean when a low_rank option is forced? I suppose I didn’t think that was necessary since it was supplied, but maybe in some cases it’d be useful?

sckott · February 27, 2016, 12:30am

As far as is possible, it’s best if functions always return the same structure, whether it’s a single character string, a vector, a data.frame, etc. With the low_rank option it returns a single character vector, while not using it returns a data.frame., e,.g

             name      rank     id
16 Epidendroideae subfamily 158332

jimmyodonnell · February 27, 2016, 12:42am

Ah; roger that. I see why you were looking to put them in separate functions originally. Which makes more sense to you: Two separate functions, or one whose output looks like this if the lowest common taxon is higher than the specified level:

>lowest_common(getuid(c("Humulus lupulus", "Homo sapiens")), low_rank = "family")
  name   rank  id
1   NA family  NA

sckott · February 27, 2016, 12:45am

is that supposed to be a taxonomic name in that data.frame that’s returned?

jimmyodonnell · February 27, 2016, 12:51am

No. The intention there is to say “Show me the name of the taxon at the family level that these taxa have in common”. Because Hops and Humans do not share the same taxon at the family level, it outputs NA.

This might seem totally pointless; for metabarcoding studies it’s pretty common to want to consolidate everything at the same taxonomic rank.

Does that make any sense?

sckott · February 27, 2016, 12:54am

Yep, that makes sense

Also, made some changes, can see the diff here work on lowest_common, now exported, in proper fxn structure now #505 · ropensci/taxize@f145fb2 · GitHub

jimmyodonnell · February 27, 2016, 1:17am

Looks great. As is, the valid_names object is not defined; is it added elsewhere?

sckott · February 27, 2016, 3:34am

Not understanding, what do you mean?

jimmyodonnell · February 27, 2016, 9:19pm

Sorry, I Friday afternooned – looked at just the diff, not the whole file. Everything looks fine.

Bouring · October 4, 2017, 5:08pm

jimmyodonnell and sckott hello. you guys seem to be experts… or at least you seem to know these things much better than me. i have registered just a while ago because i want to learn instead of sitting home doing nothing but searching for humatrope (because that’s what i am taking right now because of health issues), so i see that you really know these things and wanted to ask if you could answer my questions that i would most likely have later. thanks

Topic		Replies	Views
taxize: new function id2name() and WORMS in downstream() Package Use Questions r , taxize , taxonomy , worms	0	1002	November 8, 2018
Taxonomic databases from R Package Use Questions sql , taxize , taxonomy	2	1712	March 24, 2016
Using taxize and highcharter in R to extract and visualize taxonomic data UseCases	0	857	September 13, 2022
neotoma & taxize - resolve taxon names from the Neotoma Paleoecological Database UseCases taxize , taxonomy , neotoma	0	1124	October 1, 2016
Taxize v0.6 is on CRAN Package Use Questions	0	1288	June 19, 2015

Taxize: Get rank of lowest common taxon

Related topics