Extracting tweets from Twitter in Rstudio

Ariyanjyr · October 15, 2022, 5:42pm

Hi!

I was be able to extract tweets from twitter into R, using search_30days. However, I wanted to exclude the retweets, but got the error code 422 which means that something is wrong with the way I called it.

This is what I use:

´´´
dataBTC1 ← search_30day(“Bitcoin analysis lang:en -is:retweet”, n = 100, env_name = “Tweets30”, fromDate = “20220915000”, toDate = “202209152359”, parse = FALSE)
´´´
Moreover, I have converted the tweets from a list into a data frame by doing the following:

´´´
dataBTC1 ← search_30day(“Bitcoin analysis future lang:en”, n = 100, env_name = “Tweets30”, fromDate = “20220915000”, toDate = “202209152359”, parse = FALSE)

dfbtc1 ← do.call(rbind.data.frame, dataBTC1)
´´´

However, when I change my search query to:

´´´
dataBTC1 ← search_30day(“Bitcoin analysis on the market lang:en”, n = 100, env_name = “Tweets30”, fromDate = “20220915000”, toDate = “202209152359”, parse = FALSE)

dfbtc1 ← do.call(rbind.data.frame, dataBTC1)
´´´
I get the following error:

´´´
Warning: non-unique values when setting ‘row.names’: ‘1’, ‘2’, ‘3’, ‘4’, ‘5’, ‘6’, ‘7’Error in .rowNamesDF<-(x, value = value) :
duplicate ‘row.names’ are not allowed
´´´

How can this be fixed? I also noticed that the first code results in a large list approx 2 mb and the second code results in a large list (16 elements, 2.9 mb). I assume it has something to do with this, but I dont know how to fix it.

llrs · October 20, 2022, 8:30am

Be aware that with parse = FALSE the returned data is not parsed by rtweet and it is in json format. I’m not sure why you expect it to work with the rbind.data.frame or why it worked in the first case.
You might be using parse = FALSE to avoid the issue with the new edit* fields that the API is returning and that the current released rtweet version doesn’t know how to handle. I’ve set a workaround in the devel version of rtweet in github, that I plan to release soon.

kenkoonwong · October 31, 2022, 3:05am

Any news on this yet? Or should I install devel version?

Topic		Replies	Views
Error using network_data() function for rtweet data Package Use Questions	2	550	August 22, 2022
search_30day() 403 error? General Q&A rtweet	3	339	October 31, 2022
rtweet save_as_csv failing General Q&A rtweet	3	1255	September 3, 2022
Twitter API Failed [403] General Q&A rtweet	5	2044	July 22, 2022
rtweet update 1.0.2 Package Use Questions rtweet	4	554	August 1, 2022

Extracting tweets from Twitter in Rstudio

Related topics