Twitter for Academic Research
- See what other ideas have been submitted and vote for ideas you like.
- If you have an idea that isn't already listed, create a new post.
- Include only one idea per post.
-
Allow academic research on deleted tweets/ suspended accounts
Twitter’s terms of service requires us to delete tweets we have collected when they are removed from Twitter. This makes academic research on certain topics practically impossible, as continually monitoring whether a tweet is still present is unfeasible, and much time could be spent on research that would to be have to have to be abandoned because using the data would violate Twitter’s terms of service. Could an exception requiring the deletion of tweets to the terms of service be made for academic research?
6 votes -
Allow access to the "friends" of suspended accounts
I would like to study who networks of bots tend to follow, but since they are quickly suspended this data doesn't seem to be available anywhere.
6 votes -
Provide a way to retrieve quote tweets
If I have a set of tweet IDs, I do not believe there is a natural way of collecting quote tweets of those tweets using the Twitter API v1 or v2. This limits the ability of me and other researchers to fully study things like political dialogue, polarization, and interactions with political elites.
I believe it may be possible to retrieve quote tweets by searching tweet text for mentions of the tweet IDs, since quote retweets technically contain a URL pointing back to the quoted tweet. However, I do not think that is obvious to many API users, even if…
5 votes -
Historically Accurate Friend, Follower, Status Count
It is often useful to know when an account gains new followers or followers new accounts. If a researcher queries a user, Twitter returns the number of followers or friends at the time of the query. That makes sense. If, however, I query a tweet, such as by asking for a user's 3200 most recent tweets, each of those tweets will show me the number of followers , friends, or tweets at the time of my query, not the time of the tweet authorship. The only way to get the true number of followers, friends, or statuses when a tweet…
3 votes -
Extend Historical Data For Academics
Academics should receive increased access (beyond seven days) to historical data. The seven-day limitation severely limits academic inquiry and is a primary driver behind extravagant costs for those using Twitter data in their research.
133 votes -
Raise Rate Limits For Academics
Academics should not be burdened with the same rate limits as those who access the APIs for commercial means. Lifting these rate limits would significantly increase the rate at which data could be gathered, creating cost and time efficiencies for those using Twitter data in their research.
89 votes -
Add an API to retrieve Tweet threads
I request the Twitter to allow access to all the replies of a tweet (all the conversations of a thread). It would be very useful for academics working in user behavior analysis, argumentation mining, hate speech detection because comment or reply reflects the actual thought process of a replying user.
53 votes -
covid-19 related datasets of tweets
Given the current need for data around covid-19 would you consider making relevant datasets containing tweet contents available now, possibly with some relaxed redistribution terms. There is a previous idea for datasets in general but right now there is a need for this data. Several groups are publishing different ID datasets but then anyone wanting it is having to pull down 70-300M matching tweets, that's going to be a load on your APIs and is slowing down access. I put a related topic up on the dev forums https://twittercommunity.com/t/covid-19-related-dataset-availability/136030
7 votes -
crear api
No puedo tener una api y la necesito para hacer una PoC para la empresa, se esta usando para hacer análisis de sentimientos
3 votes -
Allow API access to tweets from suspended accounts
For research focused on still-existing troll farms, it's critical to have access to tweets of accounts Twitter already suspended.
As troll farms:
1) are highly skillful in pushing legit users to breaking Twitter rules (and therefore to become suspended)
2) frequently experience suspension of their own accounts
3) when creating new accounts, tend to refer to their previous accounts to easier regain followershipCurrently, conversations with heavy involvement of troll farm accounts look like this:
https://twitter.com/pj1m6y2z1fmwdn7/status/1151811700557045761This Tweet is from a suspended account. Learn more
This Tweet is from a suspended account. Learn more
(single live tweet)
This Tweet is…11 votes -
Informed Consent API
Implement an API / Bulk access call to notify users that their tweets were / are used in a study, with the option to opt-out, or opt-in to more information from the researcher. Maybe a DM with links to IRB / Home Institution / Researcher contact details?
Ideally, i'd like to notify and ask permission from people - in the past, i've implemented this as a Twitter list, adding people to give them some notification - but this doesn't give people an easy opt-out.
Given the number of times data is used, maybe an equivalent to the list of advertisers…
42 votes -
Make Twitter Labels visible in JSON payload
Twitter has launched some labels for Tweets that contain misleading information. For example, under those suspicious tweets, a label named as "Get the facts about.." will show up. However, these labels are not visible in the current json file. These labels are actually pretty interesting and valuable features because we can study the impact of it on the cascade of the misleading information, which will contribute to the current information cascading literature.
2 votes -
Add accurate data on Replies and Likes to the Search API
In the current search API, the only form of engagement for which scholars currently get accurate metadata is RTs. There is no metadata field for Replies and the field for Likes does not return the number of times a tweet has been liked by other users. This information would be very valuable for academics, especially considering that the use of the "Like" button by users has increased substantially.
24 votesYou might be interested in one of our roadmap items: an API to access Tweet engagement & impression metrics.
More detail in our roadmap: https://trello.com/b/myf7rKwV/twitter-developer-platform-roadmap
-
Predefined datasets
I'd like access to predefined datasets that provide ideas for my research and make it easy to get started.
20 votes -
New endpoint for detecting whether tweets are still live
If you have a large collection of tweets it can be unwieldy to use the API to detect which tweets have been deleted - beyond complying with Twitter's own terms of service this is also necessary to respect a users agency over their own content. When a tweet or tweets are no longer available can also have an impact on what and how the results of academic research can be reported.
When sharing a dataset of tweet ids, you can also spend a lot of time trying to hydrate tweets that have mostly been deleted.
It would significantly simplify this…
19 votes -
Nominate non-profit intermediaries for Firehose and other complete data APIs
Take part in Social Science One (https://socialscience.one/ ) and/or work with other non-profit intermediaries such as academic libraries and data providers (for example GESIS in Germany) for Firehose access or access to the complete follow network or other large scale data. They have the most experience when it comes to researchers' needs and can act as a trustworthy partner when it comes to protecting user privacy and the like.
19 votes -
We need an API to validate our bot/troll detection techniques.
What I would like is the ability to use an API to alert Twitter to the presence of malicious actors interfering in elections. Ideally, it would be process whereby we crowd source and gamify the detection pieces, with Twitter's decision to suspend (or not) as the ultimate validation determining who is best at the game.
Recently, I sent 500 Canadian election Twitter IDs our model saw as bots/automated to a third party scholar: Kurt Wirth <kurtawirth@gmail.com>. I asked him if I could share his response to what I sent, he said yes. His response is below:
---------- Forwarded…
7 votes -
Twitter Trends API
Many common research use-cases would be greatly facilitated by an API that could return summary level information. For instance, for any subset of tweets based on a particular query term(s) and period:
hourly/daily/weekly counts of matching tweets
histogram data of top hashtags / handles referenced / links shared
proportion of retweets / replies / first posts
proportion of dataset generated by top n contributors
These would be very valuable from a purely research perspective, but presumably they present considerable challenges for implementation. But a methodology similar to Google Trends might be possible.
10 votes -
Retrieve random tweet from the index (new API call)
Deploy a new API call to retrieve a random tweet from the index. Would be an important asset for characterization studies.
8 votes -
Bring back the Streaming APIs
Streaming was a VERY good+easy way to get low key sampling data, without digging into heavy duty "last x million tweets" data dumps. A couple of lines of code (ie minimum wasted researcher time) and you could start collecting/analysing data.
It is very much missed.
Not to mention, of course, the damage removing streaming has done to the general Twitter ecosystem: http://apps-of-a-feather.com/
7 votes
- Don't see your idea?