Downloading earliest 3200 tweets with Tweepy? - python

I've been trying to download the entire timelines of a number of users on Twitter. While I'm able to download the most recent 3,200 tweets from a user, I was wondering whether anyone has had any success downloading timelines in the opposite chronological order, i.e. from the first tweet to the 3,200th?
The best I can do is view these tweets in my browser via Twitter's Advanced Search feature. But there's no way to copy-paste these tweets into a CSV where I can analyze them.
Thanks for any help you can provide!

I don't think you can get tweets older than the most recent 3,200.
This is a limitation of Twitter's underlying GET statuses/user_timeline API endpoint in the REST API:
This method can only return up to 3,200 of a user’s most recent Tweets. Native retweets of other statuses by the user is included in this total, regardless of whether include_rts is set to false when requesting this resource.

I published a bot, TwEater, on GitHub today. It aims to collect tweets beyond the most recent 3,200 for text mining tasks. Hopefully it helps you.

Related

Twitter API v2 - Rate Limit

I'm accessing Twitter's API v2 and have Academic Research access.
I am interested in pulling as much data as possible, but am getting rate limited.
I am using tweepy in Python to extract the data, via the search_all_tweets call.
I am looping the extraction over each day and limiting the number of tweets extracted per day. By using time.sleep() I can control how many tweets I extract per 15 minutes.
Twitter has published the page below, which should answer my question, but I am still unsure which category, and therefore which limit, applies to me:
https://developer.twitter.com/en/docs/twitter-api/rate-limits#v2-limits
Can anybody help to clarify how many tweets I can extract per 15 minutes before getting rate limited?
Thanks in advance
Go to the Twitter developer portal, select your app in your project and, at the bottom of the Settings tab, check whether you have activated the Read and Write permissions in the OAuth 1.0a authentication. If not, activate them and regenerate your tokens.
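For the time.sleep() pacing described in the question, the arithmetic can be sketched as below. This is only a sketch: the helper names are hypothetical, and the figures 300 requests per 15-minute window and 500 tweets per request are placeholder values that are not confirmed anywhere in this thread, so replace them with whatever the rate-limits page lists for your access level.

```python
import time


def seconds_between_requests(requests_per_window, window_seconds=15 * 60):
    """Evenly spaced delay that keeps request count within one window's budget."""
    if requests_per_window <= 0:
        raise ValueError("requests_per_window must be positive")
    return window_seconds / requests_per_window


def max_tweets_per_window(requests_per_window, tweets_per_request):
    """Upper bound on tweets retrievable in one window."""
    return requests_per_window * tweets_per_request


def paced(items, delay_seconds, sleep=time.sleep):
    """Yield items one by one, sleeping between consecutive items."""
    for index, item in enumerate(items):
        if index:  # no sleep before the first item
            sleep(delay_seconds)
        yield item


# Placeholder limits (assumptions, see the note above).
delay = seconds_between_requests(300)      # seconds to wait between calls
ceiling = max_tweets_per_window(300, 500)  # most tweets possible per window
```

Each day in the extraction loop could then be drawn from paced(days, delay), so that the sleep happens between requests rather than being tuned by hand.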

Retrieve all of a user's recent tweets from the past 7 days and then retrieve all users who replied to those tweets

I want to learn the best way to get all tweets from a user's profile URL, then get all users who replied to those tweets, and remove duplicates from the list before adding it to a CSV or Excel file. I found this Git project, but it's not what I want:
https://github.com/nirholas/Get-Tweet-Replies-With-Python-Tweepy
Could anyone guide me to a better library or idea?
Tweepy is a very good library for interacting with Twitter in Python.
I'd suggest taking a look at Tweepy's documentation and starter guide. It's quite simple to work with once you understand what functions are available.
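The de-duplication and CSV step from the question does not depend on any Twitter library, so it can be sketched with the standard library alone. The function names below are hypothetical, and the sketch assumes the replying users have already been collected as a list of screen names (how they are collected is not shown here).

```python
import csv
import io


def unique_usernames(usernames):
    """Drop duplicates case-insensitively, keeping first-seen order."""
    seen = set()
    unique = []
    for name in usernames:
        cleaned = name.strip().lstrip("@")
        key = cleaned.lower()
        if key and key not in seen:
            seen.add(key)
            unique.append(cleaned)
    return unique


def write_usernames_csv(usernames, handle):
    """Write one username per row, with a header, to an open text handle."""
    writer = csv.writer(handle)
    writer.writerow(["username"])
    for name in usernames:
        writer.writerow([name])


# Example with made-up names; an in-memory buffer stands in for a real file.
repliers = ["@alice", "bob", "Alice", " bob ", ""]
buffer = io.StringIO()
write_usernames_csv(unique_usernames(repliers), buffer)
```

To write a real file, open it with open("repliers.csv", "w", newline="") and pass that handle instead of the buffer.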

How to use Twitter Streaming API?

I have successfully used the Twitter REST API to find tweets and retweet them automatically. Now I want to tweet a thank-you message to users. I am using Python and Tweepy for my bot. I can collect tweets live using the Twitter Streaming API, but I can't work out how to use the API to send a thank-you message to those who favorite my tweets. I read both sets of documentation but couldn't get much out of them. I also searched for an example but couldn't find a good one for the Streaming API.
Can anyone show me how to use it with tweepy to achieve this? It would be nice if you could also show how to use it when someone retweets my tweet or follows me; I want to send them a thank-you tweet automatically.
PS: I understand that Twitter doesn't like much automation, but it's all for learning.

Recovering tweets with media using tweepy

I'm having a problem with the Twitter API. I used this script (Script to delete tweets) to delete the tweets from an account. I see that the Tweepy method api.user_timeline, as used at
    for status in tweepy.Cursor(api.user_timeline).items():
        try:
does not return tweets that contain media (on the Twitter website they don't show up in the timeline but in another tab called 'photos and videos'), so they are not deleted. include_entities is not a parameter for this method.
How can I recover these tweets? I thought I could search with the username as the query, but that doesn't work and could be very inefficient.
Is there another method in the Twitter API, or a parameter that isn't included in the documentation?
Thanks for all the help!
I found that this is a problem on Twitter's side when deleting tweets in bulk. There is nothing we can do for now!

Using Twython to pull all (of Miley Cyrus') tweets?

I'm new to Python and Twython, but I'm working on a project where I want to use Twython to analyze all of Miley Cyrus' tweets. Currently there are 7,193, but Twython will only let me take 200 at a time. How can I scrape all of them? Is there a way to scrape all of them using Twython, or do I have to manually scrape the Twitter website? Ideally I would preserve access to all of the tweets' metadata so I could use it in my analysis (rather than just the text of the tweets). Suggestions for code?
If you read the documentation, you will see that:
This method can only return up to 3,200 of a user's most recent Tweets.
So there is no way the API can be used to programmatically return all historic Tweets. And, yes, you are limited to requesting 200 at a time.
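Within those limits, the usual approach is to page backwards 200 tweets at a time until the 3,200 cap is reached. The sketch below shows only the paging logic: fetch_page is a hypothetical callable, assumed to return tweets newest-first as dicts with an "id" key and to accept count and max_id arguments, and make_fake_fetch is an in-memory stand-in rather than a real API client.

```python
def collect_timeline(fetch_page, page_size=200, cap=3200):
    """Page backwards through a timeline, newest first, up to `cap` tweets."""
    tweets = []
    max_id = None
    while len(tweets) < cap:
        count = min(page_size, cap - len(tweets))
        page = fetch_page(count=count, max_id=max_id)
        if not page:  # nothing older is available
            break
        tweets.extend(page)
        # Ask next time only for tweets strictly older than the last one seen.
        max_id = page[-1]["id"] - 1
    return tweets


def make_fake_fetch(total):
    """Stand-in for an API client: `total` tweets with ids total..1."""
    ids = list(range(total, 0, -1))

    def fetch(count, max_id=None):
        eligible = [i for i in ids if max_id is None or i <= max_id]
        return [{"id": i} for i in eligible[:count]]

    return fetch


# With 7,193 tweets available, only the newest 3,200 are collected.
collected = collect_timeline(make_fake_fetch(7193))
```

With a real client, fetch_page would be a small wrapper around the library's user-timeline call that forwards count and max_id; the tweets older than the cap still cannot be reached this way.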
