Homebreadcumb forward arrow iconBlogbreadcumb forward arrow iconArtificial Intelligencebreadcumb forward arrow iconHow to Build a Twitter Sentiment Analysis Python Program? [Step-by-Step Tutorial]

How to Build a Twitter Sentiment Analysis Python Program? [Step-by-Step Tutorial]

Last updated:
7th Aug, 2020
Read Time
9 Mins
share image icon
In this article
Chevron in toc
View All
How to Build a Twitter Sentiment Analysis Python Program? [Step-by-Step Tutorial]


As companies are becoming increasingly data-driven, a Machine Learning technique called ‘Sentiment Analysis’ is gaining immense popularity day by day. It analyses the digital data/text through Natural Language Processing (NLP) to find the polarity (positive, negative, neutral), feelings, and emotions (angry, happy, sad, etc.) expressed in the text. 

Top Machine Learning and AI Courses Online

Ads of upGrad blog

Since Twitter is one of the most comprehensive sources of live, public conversation worldwide, business firms, political groups, etc. are interested in performing ‘Sentiment Analysis’ of tweets to understand the emotions/opinions of the target market or for studying competitors’ market. Although they are ready to use programs for the purpose but to achieve predictions with a high level of accuracy, specific to particular criteria and domains, the best way is to create a customized Twitter Sentiment Analysis Python model or program.

Trending Machine Learning Skills

Enrol for the Machine Learning Course from the World’s top Universities. Earn Masters, Executive PGP, or Advanced Certificate Programs to fast-track your career.

Step-by-step Tutorial: Create Twitter Sentiment Analysis Program Using Python

This tutorial aims to create a Twitter Sentiment Analysis Program using Python. The resultant program should be capable of parsing the tweets fetched from twitter and understanding the text’s sentiments, like its polarity and subjectivity.

The Perquisites

1. Foremost is the basic coding/programming knowledge of Python.

2. Tools to be installed on your computer:

  • Python
  • Libraries: Tweepy, text blob, word cloud, pandas, NumPy, matplotlib

(Tweepy is the official python library for twitter API that enables Python to communicate with Twitter platform)

3. A Twitter Account 

4. A Twitter App needs to be created and authenticated by Twitter: This is necessary to get the ‘Consumer key and Access tokens’ that you will need in your programming.

If you already don’t have a Twitter App created for the purpose, then here is how to create it.

Read more: Python NumPy Tutorial: Learn Python Numpy With Examples

How to Create a Twitter App?

  • Go to the Twitter developer site:
  • Sign in with your Twitter account
  • Go to ‘My applications’
  • Click on ‘Create a new application.’
  • Next, you need to fill a form, as shown below. 


  • Next, click on ‘Create my Access Token.’
  • In the next page, choose the ‘Read and Write’ option under the column ‘Application Type.’

You will be provided with your Twitter App OAuth Settings, which includes all necessary details related to your consumer key, consumer secret, Access token, Access token secret, etc. You need to note these details as these API credentials will enable you to fetch tweets from twitter. Better to save it in a CSV file in your computer, latter you can directly upload the CSV file into your program to read API credentials

Get Started with Creating Twitter Sentiment Analysis Python Program

1. Import the Libraries: Tweepy, text blob, word cloud, pandas, NumPy, matplotlib 

2. Authenticate the Twitter App: Next, you need to authenticate your twitter app using the Twitter App OAuth Settings credentials, also referred to as Twitter API credentials. For this, you need to create an Authentication object, using the codes as shown in the image below. 


To fill up the Twitter API credentials, you can either upload the CSV file or manually copy paste the credential details. 

3. Fetch the Tweets from the Twitter User: Now, for fetching the tweets, you first need to choose a Twitter user whose tweets you want to parse to understand the sentiment expressed in it. Let’s say; you want to see whether the tweets of ‘UserXYZ’ are positive or negative or neutral by performing sentiment analysis of the 100 tweets by the UserXYZ. 

Code for fetching the tweets

posts = api.user_timeline(screen_name = ”UserXYZ”, count= 100, Lang =”en”, tweet_mode=“extended”)

Running the above command will show up the tweets.  

4. Create Data Frame: Now, you need to create a data frame for the tweets you have fetched. Let’s say you name the first column of your df as ‘Tweets’, and it will contain all the tweets spread across 100 rows since you are analyzing 100 tweets. 

Df = pd.dataframe( [tweet.full_text for tweet in posts] , columns=[ ‘Tweet’])

5. Clean the Text: Cleaning the text of the tweets is important for the success of your twitter sentiment analysis python program, as there will be many unwanted symbols like @, #, re-tweets, hyperlinks in the URLs, etc. Here your python’’ library gets into use. 

Get the Subjectivity and Polarity: Once you have cleaned the text, you need to create two functions using the TextBlob python library to get the tweets’ subjectivity and polarity. The subjectivity shows how opinionated the text is, and polarity describes the positivity or negativity of the text. It would be best to write the python script to create two more columns in your data frame to host Subjectivity and Polarity. So, now your data frame will have three columns (first for the tweets, 2nd for the subjectivity, 3rd for the polarity)

The codes for creating Subjectivity and Polarity functions are as follows:


After you run the code, you will see the scores of subjectivity and polarity of each tweet shown in the respective columns. TextBlob describes the polarity within a scale of 1 to -1. So, if a tweet has -0.4 polarity means it’s slightly negative, and if it has 0.6 subjectivity, then it is fairly subjective. 

6. Next, you can choose to include a word cloud in your Twitter Sentiment Analysis Python program, as word clouds are also popular as a data visualization technique used for sentiment analysis, wherein the size of the words indicates its importance. 

Example of a WordCloud:


The matplotlib, Pandas, and WordCloud libraries will come into action that you have already imported. To plot a word cloud-first, you need to create a variable; let’s name it ‘allwords’ to represent all the tweets in the ‘Tweets’ column of the data frame.  

Code for creating WordCloud

allwords = ‘ ‘.join( [twts for twts in df [ ‘Tweets’ ]] )

WordCloud = WordCloud (width =xxx, height =xxx, randon_state =xxx, max_font_size =xxx. generate (allwords) 



7. As you have the polarity scores for each tweet, you can start to compute positive, negative, and neutral analysis of the tweets. For this, you need to create a function, let’s call it ‘Analysis’, wherein you can assign the score 0 to neutral, <0 to negative, and >0 to positive. 

Def Analysis(score): 

 If score < 0

return ‘Negative.’

elif score == 0

return ‘neutral


return ‘positive.’

Next, to host the results of the sentiment analysis of the tweets, create a new column in your data frame, let’s name it ‘TwtAnalysis’ and then write the following code:

df [ ‘TwtAnalysis’ ] = df [ ‘Polarity’ ]. apply(Analysis)

8. The new data frame will have the added column named ‘TwtAnalysis’, and it will refer to each tweet either as positive, negative, or neutral based on its polarity score. An example is shown below in the image:


9. Once you have the classification of the tweets as positive, negative, and neutral, you can continue building your Twitter Sentiment Analysis Python program to represent the data in different formats such as:

  • Get the percentage of positive, negative, or neutral tweets.
  • Print all of the positive comments or negative or neutral tweets separately
  • Create a visual sentiment analysis chart of the positive, negative, and neutral tweets, and much more.

Also Read: Top 9 Python Libraries for Machine Learning

Popular AI and ML Blogs & Free Courses

Check out all trending Python tutorial concepts in 2024.


Ads of upGrad blog

The Twitter Sentiment Analysis Python programexplained in this article, is just one way to create such a program. The developer can customize the program in many ways to match the specifications for achieving utmost accuracy in the data reading, that is the beauty of programming it through python, which is a great language, supported by an active community of developers and too many libraries. 

Python holds immense scope in the space of Machine Learning and Data Science. Those who are into programming for a while know it well that Machine Learning will continue to be one of the breakthroughs in the future of programming.

If you want to get a comprehensive and structured learning experience, also if you’re interested to learn more about machine learning, check out IIIT-B & upGrad’s PG Diploma in Machine Learning & AI which is designed for working professionals and offers 450+ hours of rigorous training, 30+ case studies & assignments, IIIT-B Alumni status, 5+ practical hands-on capstone projects & job assistance with top firms.


Pavan Vadapalli

Blog Author
Director of Engineering @ upGrad. Motivated to leverage technology to solve problems. Seasoned leader for startups and fast moving orgs. Working on solving problems of scale and long term technology strategy.
Get Free Consultation

Select Coursecaret down icon
Selectcaret down icon
By clicking 'Submit' you Agree to  
UpGrad's Terms & Conditions

Our Popular Machine Learning Course

Frequently Asked Questions (FAQs)

1What is the functional importance of sentiment analysis on social media?

In this age of social media, consumer opinions hold infinite power to make or break a brand. Sentiment analysis is the best way to understand the driving factors behind consumer opinions and emotions accurately. With data available from sentiment analysis, businesses can find their target audience, identify social media influencers for their brand, and spot emerging market trends. Moreover, sentiment analysis on social media platforms also offers a comprehensive idea of their brand health; businesses can receive honest consumer feedback about their newly launched products and services to improve thereupon. It is also effective in handling online reputation by better identifying scopes of improvement and problem resolution.

2Can you learn Python on your own?

If you have some level of experience with programming, you can certainly start learning Python on your own. Python comes with a comparatively simple syntactic structure and is also intuitive, making it easy to understand even for beginners. Besides, it is easy to install the package and execute it on your machine from anywhere. You can expedite your learning by enrolling in Python courses, attending coding bootcamps. With an active developer community that is always willing to extend support, an abundance of learning resources, and documentation, beginners can learn the fundamentals of Python in about 7-8 weeks.

3Which is more difficult to learn between Python and C++?

Python and C++ are entirely different programming languages with entirely different behavior and features. However, both strongly support object-oriented programming. But considering the overall Python framework, it is straightforward, much easier, and quicker to learn than C++. In fact, many people learn Python to experience what simple coding feels like. Python is open-source, free, and offers tremendous reliability and ease when it comes to developing complex machine learning applications. While C++ is more efficient in terms of performance, it is not considered suitable over Python for machine learning projects.

Explore Free Courses

Suggested Blogs

Artificial Intelligence course fees
Artificial intelligence (AI) was one of the most used words in 2023, which emphasizes how important and widespread this technology has become. If you
Read More

by venkatesh Rajanala

29 Feb 2024

Artificial Intelligence in Banking 2024: Examples &#038; Challenges
Introduction Millennials and their changing preferences have led to a wide-scale disruption of daily processes in many industries and a simultaneous g
Read More

by Pavan Vadapalli

27 Feb 2024

Top 9 Python Libraries for Machine Learning in 2024
Machine learning is the most algorithm-intense field in computer science. Gone are those days when people had to code all algorithms for machine learn
Read More

by upGrad

19 Feb 2024

Top 15 IoT Interview Questions &#038; Answers 2024 – For Beginners &#038; Experienced
These days, the minute you indulge in any technology-oriented discussion, interview questions on cloud computing come up in some form or the other. Th
Read More

by Kechit Goyal

19 Feb 2024

Data Preprocessing in Machine Learning: 7 Easy Steps To Follow
Summary: In this article, you will learn about data preprocessing in Machine Learning: 7 easy steps to follow. Acquire the dataset Import all the cr
Read More

by Kechit Goyal

18 Feb 2024

Artificial Intelligence Salary in India [For Beginners &#038; Experienced] in 2024
Artificial Intelligence (AI) has been one of the hottest buzzwords in the tech sphere for quite some time now. As Data Science is advancing, both AI a
Read More

by upGrad

18 Feb 2024

24 Exciting IoT Project Ideas &#038; Topics For Beginners 2024 [Latest]
Summary: In this article, you will learn the 24 Exciting IoT Project Ideas & Topics. Take a glimpse at the project ideas listed below. Smart Agr
Read More

by Kechit Goyal

18 Feb 2024

Natural Language Processing (NLP) Projects &amp; Topics For Beginners [2023]
What are Natural Language Processing Projects? NLP project ideas advanced encompass various applications and research areas that leverage computation
Read More

by Pavan Vadapalli

17 Feb 2024

45+ Interesting Machine Learning Project Ideas For Beginners [2024]
Summary: In this Article, you will learn Stock Prices Predictor Sports Predictor Develop A Sentiment Analyzer Enhance Healthcare Prepare ML Algorith
Read More

by Jaideep Khare

16 Feb 2024

Schedule 1:1 free counsellingTalk to Career Expert
footer sticky close icon