Blog_Banner_Asset
    Homebreadcumb forward arrow iconBlogbreadcumb forward arrow iconData Sciencebreadcumb forward arrow iconWhat makes a “Data Science” Post Popular on Medium?

What makes a “Data Science” Post Popular on Medium?

Last updated:
17th Oct, 2018
Views
Read Time
6 Mins
share image icon
In this article
Chevron in toc
View All
What makes a “Data Science” Post Popular on Medium?

In one of my previous posts on Medium, I had written about how to scrape search results for a particular query string from Medium. In this post, we will go into details of analyzing the data scrapped for the search term “Data Science” to group posts based on Number of claps and Responses into different levels of popularity and also understand what makes these posts popular.

Learn data science courses from the World’s top Universities. Earn Executive PG Programs, Advanced Certificate Programs, or Masters Programs to fast-track your career.

The data scrapped from Medium search results was JSON file with extensive data about each search result. To explore the structure of JSON file, I used Notepad++ with JSON plugin. The JSON file had data about the posts, author of the post and publisher associated with that post (if any). Here’s the JSON data structure for a medium post:

The code to extract data from the JSON file can be found here. In addition to extracting data from the JSON file, I also added a field with the date when the post was scrapped.

Explore our Popular Data Science Courses

Data Science Summarized In One Picture

Exploratory Analysis of Posts Related to “Data Science”

On scraping results for search term “Data Science”, 831 posts were scrapped, out of which 31 were responses to a post and were excluded from the analysis. Here are the number of posts published over years, the data scrapped was from March 2013 to April 2018:

All the date fields like Created Date, First Published Date, Last Updated Date wherein milliseconds elapsed since Jan 1970. They were converted into a human readable date format using the function below

Top Essential Data Science Skills to Learn

# Function to Convert EPOCH Date to Human Readable format
def convertToDateString(date):
    return (datetime(1970, 1, 1) + timedelta(milliseconds=date)).strftime("%Y-%m-%d %H:%M:%S")

The next step was to look at what words were most commonly occurring in the titles of these posts. As you can see from the word cloud below, Data Science, Big Data, AI, Analytics, Machine Learning, Python, self-driven (about self-driving cars) are some of the most frequently occurring words.

The distribution of Number of Claps, Number of Responses is highly skewed. 708 posts have less than 500 claps. This shows that there are few posts which become popular. Here’s the distribution of claps:

The Reading Time (mins) of most articles is between 1 to 3 min.

On Medium, each post can have a maximum of 5 tags. Tags help readers find content more easily. The more relevant tags, the easier to find. As we can see in the image, Data Science is the most frequently used tag, followed by Machine Learning, Big data, Artificial Intelligence. Here are top 10 tags related to data science:


Why Data Science Jobs are in High Demand?

Creating Clusters Based on User Responses

There are three metrics to measure how popular a post is on Medium viz. #Claps, #Responses and #Recommends. To make a fair comparison, I also included feature #Days between First Published and data collection date.On this feature set, I applied k-means clustering and identified three clusters. As we can see from the image below, there is a huge difference between the three metrics across clusters (Popularity Groups). Also, we can see that for the less popular posts though their median days between publishing and scrapping is the highest their engagement is very low. Here are the metrics across clusters (Popularity Groups):

upGrad’s Exclusive Data Science Webinar for you –

ODE Thought Leadership Presentation

Understanding What Makes a Data Science Post Popular

As we can see from the image below, for more popular articles the median for high and medium popularity articles are 9 and 7. They also have more links compared to less popular articles. This means that Popular posts refer to other posts and other sources of information adding more value to the content. Difference between Popular and Non-Popular Posts

From the image above, we can also see that the post with medium popularity is closer to a highly popular group than to the less popular group.

Applications of Data Science and Machine Learning in NETFLIX

Read our popular Data Science Articles

With a simple k-means, we were able to identify popular and non-popular posts on Medium related to Data Science.

 

This blog was originally published on Medium by Aiswarya Ramachandran – an alumnus of UpGrad’s Data Science program with IIIT-Bangalore.

Profile
Aiswarya is a News Analyst in Thomson Reuters with 1.5 years of experience in deploying primary and secondary research techniques to derive Reporting Analysis, understanding customer segments and performing competition analysis thus facilitating achievement of companies growth objectives. Previously, she has been a research associate in Indian Institute of Science, Bangalore. She's an alumnus of UpGrad's Data Science Program with IIIT-Bangalore.

Frequently Asked Questions (FAQs)

1When it comes to Medium, how often should you post?

If you want to be successful on Medium and can't post every day, at least write 3 to 5 times each week. Consistency is the most essential thing you should strive for. Whatever timetable you come up with, be sure it is sustainable in the long term and stick to it.

2Is it possible for anyone to get published on Medium?

Anyone may create a free Medium account and begin blogging right away. Writers can submit standalone pieces, contribute to collections of collected stories, or create their own collection. With their simple editor, you can share your experiences with the world as a Medium writer. Publishing on Medium is completely free, and your stories will be shared with your followers as well as millions of other people who are interested in similar themes.

3On Medium, what is Towards Data Science?

The company, Towards Data Science Inc., is based in Canada. They use Medium to create a forum for thousands of individuals to share ideas and learn more about data science. Authors can choose to restrict access to their posts to members exclusively as part of the Medium ecosystem. Through the Medium Partner Program, you can reach a larger audience and earn money by publishing in Towards Data Science. In line with the Medium Terms of Service, which you agree to when creating a Medium account, you are also the sole owner of your work.

Explore Free Courses

Suggested Blogs

Top 12 Reasons Why Python is So Popular With Developers in 2024
99361
In this article, Let me explain you the Top 12 Reasons Why Python is So Popular With Developers. Easy to Learn and Use Mature and Supportive Python C
Read More

by upGrad

31 Jul 2024

Priority Queue in Data Structure: Characteristics, Types & Implementation
57691
Introduction The priority queue in the data structure is an extension of the “normal” queue. It is an abstract data type that contains a
Read More

by Rohit Sharma

15 Jul 2024

An Overview of Association Rule Mining & its Applications
142465
Association Rule Mining in data mining, as the name suggests, involves discovering relationships between seemingly independent relational databases or
Read More

by Abhinav Rai

13 Jul 2024

Data Mining Techniques & Tools: Types of Data, Methods, Applications [With Examples]
101802
Why data mining techniques are important like never before? Businesses these days are collecting data at a very striking rate. The sources of this eno
Read More

by Rohit Sharma

12 Jul 2024

17 Must Read Pandas Interview Questions & Answers [For Freshers & Experienced]
58170
Pandas is a BSD-licensed and open-source Python library offering high-performance, easy-to-use data structures, and data analysis tools. The full form
Read More

by Rohit Sharma

11 Jul 2024

Top 7 Data Types of Python | Python Data Types
99516
Data types are an essential concept in the python programming language. In Python, every value has its own python data type. The classification of dat
Read More

by Rohit Sharma

11 Jul 2024

What is Decision Tree in Data Mining? Types, Real World Examples & Applications
16859
Introduction to Data Mining In its raw form, data requires efficient processing to transform into valuable information. Predicting outcomes hinges on
Read More

by Rohit Sharma

04 Jul 2024

6 Phases of Data Analytics Lifecycle Every Data Analyst Should Know About
82932
What is a Data Analytics Lifecycle? Data is crucial in today’s digital world. As it gets created, consumed, tested, processed, and reused, data goes
Read More

by Rohit Sharma

04 Jul 2024

Most Common Binary Tree Interview Questions & Answers [For Freshers & Experienced]
10561
Introduction Data structures are one of the most fundamental concepts in object-oriented programming. To explain it simply, a data structure is a par
Read More

by Rohit Sharma

03 Jul 2024

Schedule 1:1 free counsellingTalk to Career Expert
icon
footer sticky close icon