Blog_Banner_Asset
    Homebreadcumb forward arrow iconBlogbreadcumb forward arrow iconData Sciencebreadcumb forward arrow iconWhat makes a “Data Science” Post Popular on Medium?

What makes a “Data Science” Post Popular on Medium?

Last updated:
17th Oct, 2018
Views
Read Time
6 Mins
share image icon
In this article
Chevron in toc
View All
What makes a “Data Science” Post Popular on Medium?

In one of my previous posts on Medium, I had written about how to scrape search results for a particular query string from Medium. In this post, we will go into details of analyzing the data scrapped for the search term “Data Science” to group posts based on Number of claps and Responses into different levels of popularity and also understand what makes these posts popular.

Learn data science courses from the World’s top Universities. Earn Executive PG Programs, Advanced Certificate Programs, or Masters Programs to fast-track your career.

The data scrapped from Medium search results was JSON file with extensive data about each search result. To explore the structure of JSON file, I used Notepad++ with JSON plugin. The JSON file had data about the posts, author of the post and publisher associated with that post (if any). Here’s the JSON data structure for a medium post:

The code to extract data from the JSON file can be found here. In addition to extracting data from the JSON file, I also added a field with the date when the post was scrapped.

Explore our Popular Data Science Courses

Data Science Summarized In One Picture

Exploratory Analysis of Posts Related to “Data Science”

On scraping results for search term “Data Science”, 831 posts were scrapped, out of which 31 were responses to a post and were excluded from the analysis. Here are the number of posts published over years, the data scrapped was from March 2013 to April 2018:

All the date fields like Created Date, First Published Date, Last Updated Date wherein milliseconds elapsed since Jan 1970. They were converted into a human readable date format using the function below

Top Essential Data Science Skills to Learn

# Function to Convert EPOCH Date to Human Readable format
def convertToDateString(date):
    return (datetime(1970, 1, 1) + timedelta(milliseconds=date)).strftime("%Y-%m-%d %H:%M:%S")

The next step was to look at what words were most commonly occurring in the titles of these posts. As you can see from the word cloud below, Data Science, Big Data, AI, Analytics, Machine Learning, Python, self-driven (about self-driving cars) are some of the most frequently occurring words.

The distribution of Number of Claps, Number of Responses is highly skewed. 708 posts have less than 500 claps. This shows that there are few posts which become popular. Here’s the distribution of claps:

The Reading Time (mins) of most articles is between 1 to 3 min.

On Medium, each post can have a maximum of 5 tags. Tags help readers find content more easily. The more relevant tags, the easier to find. As we can see in the image, Data Science is the most frequently used tag, followed by Machine Learning, Big data, Artificial Intelligence. Here are top 10 tags related to data science:


Why Data Science Jobs are in High Demand?

Creating Clusters Based on User Responses

There are three metrics to measure how popular a post is on Medium viz. #Claps, #Responses and #Recommends. To make a fair comparison, I also included feature #Days between First Published and data collection date.On this feature set, I applied k-means clustering and identified three clusters. As we can see from the image below, there is a huge difference between the three metrics across clusters (Popularity Groups). Also, we can see that for the less popular posts though their median days between publishing and scrapping is the highest their engagement is very low. Here are the metrics across clusters (Popularity Groups):

upGrad’s Exclusive Data Science Webinar for you –

ODE Thought Leadership Presentation

Understanding What Makes a Data Science Post Popular

As we can see from the image below, for more popular articles the median for high and medium popularity articles are 9 and 7. They also have more links compared to less popular articles. This means that Popular posts refer to other posts and other sources of information adding more value to the content. Difference between Popular and Non-Popular Posts

From the image above, we can also see that the post with medium popularity is closer to a highly popular group than to the less popular group.

Applications of Data Science and Machine Learning in NETFLIX

Read our popular Data Science Articles

With a simple k-means, we were able to identify popular and non-popular posts on Medium related to Data Science.

 

This blog was originally published on Medium by Aiswarya Ramachandran – an alumnus of UpGrad’s Data Science program with IIIT-Bangalore.

Profile
Aiswarya is a News Analyst in Thomson Reuters with 1.5 years of experience in deploying primary and secondary research techniques to derive Reporting Analysis, understanding customer segments and performing competition analysis thus facilitating achievement of companies growth objectives. Previously, she has been a research associate in Indian Institute of Science, Bangalore. She's an alumnus of UpGrad's Data Science Program with IIIT-Bangalore.

Frequently Asked Questions (FAQs)

1When it comes to Medium, how often should you post?

If you want to be successful on Medium and can't post every day, at least write 3 to 5 times each week. Consistency is the most essential thing you should strive for. Whatever timetable you come up with, be sure it is sustainable in the long term and stick to it.

2Is it possible for anyone to get published on Medium?

Anyone may create a free Medium account and begin blogging right away. Writers can submit standalone pieces, contribute to collections of collected stories, or create their own collection. With their simple editor, you can share your experiences with the world as a Medium writer. Publishing on Medium is completely free, and your stories will be shared with your followers as well as millions of other people who are interested in similar themes.

3On Medium, what is Towards Data Science?

The company, Towards Data Science Inc., is based in Canada. They use Medium to create a forum for thousands of individuals to share ideas and learn more about data science. Authors can choose to restrict access to their posts to members exclusively as part of the Medium ecosystem. Through the Medium Partner Program, you can reach a larger audience and earn money by publishing in Towards Data Science. In line with the Medium Terms of Service, which you agree to when creating a Medium account, you are also the sole owner of your work.

Explore Free Courses

Suggested Blogs

Data Science for Beginners: A Comprehensive Guide
5015
Data science is an important part of many industries today. Having worked as a data scientist for several years, I have witnessed the massive amounts
Read More

by Harish K

28 Feb 2024

6 Best Data Science Institutes in 2024 (Detailed Guide)
5020
Data science training is one of the most hyped skills in today’s world. Based on my experience as a data scientist, it’s evident that we are in
Read More

by Harish K

28 Feb 2024

Data Science Course Fees: The Roadmap to Your Analytics Career
5036
A data science course syllabus covers several basic and advanced concepts of statistics, data analytics, machine learning, and programming languages.
Read More

by Harish K

28 Feb 2024

Inheritance in Python | Python Inheritance [With Example]
17101
Python is one of the most popular programming languages. Despite a transition full of ups and downs from the Python 2 version to Python 3, the Object-
Read More

by Rohan Vats

27 Feb 2024

Data Mining Architecture: Components, Types & Techniques
10582
Introduction Data mining is the process in which information that was previously unknown, which could be potentially very useful, is extracted from a
Read More

by Rohit Sharma

27 Feb 2024

6 Phases of Data Analytics Lifecycle Every Data Analyst Should Know About
79395
What is a Data Analytics Lifecycle? Data is crucial in today’s digital world. As it gets created, consumed, tested, processed, and reused, data goes
Read More

by Rohit Sharma

19 Feb 2024

Sorting in Data Structure: Categories & Types [With Examples]
137473
The arrangement of data in a preferred order is called sorting in the data structure. By sorting data, it is easier to search through it quickly and e
Read More

by Rohit Sharma

19 Feb 2024

Data Science Vs Data Analytics: Difference Between Data Science and Data Analytics
67761
Summary: In this article, you will learn, Difference between Data Science and Data Analytics Job roles Skills Career perspectives Which one is right
Read More

by Rohit Sharma

19 Feb 2024

13 Exciting Python Projects on Github You Should Try Today [2023]
44748
Python is one of the top choices in programming languages among professionals worldwide. Its straightforward syntax allows software developers and dat
Read More

by Hemant

19 Feb 2024

Schedule 1:1 free counsellingTalk to Career Expert
icon
footer sticky close icon