Blog_Banner_Asset
    Homebreadcumb forward arrow iconBlogbreadcumb forward arrow iconArtificial Intelligencebreadcumb forward arrow iconTop 12 Linear Regression Interview Questions & Answers [For Freshers]

Top 12 Linear Regression Interview Questions & Answers [For Freshers]

Last updated:
12th Mar, 2021
Views
Read Time
8 Mins
share image icon
In this article
Chevron in toc
View All
Top 12 Linear Regression Interview Questions & Answers [For Freshers]

Data Science and Machine Learning Interviews revolve a lot around Machine Learning algorithms and techniques. Linear Regression is the most frequently asked of them as it is generally the most basic algorithm one studies. Not only that, Linear Regression is widely used across the industry in multiple domains. 

Top Machine Learning and AI Courses Online

Linear Regression Interview Questions & Answers

Question 1: How Does Linear Regression Work?

Linear Regression, as its name implies, tries to model the data using a linear relation of the independent variables to the dependent variable or the target. If there is just one independent variable/feature, it is called Simple Linear Regression. If there are multiple features, it is called Multivariate Linear Regression.

Ads of upGrad blog

Trending Machine Learning Skills

Regression, basically, means finding the best fit line/curve to your numerical data — a functional approximation of the data. That is, you want a mapping function of your input data to the output data (target). This mapping function is written as:

Ŷ = W*X + B

where B is the intercept and W is the slope of the line and Ŷ is the predicted output. The optimum values of W and B need to be found to find the best fit line

Question 2: How Does Linear Regression Find Optimal Point? 

Linear Regression uses the Least Squares method to find the optimal point where the squared error is minimum. It finds the optimal values of the weight by an iterative and approximation method called Gradient Descent. Initially, random values of the weights are taken and then the loss is calculated for each instance.

After calculating the cumulative error of the whole dataset, a small step towards the minima is taken and the weights are updated by this change. Slowly, by taking these small steps towards the minima, the values of the weights reach approximately to the minima and the algorithm exits.

Question 3: What is Learning Rate?

Learning Rate or alpha is a hyperparameter that needs to be of the optimal value for the algorithm to converge quickly with the least error. Alpha controls the magnitude of the step size taken during Gradient Descent for converging to global minima.

The bigger the value of alpha, the larger will be the step size and the convergence might be faster. If alpha is too small, then it might take a long time to converge. But if the alpha is too big then it might start overshooting and not converge at all. Finding the right value of alpha is done during Hyperparameter optimization.

Question 4: What are the Assumptions of Linear Regression?

Linear Regression makes a lot of assumptions about the data to make calculations easier. And that makes it a lot more vulnerable to poor results as the data might not agree with those assumptions. Some of the most vulnerable assumptions are:

  1. Linear Relationship: First and the most obvious assumption it makes is that the features are linearly related to the target. In other words, the best fit line will be linear. But this usually is not the case most of the times.
  2. No Multicollinearity: Linear Regression tries to estimate coefficients of all the features according to their impact on the target. But this calculation is hampered when features themselves are dependent/collinear to each other.
  3. Homoscedasticity: With reference to LR, Homoscedasticity means that the errors or the residuals have similar values. In other words, if you plot the residuals vs predicted values, there should be no clear pattern. However, if the data has heteroscedasticity, the assumption would be broken and results can’t be trusted.

Question 5: What are the Different Types of Gradient Descent in Linear Regression?

There are mainly 3 types of gradient descents

Vanilla Gradient Descent updates the weights after every epoch, which means that in essence, it takes the average loss of all the iterations of training instances and then updates the weights at the end of the epoch. 

This is not ideal as it might not capture details, hence Stochastic Gradient Descent updates the weights with the loss obtained in every iteration in every epoch. That’s a lot of updates! So this makes the optimization curve noisy and time-consuming as well.

Mini-Batch Gradient Descent is sort of a middle ground between Vanilla and Stochastic. It forms batches of the complete dataset and then updates the weights at the end of every batch. This not only makes the optimization better and faster but also helps when the dataset is huge and you cannot load all of it at once.

Question 6: What is Heteroscedasticity?

With reference to Linear Regression, Heteroscedasticity simply means that the residuals of the observations do not possess the same variances. This would mean that the observations are actually from different probability distributions with different variances. And this defies one of the assumptions of Linear Regression. The quickest way to check for Heteroscedasticity would be to plot residuals against the predictions and see for any pattern. If a pattern exists, there might be Heteroscedasticity present.

Question 7: What is Multicollinearity and How can it Impact the Model?

Multicollinearity occurs when multiple features in a regression model are correlated or dependent on each other to some extent. Change in the value of one feature will also force change the value of features collinear to it. In other words, such features add no more information to the model. This can lead to Overfitting as it might give unpredictable results on unseen data. 

Question 8: How to Measure Multicollinearity?

To measure Multicollinearity, the 2 most common techniques are – Correlation Matrix and Variance Inflation Factor(VIF). The correlation Matrix just contains the correlation values of each feature with every other feature. Extreme values signify a high correlation.

VIF is another method to quantify correlation, with the value of 1 meaning no Collinearity and >5 meaning high collinearity.

Question 9: What are the Loss Functions used in Linear Regression?

Mean Squared Error and Root Mean Squared Error are the two most common loss functions used in Linear Regression. 

Question 10: What Metrics are used for Linear Regression?

The most common metrics used for Linear Regression are R Squared score and Adjusted R Squared score. The higher the value of R2, the better is the performance of the model. However, this is not true all the times as R2 always increases upon adding new features. This means that even if the feature is not significant, the R2 value will still increase. This shortcoming is overcome by Adjusted R Square which increases only if the newly added feature is significant. 

Also Read: Linear Regression Models

Question 11: What are the Limitations of Linear Regression?

One limitation of LR is that it is quite sensitive to outliers in the data. Another limitation is the high bias in it due to its assumptions of the data. This can lead to a very poor model. 

Question 12: What are the Different Types of Regularized Regression Algorithms?

There are mainly two types of regularized versions of Linear Regression: Ridge and Lasso. Both the algorithms include a penalty term which helps reduce the overfitting of the linear model. Lasso applies the absolute penalty, so some terms or weights of features less significant reduce to zero. With Ridge, the coefficients of less significant features come close to zero as it uses squared penalties.

Ads of upGrad blog

Popular AI and ML Blogs & Free Courses

Conclusion

Linear Regression is the most fundamental algorithm in Machine Learning. In this tutorial, we covered some fundamental questions that are very frequently asked in interviews. The interviewers can also ask scenario-based questions by giving examples of some data and results.

upGrad provides a PG Diploma in Machine Learning and AI and a  Master of Science in Machine Learning & AI that may guide you toward building a career. These courses will explain the need for Machine Learning and further steps to gather knowledge in this domain covering varied concepts ranging from Gradient Descent to Machine Learning.

Profile

Pavan Vadapalli

Blog Author
Director of Engineering @ upGrad. Motivated to leverage technology to solve problems. Seasoned leader for startups and fast moving orgs. Working on solving problems of scale and long term technology strategy.
Get Free Consultation

Select Coursecaret down icon
Selectcaret down icon
By clicking 'Submit' you Agree to  
UpGrad's Terms & Conditions

Our Popular Machine Learning Course

Frequently Asked Questions (FAQs)

1Should freshers learn machine learning?

Even though machine learning jobs are predominantly for experienced professionals, freshers can also bag these jobs provided they have the required skill set and knowledge. Freshers need to acquire skills in programming using languages like R and Python, which are extensively used for writing machine learning programs. They need to be familiar with mathematics, especially statistics and probability, and have a basic understanding of distributed computing and machine learning algorithms. The rising demand for proficient machine learning engineers makes this field a highly lucrative career option. Thus, machine learning can offer a significant career boost to freshers.

2How much does a machine learning engineer earn?

The average salary of a machine learning engineer in India ranges from around INR 6.5 to 8 lakhs an annum. Candidates with a Master's degree or advanced degrees in data science or analytics and relevant work experience of 5 to 15 years can expect to earn INR 8 to 16 lakhs a year. Machine learning engineers with mid-level work experience are also known to earn at least INR 1 crore. However, how much a machine learning engineer makes depends on several factors like essential skill set, knowledge, overall work experience, certifications, location, relevant hands-on experience, problem-solving abilities, and more. It also depends on the role and compensation that the recruiting organization offers.

3Are data scientist jobs rewarding in India?

Currently, proficient data scientists are in high demand in India, but there are not enough of them to fill the vacancies. This demand is mainly created by Indian businesses and multinational organizations that are increasingly shifting towards digitization to stay ahead of the competition. Increasing digital footprints are generating massive volumes of user data, creating more job opportunities for data scientists. With 4-10 years of work experience, data science professionals can earn in the range of INR 35-60 lakhs, while more experienced professionals can earn up to INR 1 crore a year.

Explore Free Courses

Suggested Blogs

Artificial Intelligence course fees
5375
Artificial intelligence (AI) was one of the most used words in 2023, which emphasizes how important and widespread this technology has become. If you
Read More

by venkatesh Rajanala

29 Feb 2024

Artificial Intelligence in Banking 2024: Examples & Challenges
6099
Introduction Millennials and their changing preferences have led to a wide-scale disruption of daily processes in many industries and a simultaneous g
Read More

by Pavan Vadapalli

27 Feb 2024

Top 9 Python Libraries for Machine Learning in 2024
75567
Machine learning is the most algorithm-intense field in computer science. Gone are those days when people had to code all algorithms for machine learn
Read More

by upGrad

19 Feb 2024

Top 15 IoT Interview Questions & Answers 2024 – For Beginners & Experienced
64421
These days, the minute you indulge in any technology-oriented discussion, interview questions on cloud computing come up in some form or the other. Th
Read More

by Kechit Goyal

19 Feb 2024

Data Preprocessing in Machine Learning: 7 Easy Steps To Follow
152698
Summary: In this article, you will learn about data preprocessing in Machine Learning: 7 easy steps to follow. Acquire the dataset Import all the cr
Read More

by Kechit Goyal

18 Feb 2024

Artificial Intelligence Salary in India [For Beginners & Experienced] in 2024
908641
Artificial Intelligence (AI) has been one of the hottest buzzwords in the tech sphere for quite some time now. As Data Science is advancing, both AI a
Read More

by upGrad

18 Feb 2024

24 Exciting IoT Project Ideas & Topics For Beginners 2024 [Latest]
759415
Summary: In this article, you will learn the 24 Exciting IoT Project Ideas & Topics. Take a glimpse at the project ideas listed below. Smart Agr
Read More

by Kechit Goyal

18 Feb 2024

Natural Language Processing (NLP) Projects & Topics For Beginners [2023]
107583
What are Natural Language Processing Projects? NLP project ideas advanced encompass various applications and research areas that leverage computation
Read More

by Pavan Vadapalli

17 Feb 2024

45+ Interesting Machine Learning Project Ideas For Beginners [2024]
328092
Summary: In this Article, you will learn Stock Prices Predictor Sports Predictor Develop A Sentiment Analyzer Enhance Healthcare Prepare ML Algorith
Read More

by Jaideep Khare

16 Feb 2024

Schedule 1:1 free counsellingTalk to Career Expert
icon
footer sticky close icon