performing over sampling using SMOTE, drawing
recommendations and conclusions.
2 RELATED WORKS
Mohan, M., & Jadhav, A. (2022). uses machine
learning techniques like Hierarchical logistic
regression, decision tree, random forest, Ada Boost.
Factors like multiple subscription, switching
frequency, content satisfaction, price satisfaction
have higher impact on customer churn. These are
found with most effective algorithm among all those
Random Forest, this approach has higher accuracy
compared to others.
Retention strategies in order to reduce churn in
OTT platforms are clearly discussed in Senthil
Kumar, Needhi Devan, 2023. Finding the most
significant attributes for churn of a particular
individual, the OTT platform can take necessary steps
in order to reduce churn, Content satisfaction shows
more effect on churn, so by taking the videos, movies,
that are highly satisfied by the viewers churn can be
reduced. The highly satisfied content can be collected
by viewers or customers feedback, review of a
particular movie or video, etc. Showing the related
content to the viewers is another strategy of OTT.
This can be done by having the data of one viewer,
what kind of movies they are continuously watching,
what genre they are interested. Author used logistic
regression, multi-layer perceptron, random forest,
decision trees, and gradient boosting machines and
also bought the accuracy of 80%. However, they
faced the problems with the data, the model built was
complex, model drift.
Churn of one organization depends on the
competitors. With the increase of technology, OTT
platforms are increasing day-by-day, this can become
a big hurdle for one platform. So, these should build
strategies by keeping competitors in mind. These are
explained in detailed in Manish Mohan, Anil Jadhav
(2022). The availability of competing services, other
platform price, have effect in churn.
Other than machine learning algorithms,
Comprehensive Understanding was done in Srivalli
Leela., et al, 2021. They stated that there will be
increase of paid subscribers by 16.1% by the year
2028 i.e., the subscriber market will increase from
USD104.2 billion to USD293 billion.
Over-The-Top (OTT) providers and Internet
Services Providers (ISPs) joint service management
approach based on Customer Lifetime Value (CLV)
and benefits of joint services management are
discussed in A. Ahmad, et al, 2017. They also stated
that this can improve the customer experiences,
increase customer loyalty which are key factors in
reducing churn. Over-The- Top (OTT) providers and
Internet Services Providers (ISPs) joint service
management approach based on Quality of
Experience (QoE) and benefits of joint services
management A. Ahmad, et al, 2016. This is a measure
of satisfaction got by particular viewer with respect to
the service they received. Regression analysis is
performed between QoE and Churn in order to get a
relationship among those.
Over-The-Top (OTT) Churn is more affected by
content provided by particular OTT platforms and
also the price charged for that, plans and subscription
options provided. This is analyzed by performing
content analysis and economic analysis Priya
Malhotra, Akshay Kumar (2021). Found that
customers are increasing because they feel that OTT
platforms are for providing entertainment, treat as
stress busters. This was found based on various
factors like variety of content, affordability of OTT
subscriptions.
Sachika Luthra, The author stated there are
increase of OTT subscribers during Covid-19. Almost
7.5 million subscribers have been increased from the
year 2019 to 2020. Factors like Covid-19 pandemic,
increasing availability of high-speed internet, growth
popularity of streaming devices showed significant
effect on this particular growth.
3 METHODOLOGY
3.1 About the Dataset
The dataset contains 16 attributes along with the
target variable 'churn' that is binary which states 0-no,
1- yes. The 15 independent variables are Year,
customer_id, phone_no, Gender, Age, multi_screen,
no_of_days_subcribed, mail_subscrided, weekly_mi
ns_watched, maximum_daily_mins,minimum_daily
_mins, weekly_max_night_mins, videos_watched,
maximum_days_inactive, customer_support_calls.
The dataset contains 2000 entries.
3.2 Data Pre-Processing
The initial step of the project includes pre-processing
steps like removing unnecessary attributes, handling
null values, outlier detection, some visualizations,
creation of dummies. Here we removed customer_id,
year, phone_no attributes. While treating with null
values, we found the attributes gender,
maximum_days_inactive, churn have null values.