基于对数回归的电影票房影响因素分析
Analysis of Influencing Factors of Movie Box Office Based on Logarithmic Regression
摘要: 电影越来越成为一种潮流的象征,日益丰富着我们的日常生活。此次案例分析的数据为截至目前中国大陆上映的100部电影的票房数据及其影响因素,选取首日票房、上映首周票房、档期、题材、评分(猫眼)、平均票价、播放形式、电影时长、预告片播放量(亿)、想看人数(万人)、演员影响力、导演影响力、预算(万元)这13个指标作为电影票房的影响因素。首先,我们对数据进行预处理,对处理好的数据进行分类描述性统计分析,再对自变量与因变量进行相关性分析,通过分析发现,想看人数和首日票房之间存在较强的相关性,故在回归建模时删掉这两个变量;其次,我们使用逐步回归模型进一步筛选变量,使用对数回归模型探索对电影票房具有显著影响的指标;最后,我们得到档期、题材、播放形式、上映首周票房、评分、平均票价、预告片播放量这7个变量对票房有显著影响。较高票房的两类电影是动作片和喜剧片;导演影响力对于电影票房不存在很大的影响;票房较高的电影得分也较高,得分基本在9分左右,但并不是得分越高的电影票房也就越高。
Abstract: Cinema is becoming more and more a symbol of a trend, enriching our daily lives day by day. The data of this case analysis is the box office data and its influencing factors of the 100 films released in Chinese mainland so far, and selects 13 indicators such as the box office on the first day, the box office of the first week of release, the schedule, the theme, the score (Maoyan), the average ticket price, the broadcast form, the movie duration, the number of trailers played (100 million), the number of people who want to watch (10,000 people), the influence of actors, the influence of directors, and the budget (10,000 yuan) as the influencing factors of the movie box office. Firstly, we preprocessed the data, performed a categorical descriptive statistical analysis on the processed data, and then performed a correlation analysis between the independent variable and the dependent variable, and found that there was a strong correlation between the number of people who wanted to watch and the box office on the first day, so these two variables were deleted in the regression modeling. Secondly, we used a stepwise regression model to further screen the variables, and used a logarithmic regression model to explore the indicators that had a significant impact on the box office. Finally, we get that the 7 variables of schedule, theme, broadcast format, box office in the first week of release, rating, average ticket price, and trailer playback have a significant impact on the box office. The two categories of films that gross higher are action and comedy; The director’s influence does not have a great impact on the box office; Movies with higher box office scores are also higher, with a score of around 9 points, but it is not that movies with higher scores will also have higher box office.
文章引用:陈静羽, 李瑞雪, 黄月池, 史江兰. 基于对数回归的电影票房影响因素分析[J]. 电子商务评论, 2024, 13(4): 5861-5873. https://doi.org/10.12677/ecl.2024.1341825

参考文献

[1] 于兰婷. 影响国产电影票房的因素分析[J]. 中国电影市场, 2021(10): 17-23.
[2] 刘志新. 中国电影票房影响因素分析[J]. 合作经济与科技, 2019(17): 114-116.
[3] 程粮君. 电影票房影响因素分析——以2016-2017年票房过亿元的国产电影为例[J]. 声屏世界, 2018(4): 37-41.
[4] 杜久升, 赵贝贝, 侯争. 基于逐步回归的学习行为与成绩评估模型研究[J]. 测绘通报, 2023(S2): 148-151.
[5] Benoit, K. (2011) Linear Regression Models with Logarithmic Transformations. London School of Economics, London, 22, 23-36.
[6] 魏艳华, 王丙参, 张艺馨. 利用蒙特卡罗方法对QQ图检验的改进与比较[J]. 统计与决策, 2020, 36(16): 13-17.
[7] 霍伟光, 曹静杰, 陈雪, 等. 基于Cook距离的阻尼多道奇异谱分析分离绕射波[J]. 石油地球物理勘探, 2024, 59(4): 771-781.
[8] 王鹏, 李斌, 李佳伦, 等. 基于对数函数的岩石三轴强度回归预测模型[J]. 矿业研究与开发, 2023, 43(4): 103-109.
[9] 田密, 熊自民. 基于MARS与AIC准则的泥石流冲出距离数据驱动预测方法[J/OL]. 武汉大学学报(工学版), 2024: 1-11.
http://kns.cnki.net/kcms/detail/42.1675.T.20230828.0924.002.html, 2024-07-01.