Principal Basis Analysis and Application in Feature Selection of Water Quality Data
DOI: 10.12677/MOS.2016.54025. Supported by the National Natural Science Foundation of China.
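Authors: Hui Zou (邹辉): School of Economics and Management, Beihang University, Beijing; College of Science, China Agricultural University, Beijing; Zhihong Zou* (邹志红), Xiaojing Wang (王晓静): School of Economics and Management, Beihang University, Beijing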
Keywords: Gram-Schmidt Transform; Principal Basis; Variable Selection
Abstract: As environmental awareness grows and monitoring technology improves, water quality monitoring increasingly produces multivariate data whose variables are strongly correlated (multicollinear). The water quality data of the Taizi River are of this kind. To avoid the limitations of traditional methods, this paper applies principal basis analysis, based on the Gram-Schmidt transform, to select monitoring indicators from the Taizi River water quality data. With as little loss of the original information as possible, the method excludes all redundant variables and the overlapping information within the variable set, screens the information in a large-scale variable set effectively, and yields a low-dimensional orthonormal principal basis. Measuring the "net information content ratio" of the selected basis then makes it possible to choose representative water quality monitoring variables, which supports scientifically sound improvements to water quality monitoring work. Numerical experiments indicate that principal basis analysis based on the Gram-Schmidt transform is effective for analyzing the Taizi River water quality data.
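The selection idea summarized in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function name `principal_basis_select`, the stopping threshold `tol`, and the greedy scoring rule (pick the variable whose direction removes the most remaining variance from all variables) are assumptions made for illustration. The per-variable share computed here is a stand-in and may differ from the paper's "net information content ratio". The data are synthetic, not the Taizi River measurements.

```python
import numpy as np

def principal_basis_select(X, tol=0.05):
    """Greedy Gram-Schmidt variable selection (illustrative sketch only).

    X   : (n_samples, n_variables) array with no constant columns.
    tol : stop once the unexplained share of total variance falls below tol.
    Returns the selected column indices and the share of total variance
    removed by each selected direction.
    """
    # Standardize so every variable starts with the same weight.
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    R = Z.copy()                      # residuals after removing chosen directions
    total = np.sum(Z ** 2)
    selected, shares = [], []
    for _ in range(Z.shape[1]):
        norms = np.sum(R ** 2, axis=0)
        best, best_gain = None, 0.0
        for j in range(Z.shape[1]):
            # Skip variables already chosen or already fully explained.
            if j in selected or norms[j] < 1e-12:
                continue
            q = R[:, j] / np.sqrt(norms[j])      # unit vector along residual j
            gain = np.sum((q @ R) ** 2)          # variance this direction removes
            if gain > best_gain:
                best, best_gain = j, gain
        if best is None:
            break
        # Gram-Schmidt step: project the chosen direction out of every column.
        q = R[:, best] / np.sqrt(norms[best])
        R = R - np.outer(q, q @ R)
        selected.append(best)
        shares.append(best_gain / total)
        if np.sum(R ** 2) / total < tol:
            break
    return selected, shares

# Synthetic example: the fourth variable is the sum of the first two,
# so it carries no information beyond them.
rng = np.random.default_rng(0)
A = rng.normal(size=(50, 3))
X = np.column_stack([A, A[:, 0] + A[:, 1]])
selected, shares = principal_basis_select(X, tol=1e-8)
print(selected, shares)
```

Because each chosen direction is orthogonal to the earlier ones, the shares add up to the fraction of total variance explained, and a variable that is a linear combination of already selected variables is never picked.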
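Cite this article: Hui Zou, Zhihong Zou, Xiaojing Wang. Principal Basis Analysis and Application in Feature Selection of Water Quality Data [J]. Modeling and Simulation, 2016, 5(4): 198-204. http://dx.doi.org/10.12677/MOS.2016.54025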
