Rudnev S.G.1,2, Nikolaev D.V.1,3, Korostylev K.A.1,3, Starunova O.A.1,3, Schelykalina S.P.1,3, Eryukova T.A.1,3, Kolesnikov V.A.1,3, Starodubov V.I.1
1Federal Research Institute for Health Organization and Informatics of Ministry of Health of the Russian Federation, Moscow
2Institute of Numerical Mathematics of the Russian Academy of Sciences, Moscow
3“Medas” Scientific Research Centre, Moscow
4Pirogov Russian National Research Medical University, Moscow
Contacts: Sergey G. Rudnev, e-mail:
This e-mail address is being protected from spam bots, you need JavaScript enabled to view it
This article was prepared under the framework of the Russian
Science Foundation project ‘Development of methodology for population
screening of physical growth and development, state of health and
nutrition. Assessment of epidemiological risks’ (grant no. 14-15-01085).
Abstract. Significance. The national
network of Health Centers is a complex distributed system that
continuously, since 2010, generates mass data on preventive screening.
Manual analysis of quality and reliability of the data collected in
Health Centers is not possible, while official reporting of Health
Centers may not, in some cases, reflect the real situation. So it is
necessary to develop automated algorithms for quality control and
enhancement of reliability of preventive screening data.
The purpose of the study was to implement elements of
the big data technology for analyzing results of preventive screening in
Health Centers exemplified by the bioimpedance measurement data,
retrospectively evaluate quality and reliability of data, and explore
their applicability for epidemiological monitoring.
Materials and methods. Bioimpedance data from the
Federal Information Resource of Health Centers database was combined
with the submitted data of bioimpedance measurements according to the
letter by the Ministry of Health of the Russian Federation
#14-1/10/2-3200 as of October 24, 2012, as well as with the submitted
data according to the letter by the Federal Research Institute for
Health Organization and Informatics of the Russian Health Ministry
#7-5/434 as of July 2, 2015. The initial number of records in the
bioimpedance database was 2.35 million. The data were obtained from 320
Health Centers in 62 Federal Subjects and eight Federal Districts of the
Russian Federation.
Results. In half of the Health Centers the quality of
bioimpedance data was 93.5% or higher. However, the proportion of
incorrect data grew steadily reaching 28.1% in 2014. The incorrect data
consisted mainly of frauds (50.6%) and measurement errors (48.5%). The
number of records in the database after removal of incorrect data and
repeated measurements equaled to 1.64 million. Based on calculated
parameters of the distributions of body mass index using the software
package GAMLSS, the prevalence of overweight, obesity and wasting in the
study group was estimated among males according to the WHO criteria.
The age-standardized obesity prevalence in males was 11.0% at the age of
5-17 years, and 17.7% at the age of 18-85 years.
Discussion. The use of big data technology
allowed to evaluate quality of data and identify incorrect data of
bioimpedance measurements. This offers an opportunity for taking
managerial decisions to correct the identified violations. Results of
the comparison with independent anthropometric data show
representativeness of the Health Centers’ data for children and
adolescents.
Conclusions. 1) Based on bioimpedance data, our
mass data analysis showed that quality and accuracy of the raw data on
preventive screening in Health Centers was gradually decreasing. This
suggests ineffectiveness of control measures.
2) Effective quality management of Health Centers’ activities is possible through the use of big data technology.
3) Data of the Federal Information Resource of Health Centers may be
suitable for epidemiological monitoring upon application of the
selection criteria.
Keywords. Health Centers; Federal Information Resource
of Health Centers; preventive screening; big data; frauds detection and
removal; data compression; data standardization.
References
-
Avtomatizirovannaya sistema monitoringa FGBU «TsNIIOIZ» Minzdrava
Rossii. [Automated system for monitoring of the Federal Research
Institute for Health Organization and Informatics of Ministry of Health
of the Russian Federation]. [Online] [cited 2015 Aug 19]. Available
from: http://asmms.mednet.ru (In Russian).
-
Boytsov S.A., Chuchalin A.G., Arutyunov G.P., Bilichenko T.N., Bubnova
M.G., Ipatov P.V., et al. Profilaktika khronicheskikh neinfektsionnykh
zabolevaniy: rekomendatsii. [Prevention of chronic non-communicable
diseases: the recommendations]. Moscow: GNITs profilakticheskoy
meditsiny MZ RF; 2013. 138 p. (In Russian).
-
Goskontrakty ¹K-29-T/341 ot 06.11.2009 g., ¹K-29-T/105 ot 23.09.2010
g., ¹019500000212000144-0132259-02 ot 25.10.2012 g. [State contracts no.
Ê-29-Ò/341 of November 6, 2009, no. Ê-29-Ò/105 of September 23, 2010,
and no. 019500000212000144-0132259-02 of October 25, 2012]. (In
Russian).
-
Isaev E.A., Kornilov V.V. Problema obrabotki i khraneniya bol'shikh
ob"emov nauchnykh dannykh i podkhody k ee resheniyu. [The problem of
processing and storage of large amounts of scientific data and
approaches to its solution]. Matematicheskaya biologiya i bioinformatika 2013; 8 (1): 49-65. (In Russian).
-
Kobyakova O.S., Starovoytova E.A., Kulikov E.S., Kirillova N.A., Deev
I.A., Boykov V.A., et al. Model' posetitelya tsentrov zdorov'ya Tomskoy
oblasti i rasprostranennost' osnovnykh faktorov riska khronicheskikh
neinfektsionnykh zabolevaniy sredi posetiteley tsentrov zdorov'ya. [The
model of a visitor of health centers of Tomsk region and the prevalence
of main risk factors of chronic non-communicable diseases in visitors of
health centers]. Zdravookhranenie Rossiyskoy Federatsii 2014; 58 (1): 16-19. (In Russian).
-
Kul'ba S.N., Voynov V.B., Pozharskaya E.N., Glumov A.G., Gaponenko
M.V., Sokolova E.E.,, et al. Osobennosti indeksa massy tela u
shkol'nikov Rostovskoy oblasti [The features of body mass index in
schoolchildren of Rostov region]. Valeologiya 2014; 4: 62-69. (In Russian).
-
Mayer-Shönberger V, Cukier K. Big data: revolution that will transform
how we live, work, and think [Transl. from Eng]. Moscow: Mann, Ivanov i
Ferber; 2014. 240 p. (In Russian).
-
Nikolaev D.V., Smirnov A.V., Bobrinskaya I.G., Rudnev S.G.
Bioimpedansnyy analiz sostava tela cheloveka [Bioimpedance analysis of
human body composition]. Moscow: Nauka; 2009. 392 p. (in Russian).
-
Pogosova N.V., Vergazova E.K., Ausheva A.K., Suvorov S.S., Boytsov S.A.
Tsentry zdorov'ya: dostignutye rezul'taty i perspektivy. [Health
centers: achieved results and prospects]. Profilakticheskaya meditsina 2014; 17 (4): 16-24. (In Russian).
-
Rudnev S.G., Soboleva N.P., Sterlikov S.A., Nikolaev D.V., Starunova
O.A., Chernykh S.P., et al. Bioimpedansnoe issledovanie sostava tela
naseleniya Rossii [Bioimpedance study of body composition in the Russian
population]. Moscow: RIO TsNIIOIZ; 2014. 493 p. (In Russian).
-
Smirnov N. Bol'shie dannye: kogda mashina luchshe eksperta. [Big data: when the computer is better than expert]. Direktor informatsionnoy sluzhby [serial online] 2014 [cited 2015 Oct 26]; (12): 24. Available from: http://www.osp.ru/cio/2014/12/13044348/ (In Russian).
-
Starodubov V.I., Rudnev S.G., Nikolaev D.V., Korostylev K.A.
Federal'nyy informatsionnyy resurs tsentrov zdorov'ya: sovremennoe
sostoyanie i perspektivy razvitiya [Federal information resource of
health centers: current state and developmental perspectives]. Sotsial'nye aspekty zdorov'ya naseleniya [serial online] 2015 [cited 2015 Oct 26]; 45 (5). Available from: http://vestnik.mednet.ru/content/view/706/27/lang,ru/ (In Russian).
-
Starunova O.A. Sozdanie programmnoy sredy dlya statisticheskoy
obrabotki dannykh bioimpedansnykh izmereniy. [Creating a software
environment for statistical data processing of bioimpedance
measurements]. Sbornik statey molodykh uchenykh fakul'teta VMK MGU 2011; (8): 130-135. (In Russian).
-
Shal'nova S.A., Deev A.D. Tendentsii smertnosti v Rossii v nachale XXI
veka (po dannym ofitsial'noy statistiki) [Russian mortality trends in
the early XXI century: official statistics data]. Kardiovaskulyarnaya terapiya i profilaktika 2011; 10 (6): 5-10. (In Russian).
-
Chen Z, Chen J, Collins R, Guo Y, Peto R, Wu F, et al. China Kadoorie
Biobank of 0.5 million people: survey methods, baseline characteristics
and long-term follow-up. Int J Epidemiol 2011; 40 (6): 1-15.
-
Clasey JL, Bradley KD, Bradley JW, Long DE, Griffith JR. A new BIA equation estimating the body composition of young children. Obesity 2011; 19 (9): 1813-1817.
-
De Onis M, Onyango AW, Borghi E, Siyam A, Nishida C, Siekmann J.
Development of a WHO growth reference for school-aged children and
adolescents. Bull WHO 2007; 85 (9): 660-667.
-
Franssen FME, Rutten EPA, Groenen MTJ, Vanfleteren LE, Wouters EFM,
Spruit MA. New reference values for body composition by bioelectrical
impedance analysis in the general population: results from the UK
Biobank. J Am Med Dir Assoc 2014; 15 (6): e1-448.e6.
-
Grimnes S, Martinsen O. Bioimpedance and bioelectricity basics, 3rd ed. London: Academic Press; 2014. 584 p.
-
Houtkooper LB, Going SB, Lohman TG, Roche AF, Van Loan M. Bioelectrical
impedance estimation of fat-free body mass in children and youth: a
cross-validation study. J Appl Physiol 1992; 72 (1): 366-373.
-
Jackson LV, Thalange NKS, Cole TJ. Blood pressure centiles for Great Britain. Arch Dis Child 2007; 92 (4): 298-303.
-
Janssen I, Heymsfield SB, Ross R. Low relative skeletal muscle mass
(sarcopenia) in older persons is associated with functional impairment
and physical disability. J Am Ger Soc 2002; 50 (5): 889-896.
-
Kushner RF, Schoeller DA. Estimation of total body water by bioelectrical impedance analysis. Am J Clin Nutr 1986; 44 (3): 417-424.
-
Kwon H-J, Ha Y-C, Park H-M. The reference value of skeletal muscle mass index for defining the sarcopenia of women in Korea. J Bone Metab 2015; 22 (2): 71-75.
-
Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, et al. Big
data: the next frontier for innovation, competition, and productivity.
McKinsey Global Institute; 2011. 156 p. Available at: http://www.mckinsey.com/insights/business_technology/big_data_the_next_frontier_for_innovation/
-
Nightingdale CM, Rudnicka AR, Owen CG, Cook DG, Whincup PH. Patterns of
body size and adiposity among UK children of South Asian, black
African-Caribbean and white European origin: Child Heart And health
Study in England (CHASE Study). Int J Epidemiol 2011; 40 (1): 33-44.
-
Stasinopoulos DM, Rigby RA. Generalized additive models for location scale and shape (GAMLSS) in R. J Stat Software 2007; 23 (7): 1-46.
-
Verdich C, Barbe P, Petersen M, Grau K, Ward L, Macdonald I, et al.
Changes in body composition during weight loss in obese subjects in the
NUGENOB study: Comparison of bioelectrical impedance vs. dual-energy
X-ray absorptiometry. Diabetes Metab 2011; 37 (3): 222-229.
-
Victora CG, Barros FC, Lima RC, Behague DP, Gonçalves H, Horta BL, et
al. The Pelotas birth cohort study, Rio Grande do Sul, Brazil,
1982-2001. Cad Saúde Pública 2003; 19 (5): 1241-1256.
Views: 13180
|