Journal:Big data and public health systems: Issues and opportunities

From LIMSWiki
Jump to navigationJump to search
Full article title Big data and public health systems: Issues and opportunities
Journal International Journal of Interactive Multimedia and Artificial Intelligence
Author(s) Rojas de la Escalera, David; Carnicero Giménez de Azcárate, Javier
Author affiliation(s) Sistemas Avanzados de Tecnología, Health Service of Nararre
Primary contact Email: javier dot carnicero dot gimenez at cfnavarra dot es
Year published 2018
Volume and issue 4 (7)
Page(s) 53–59
DOI 10.9781/ijimai.2017.03.008
ISSN 1989-1660
Distribution license Creative Commons Attribution 3.0 Unported
Download (PDF)


In recent years, the need for changing the current model of European public health systems has been repeatedly addressed, in order to ensure their sustainability. Following this line, information technology (IT) has always been referred to as one of the key instruments for enhancing the information management processes of healthcare organizations, thus contributing to the improvement and evolution of health systems. On the IT field, big data solutions are expected to play a main role, since they are designed for handling huge amounts of information in a fast and efficient way, allowing users to make important decisions quickly. This article reviews the main features of the European public health system model and the corresponding healthcare and management-related information systems, the challenges that these health systems are currently facing, and the possible contributions of big data solutions to this field. To that end, the authors share their professional experience on the Spanish public health system and review the existing literature related to this topic.

Keywords: big data, health system, healthcare organizations, health information systems, epidemiological surveillance, strategic planning


The health system

According to the World Health Organization (WHO), “a health system consists of all organizations, people and actions whose primary intent is to promote, restore or maintain health. This includes efforts to influence determinants of health as well as more direct health-improving activities. A health system is therefore more than the pyramid of publicly owned facilities that deliver personal health services.”[1] Furthermore, every health system performs the following set of basic functions[2]:

  • delivering health services to individuals and to populations
  • creating resources
  • providing stewardship
  • financing the system

The center of any health system must be the first of these functions, since healthcare constitutes the paramount goal and therefore the reason for the existence of the health system itself. Around it, other functions are organized, essential for ensuring healthcare delivery and public health. Among these, the following must be remarked upon:

  • epidemiological surveillance, which comprises the collection and analysis of large volumes of data directly or indirectly related to people’s health, so as to detect or prevent possible health problems regarding public health
  • planning and overseeing the management of the health system, which allows healthcare organizations to set out their strategic goals, allocate the necessary resources, assess the degree of compliance of these goals and apply corrective measures if required
  • clinical research, focused on generating knowledge and applying it to the development of new diagnostic and therapeutic techniques
  • education and teaching, in order to train new professionals and keep the practicing ones appropriately updated and competent

The health cluster or ecosystem

From a structural point of view, a health system is neither an isolated nor homogeneous entity, but rather it comprehends or relates to entities of diverse nature, both public and private, with interests of their own, as well as shared interests. This ensemble is known as health cluster or ecosystem, and among its components the following must be pointed out[3]:

  • central or federal government and regional or local authorities
  • healthcare services, conceived as organizations responsible for the management of a determined healthcare network
  • hospitals
  • primary care centers
  • emergency services
  • pharmacies
  • convalescent centers
  • health professionals acting as external providers to the health system
  • public health services
  • insurance companies, mutual societies, and other entities which finance healthcare
  • schools for the education and training of doctors, nurses, and other health professionals
  • research centers
  • professional associations and colleges
  • foundations and learned societies
  • stakeholders, such as patients associations
  • pharmaceutical and other health technology industries

Challenges faced by the health system

For decades, the public health systems of European countries, created following the end of World War II, have been frequently mentioned as a reference model to be followed, especially in those aspects regarding coverage, quality of service, and contribution to the welfare of society. However, the scene in which these systems arose has suffered a series of major changes, being the most important the following ones[4]:

  • the aging of the population, with a continuous increment of chronic and degenerative diseases
  • the financial crisis, which causes important budget cuts in the public funds meant to finance the health systems activities, and makes it more difficult—or even impossible—for the citizens to compensate these cuts with out-of-pocket expenses
  • the creation of new techniques and drugs, more effective but also more expensive, mainly due to the necessity to compensate the research costs caused by their development
  • the increasing demands of the citizens, who require more and better healthcare services in a setting that seeks patient empowerment and promotion of personalized medicine

As a token of the first two determinants, aging of the population and public budget cuts, the Spanish case is addressed below. Table I shows the progress of these two indicators during the period between 2003 and 2014.

Table 1. Demographics and health expenditure in Spain (2003–2014)
Sources - demographics: Demographic Information System, Spanish National Statistics Institute (INE); health expenditure date: OECD Health Statistics
Year Total population Population 15–64 years Population older than 64 years Dependency ratio Public health expenditure Private health expenditure
People % over total M€ % GDP M€ % GDP
2003 42,717,064 29,396,965 7,276,620 17.03% 24.75% 43,158.4 5.37% 17,354.5 2.16%
2004 43,197,684 29,777,965 7,301,009 16.90% 24.52% 46,992.4 5.46% 18,651.1 2.17%
2005 44,108,530 30,511,110 7,332,267 16.62% 24.03% 51,351.5 5.52% 20,094.2 2.16%
2006 44,708,964 30,849,177 7,484,392 16.74% 24.26% 56,662.2 5.62% 21,520.7 2.14%
2007 45,200,737 31,188,079 7,531,826 16.66% 24.15% 61,612.0 5.70% 23,101.9 2.14%
2008 46,157,822 31,869,008 7,632,925 16.54% 23.95% 68,147.1 6.11% 24,392.9 2.19%
2009 46,745,807 32,145,023 7,782,904 16.65% 24.21% 73,035.6 6.77% 23,863.0 2.21%
2010 47,021,031 32,153,527 7,931,164 16.87% 24.67% 72,852.6 6.74% 24,593.7 2.28%
2011 47,190,493 32,082,758 8,093,557 17.15% 25.23% 71,800.0 6.68% 25,510.2 2.38%
2012 47,265,321 31,980,402 8,222,196 17.40% 25.71% 68,262.9 6.47% 26,594.3 2.55%
2013 47,129,783 31,718,285 8,335,861 17.69% 26.28% 65,718.5 6.26% 26,981.3 2.62%
2014 46,771,341 31,281,943 8,442,427 18.05% 26.99% 65,975.7 6.34% 28,558.1 2.74%

These data reveal that the Spanish population has increased from 42.72 to 46.77 million people during the 2003-2014 period, while the percentage of people older than 64 years has risen from 17.03% to 18.05% over total population, and the dependency ratio, which indicates the ratio between population older than 64 years and population between 15 and 64 years old, has risen from 24.75% to 26.99%. On the other hand, public health expenditure in 2003 meant 5.37% GDP, reaching a peak of 6.77% in 2009 and falling to 6.26% in 2013, experiencing a small recovery in 2014, with 6.34% GDP. Regarding private health expenditure, it was at minimums around 2.14%-2.17% GDP but it has risen year after year since the beginning of the financial crisis in 2008, reaching 2.74% GDP in 2014.

All things considered, the impact of all these determinants is so important that the sustainability of this model of public health system has been questioned in recent years.

The transformation of public health systems

Despite the fact that the challenges explained above make clear that a deep transformation of this health system model is needed, and IT is often considered as one of the main facilitators for this change, it is not admissible to think that health systems are going to lose their essential features. Health systems must improve people’s health, from both an individual and a collective point of view, and this final goal will not change in spite of the introduction of new technologies impacting big data.

The patient must always be the center of any health system and, in the same way, health information must always be the center of a health information system, which will be introduced below. The actions of a clinic professional focus on the achievement of specific healthcare goals customized for each one of their patients, improving or maintaining their health status. Besides knowledge, healthcare requires a connected and personalized relationship between the provider and the patient, so that interventions are tailored to the patient’s unique preferences and behavior, as with, for instance, drug adherence. Different people will have different reasons for non-adherence.[5]

On their behalf, health systems managers must seek the compliance of the general goals defined by their organizations. These goals will be the aggregate of the individual goals related to each one of the professionals in their clinical staff. In addition, these managers will also be responsible for the allocation of the necessary resources and the financing of the whole activity in their organizations.

On the whole, health systems must focus their efforts on the creation of value for both the patient and society. To that end, clear goals must be defined that find an appropriate balance between the patient’s personal interests and the collective interest of society. For instance, in the event of a surgical intervention it is mandatory to measure some indicators such as mortality rate, adverse events, time of recovery, care costs, or time for the patients to return to their jobs at full capacity. Nevertheless, it is necessary to also take into account other indicators, maybe more subjective and thus harder to measure, but equally important because of their impact on the patient, such as post-surgery functionality, pain suffered, or the cost of all these factors from a quality-of-life point of view. If health systems are not focused on their patients’ interests and on achieving the corresponding goals, they will hardly be able to change and ensure their sustainability.[6]

This article reviews the main features of the European public health system model and the corresponding healthcare and management-related information systems, the problems and challenges that these health systems are currently facing, and the solutions that big data tools may potentially offer in that respect. To that end, the authors have based this work on their professional experience with the Spanish public health system, an analysis of the scene that the latter is facing in the upcoming years and decades, and a review of the existing literature on big data applied to health.

Big data solutions in a healthcare environment

The health information system

The individual performance of the different components of the health cluster, as well as their interactions, causes the creation of multiple data flows, which are also greatly varied, since they involve several business processes. This set of data flows creates the health information system as a result.

As mentioned above, just as healthcare is the center of any health system, patient-related healthcare information must be the center of the health information system as well, since it may and will also be used for activities other than healthcare, such as epidemiological surveillance, planning, overseeing of management, clinical research, and education and training, as stated in the introduction. The fact that these data are stored in healthcare information systems is a consequence of them being generated during the patient’s care, but their usefulness goes clearly beyond this limit.

Therefore, the health information system must allow users to register, process, consult and share large amounts of data, ensuring their availability at the appropriate moment and point of the health cluster. On the healthcare side, this cluster reveals itself as a huge generator and at the same time consumer of enormous sets of information, related to personalized healthcare processes that take place on a daily basis and in a massive way. Healthcare is considerably intense regarding data treatment, by constantly creating immense datasets and frequently requiring access to knowledge sources.

For IT to be fully integrated in the health system value chain, it is mandatory to have a health information system which serves as an instrument for knowledge management being useful to all its users. Healthcare professionals cannot perform their duties properly without registering and using patients’ information, or without accessing the knowledge sources that allow them to make decisions on a solid basis. Public health departments need to know the population health status in order to detect or prevent potential collective health issues, as well as defining the necessary corrective and preventive measures. To those ends, these professionals must rely on data generated during every patient’s healthcare encounter, properly aggregated, as well as other data sources.

Managers are not able to plan a strategy, oversee its performance, and assess the achieved outcomes without a tool that allows them to process all the necessary information and provides them with accurate data, timely and in due form. These data are required from the very beginning, since the definition of an appropriate strategy must be based on the knowledge of the population’s health status, complemented with projections of its potential progress.

This complexity has been increased in recent years by a major change in healthcare organizations, which have evolved from a clearly paternalist way of interaction with their patients to a completely different one, focused on seeking their empowerment. In addition, patients are not content anymore with the information provided to them by their doctors, but search the internet for additional data about their diseases, engage on social networks, make their own decisions, and register information on their health records. This new role is indeed required if, for instance, health authorities seek to promote one of the most important lines of action in the field of chronic patient management, self-care encouragement, which has a beneficial impact on both the patient and the health system. However, this requires also a more varied interaction between them, combining traditional simple events, like setting up appointments with a general practitioner, with more complex actions, like monitoring health data analyzed and stored by wearable devices.

Despite this, it is clearly positive that both society and the medical community have evolved from a discussion about giving patients clearance to access their own healthcare information, to a totally different one about seeking the best way for the patients to register data in their health records, either in an active and conscious way or in an passive and automated one via specific devices. In any case, it must be always taken into account that the management of healthcare information is not a process unrelated to healthcare but rather an inseparable component of healthcare itself, hence its management and supervision are the healthcare professionals’ responsibility, even though the patients take a more active role. Furthermore, every professional must accept this new reality and provide the patient with the necessary training so that this initiative ends up being successful.[7]

Apart from this, the temptation of exploiting the information stored on different social networks turns out to be very powerful. It is true that, to the health system, it is a possibility worth exploring, but several conditioning factors must also be considered. The first one is data protection as a consequence of people’s right to privacy, something that, from the beginning, seems to collide clearly with the business model of social networks themselves, designed to share large amounts of information in a quick, heterogeneous, and, up to some point, uncontrolled way.

Precisely these features represent another important conditioning factor, since social networks are nothing but huge repositories that store unstructured, poorly classified, or simply uncategorized data, not to mention the more than likely irrelevance of most of them regarding healthcare and, moreover, their doubtful veracity, a feature essential to this field. Given their market penetration, with millions of users around the world, it seems advisable to assess the possibility of using social networks as an information source for health systems, as long as a model can be defined that solves or at least mitigates all the inconveniences mentioned above.

Potential contributions of big data to health systems

The field of big data analytics is rapidly expanding, up to the point that it has begun to play a main role in the evolution of healthcare practices and research, by providing tools to register, manage, and analyze huge amounts of both structured and unstructured data produced by current healthcare information systems.[8]

Health-related big data streams can be classified into three categories[5]:

  • Traditional healthcare data are generated within the health system and stored in datasets such as health records, medical imaging tests, lab reports, or pathology results, among others. Analyzing this information allows to achieve a better understanding of disease outcomes and their risk factors, and also to reduce health system costs, thus making them more efficient.
  • “Omics’’ data deal with large-scale datasets in the biological and molecular fields, such as genomics, microbiomics or proteomics, for instance. The study of this information leads to deeper knowledge about how diseases behave, accelerating the individualization of medical treatments.
  • Data from social media allow to figure out how individuals or groups use the internet, social media, apps, sensor devices, wearable devices or any other tools to better inform and enhance their health.

Additionally, the inclusion of geographical and environmental information may further increase the ability to interpret gathered data and extract new knowledge.[9][10]

Combinations of several types of data must also be taken into account. The concept of personalized medicine, partially introduced above, seeks to combine the patient’s health record and genomic data in order to support the clinical decision-making process, making it predictive, personalized, preventive and participatory, an idea known as “P4 Medicine.”[11]

At the micro level, personalized medicine aims to customize the diagnosis of a disease and the subsequent therapy by taking into account the individual patient’s characteristics, instead of relying on decisions taken according to general guidelines, defined as a result of population-based studies and clinical trials. This will require the integration of clinical information, mainly patient records, and biological data such as genome or protein sequences. These data are generated from different and heterogeneous sources, and they have very diverse formats.[11]

In fact, healthcare data no longer needs to be restricted to traditional datasets such as electronic health records. For instance, mobile or wearable devices monitoring physiological signals can provide timely access to multiple data points that are increasingly interconnected. Traditionally, the data generated by these sorts of devices have not been stored for more than a brief period of time, being discarded afterwards and therefore preventing any extensive investigation to benefit from the exploitation of these data. However, attempts to use this kind of data have been increasing lately, in order to improve patient care and management.[8][12]

Nevertheless, there is a difference between collecting data, having access to data, and knowing how it should be used to improve healthcare. Now that the technology for handling massive amounts of data is available, the next step is developing tools for information sharing and knowledge management, which are seriously limited by the lack of system interoperability.[11][12]

For instance, with full interoperability the ability to collect data in a timely manner from several different sources leads to an increase in registries. Disease registries are still in an early stage, but they might be valuable tools when it comes to supporting patient-centered self-management of chronic illness and defining customized treatment plans. Besides, the integration of computer analysis with appropriate care will help doctors to improve diagnostic accuracy. In a similar way, the integration of medical images with other types of electronic health record data and genomic data can also improve the accuracy of a diagnosis and reduce the time required for it.[8][12]

A major emphasis of personalized medicine is to match the right drug with the right dosage to the right patient at the right time. Moreover, gene sequencing and the use of the subsequent genetic data in diagnosis and treatment will be essential to the future of personalized medicine, with actions such as the prescription of drugs based on genomic profiles of individual patients, known as pharmacogenomics. However, analytics of high-throughput sequencing techniques in genomics is a problem inherent to big data itself, since the human genome consists of 30,000-35,000 genes. Some ongoing projects aim to integrate clinical data from the genomic level to the physiological level of a human being. These initiatives will surely help when it comes to deliver personalized healthcare.[8][11][12]

At the macro level, faster access to data allows any hospital to define and apply quality improvement policies based on the constant monitoring of outcomes, ensuring that the strategic goals of the organization are achieved. Hospitals have also used electronic health records, datasets originally intended to document individual healthcare processes, to identify system-related inefficiencies and quality issues. Faster access to data has also been hugely useful for the identification and management of disease outbreaks, allowing public health initiatives to be targeted to specific areas via population analysis.[12]

Mining of electronic health record data has made it possible for researchers to identify possible sources of adverse events. Healthcare professionals have used this information to improve organizational practices and reduce error rates. Moreover, many clinical information systems such as electronic health records and computerized physician order entry systems capture a large amount of metadata about their use, which can be used for auditing purposes, thus allowing the organization to detect user-device interaction problems, shrinking safety margins and minimizing other technology-related safety issues and concerns before any adverse event takes place.[12]

The potential impact of big data is not easy to estimate, let alone at such an early stage. A report sponsored by the McKinsey Global Institute states that the proper use of big data within the United States healthcare sector might allow improvements with an estimated value of more than $300 billion every year, two-thirds of which would be achieved by reducing the healthcare expenditure of the whole country.[13]

However, healthcare IT history has made clear that technology-based panaceas do not exist. The potential of IT for transforming health systems seems to be widely accepted as a consequence of its contribution to the improvement of healthcare processes, but IT has also caused new issues and risks, such as user-computer interaction problems or technology-induced errors. As a consequence, it seems clear that more IT outcome-based research is still needed in order not only to prove its value but also to quantify it.[12][14]

Requirements for the use of big data within the health information system

While the ability to manage massive amounts of data provides a huge opportunity to develop methods and applications for advanced analysis, the real value of big data will only be achieved if the information extracted from these data is useful to improve clinical decision-making processes and patient outcomes, as well as lower healthcare costs.[11] To that end, several basic requirements must be met, though they are very similar to the requirements of the health information system itself.

First of all, it is essential to ensure the quality of the information. This involves the development of thorough protocols which define the criteria required for data input, validation, harmonization, registry, processing, and transmission to other components of the information system. In fact, several of the main requirements of data mining are the technical correctness of data, the accuracy and statistical performance, and the update or reassessment of the analysis.[15]

In the health field, the information managed is so complex and heterogeneous that it is necessary to employ data carefully structured and, as long as it is possible, categorized. This is useful for data identification and error control purposes. Furthermore, healthcare information is a perfect example of three major features, commonly known as the three Vs, widely accepted as defining characteristics of big data: volume, variety, and velocity. In addition to these a fourth V, the veracity of healthcare data, is obviously critical for its meaningful use.[8]

All possible information sources and data flows within the health information system must be perfectly identified as well. Since the information system must store all data required for the performance of the different corporate functions of the health system, it is clear that all of its components must be interoperable, as stated above, so that any data can be accessed from any point of the health system that needs them. Hence another cardinal requirement is the interoperability of systems, subsystems, and components, defined as their capability of exchanging information without altering the meaning of the exchanged data, regardless of their source and their use within each system.

For instance, a medical consultation generates information used for the patient’s healthcare, the management of the employed resources, and the billing of the service, but it can also be used in the medium and long-term for outcome assessment, strategic planning, research, education, epidemiological surveillance, or even as evidence in legal proceedings. Moreover, the aggregate of every data generated during that consultation and the ones generated during millions of similar healthcare events will be useful to create knowledge, on which [[clinical decision support system]s will be based.

Therefore the cycle comprehends the transition from data to information, from information to knowledge, and from knowledge to practice. All of this needs the interoperability of clinical information systems, logistic and economic-financial systems, business intelligence systems, and university/R&D center systems, among others. As a consequence, every system must be capable of filtering the information received in order to extract the data it needs, so as to not compromise their processing, thus avoiding the risk of producing adulterated results.

Finally, from a technological point of view, it is mandatory to have a high-performance IT infrastructure on which to rely for the generation, storing, processing, and exchange of large data volumes in a quick and efficient way. Luckily, hardware, software, and communications solutions have experienced a huge progress in recent years, so technological viability is hardly an obstacle nowadays.

Some additional issues and barriers

The implementation of big data solutions and tools in the healthcare field requires addressing not only the organizational and technological issues detailed above, but also several legal and ethical questions.

From a legal point of view, the first cause of conflict may be data propriety. As explained in previous sections, every data properly processed and analyzed can be turned into knowledge, and the latter can be easily made profitable. The first companies working this angle are tech giants such as Google, which provides personalized advertisements based on navigation and search history, and Facebook, which admitted to focus part of its efforts on sociological research based on its users’ data, and has even tried to take possession of these information in a completely unilateral way.[7]

Given that the generation, registry, and processing of all this information requires a powerful hardware and software infrastructure, and therefore a large investment by these companies, their intention to make it profitable may be considered legitimate to a certain point, especially if they are not charging users for the service provided. However, limitations regarding the use of the stored data must be clearly established, something that seems to be far from being solved with the current legal framework, which is quite confusing. For instance, in the case of Spain, this framework combines European Union, national, regional, and sectorial (both health and e-government) regulations.[7] Moreover, most of this legislation is outdated to a large extent, since it was passed in a time when IT progress was nowhere near what we have today.[16]

Once at this point, a revision of these regulations, taking into account the current potential of big data solutions, as well as the foreseeable one on the short and mid-term, seems to be more than appropriate. Of course, this revision must be addressed with the goal of balancing the individual interests of patients (right to privacy) and professionals (legal certainty in the performance of their healthcare and management duties), as well as the general interests of society (research, education, or improvement of healthcare services, among others). To that end, protocols must be defined that combine both a priori measures, such as data anonymization, and a posteriori measures, such as thorough audits regarding the access and use of data. Keeping the human factor in mind, one of the most crucial a priori measures will always be raising the awareness of patients, professionals, and organizations.

From an ethical point of view, quite a few similarities to the legal field can be observed. The fact that IT is going to play an increasingly important role in health systems seems to be widely accepted, since its potential as a key instrument for the transformation of the current model is appreciated. Nevertheless, there is also a great concern about the lack of transparency in the management of the large amounts of data guarded by healthcare organizations. For this reason, the promotion of more and better control measures is backed by bioethics experts, starting with the development of a specific legal framework that can be turned into clear and visible actions, thus transmitting a sense of security and contributing to promote the trust in healthcare data mining.[15]

Big data in SERGAS: A case study

Within the Spanish National Health System, healthcare is accountable to the Autonomous Communities, which represent the regional level of the government, and each has a health service. In the case of the community of Galicia, this would be the Galician Health Service (Servizo Galego de Saúde, SERGAS).

SERGAS relies on a business intelligence (BI) solution for the exploitation of structured datasets, these being provided by a regional database in which information supplied by the different hospitals and primary care centers of this health service is aggregated. In addition, a management system for information related to human resources and pharmaceutical expenditure provides structured data as well.

In order to complement this BI system, SERGAS has implemented big data technologies so as to exploit unstructured data stored in the patients’ electronic health records. This innovation makes SERGAS the first Spanish health service to use big data in a systematic way. On a total budget of 982,278 euros, several projects have been developed regarding the following lines of action:

Rare disease management

  • detection of suspicious cases
  • creation of a rare diseases registry

Chronic disease management

  • detection of diabetes mellitus type 2 patients, chronic obstructive pulmonary disease patients, and patients with pluripathology, yet uncategorized as such in their health records
  • calculation of prevalence and incidence indicators, as well as risk factors

Clinical research

  • decision-making support regarding the selection of the most appropriate kind of vascular endoprostheses (stents)

Nosocomial infection management • research and categorization of detected cases • automated alerts

Surveillance of several syndromes • case identification • detection of food toxi-infection and acute respiratory symptom outbreaks

Exploitation of lab test results

  • currently in progress

As a whole, these systems are handling information belonging to 2.9 million patients, provided by 63 different data sources. As of 2016, 59 million normalized events have been compiled, 12 million documents (of 50 different kinds) have been semantically processed, and 500,000 cases have been detected.

Regarding information security, SERGAS applies a set of corporate criteria, with standard measures such as the definition of user profiles and access authorization levels, the anonymization of aggregate data, and the performance of audits to verify regulation compliance. Additionally, there are several committees that define the guidelines for the management of ethics and governance, always within the current legal framework.


As with IT in general, the successful implementation of big data solutions in a healthcare environment will depend on their capability to generate added value that benefits patients, professionals, and organizations. No one seems to doubt the need to improve public health systems by evolving their current model, or the potentially valuable contributions of big data in this respect, but the great complexity that characterizes the implementation of these kinds of tools seems to be proven too, according to the requirements, and, in some cases, obstacles of a different nature that must be dealt with.

Once technological viability is apparently achieved, it is time for healthcare organizations and authorities to face the challenge of studying the possibilities of big data and seeking the best way of applying it to the solution of their issues, problems, and needs. In order to achieve this, they must not start wondering what information they have now and what they can achieve with it, instead focusing on what information they need and how they can get it. The most frequent problem will not be the availability of the necessary data, but the screening of the relevant information and how to assess it. In summary, the most important thing is not having the data, since this is already happening, but being able to ask the right questions at the right moment, process them to provide only the necessary and relevant information, and show the latter to healthcare professionals in such a way that they can assimilate it in a quick, correct, and easy manner in order to make the right decisions at the right time.

On the healthcare side, big data must become the foundation of clinical decision-making support systems, and also an instrument for data aggregation concerning public health departments, as well as research and education. On the management side, managers will have access to more accurate and timely knowledge concerning the real status of their organizations and adopt a proactive plan instead of a retrospective one. In addition, they will be capable of detecting deviations from objectives earlier and applying the appropriate corrective and, preferably, preventive measures.

In conclusion, the implementation of big data must be one of the main instruments for change in the current health system model, changing it into one with improved effectiveness and efficiency, taking into account both healthcare and economic outcomes of health services, thus being meaningful to patients and also to society, all while taking advantage of patients’ potential as active participants in their own care.


The authors wish to thank Ms. Pilar Carnicero and Mr. Guillermo Vázquez for their contributions in elaborating on and improving this article.


  1. World Health Organization (2007). Everybody's business -- Strengthening health systems to improve health outcomes: WHO's framework for action. World Health Organization. pp. 44. ISBN 9789241596077. 
  2. "The Tallinn Charter: Health Systems for Health and Wealth". World Health Organization. 27 June 2008. 
  3. Rojas, D.; Carnicero, J. (2015). "A Model Of Information System For Healthcare: Global Vision and Integrated Data Flows". In Berhardt, L.V.. Advances in Medicine and Biology. 82. Nova Science Publishers. ISBN 9781634636339. 
  4. Carnicero, J.; Rojas, D.; González, A. et al. (2016) (PDF). La explotación de datos de salud: Retos, oportunidades y límites. Sociedad Española de Informática de la Salud. ISBN 9788460889472. 
  5. 5.0 5.1 Hansen, M.M.; Miron-Shatz, T.; Lau, A.Y.; Paton, C. (2014). "Big Data in Science and Healthcare: A Review of Recent Literature and Perspectives - Contribution of the IMIA Social Media Working Group". Yearbook of Medical Informatics 9: 21–6. doi:10.15265/IY-2014-0004. PMC PMC4287084. PMID 25123717. 
  6. Porter, M.E.; Lee, T.H. (2013). "The Strategy That Will Fix Health Care". Harvard Business Review 10: 50–70. 
  7. 7.0 7.1 7.2 Martínez, R., Rojas, D. (2014). "Gestión de la seguridad de la información en atención primaria y uso responsable de Internet y de las redes sociales". In Carnicero, J.; Fernández, A.; Rojas de la Escalera, D.. Manual de salud electrónica para directivos de servicios y sistemas de salud. 2. United Nations. 
  8. 8.0 8.1 8.2 8.3 8.4 Belle, A.; Thiagarajan, R.; Soroushmehr, S.M.R. et al. (2015). "Big data analytics in healthcare". BioMed Research International 2015 (2015): 370194. doi:10.1155/2015/370194. 
  9. Luo, J.; Wu, M.; Gopukumar, D.; Zhao, Y. (2016). "Big Data Application in Biomedical Research and Health Care: A Literature Review". Biomedical Informatics Insights 8: 1–10. doi:10.4137/BII.S31559. PMC PMC4720168. PMID 26843812. 
  10. National Academies of Sciences, Engineering, and Medicine (2016). Big Data and Analytics for Infectious Disease Research, Operations, and Policy: Proceedings of a Workshop. The National Academies Press. doi:10.17226/23654. ISBN 9780309450140. 
  11. 11.0 11.1 11.2 11.3 11.4 Panahiazar, M.; Taslimitehrani, V.; Jadhav, A.; Pathak, J. (2014). "Empowering Personalized Medicine with Big Data and Semantic Web Technology: Promises, Challenges, and Use Cases". Proceedings of the IEEE International Conference on Big Data 2014: 790–5. doi:10.1109/BigData.2014.7004307. PMC PMC4333680. PMID 25705726. 
  12. 12.0 12.1 12.2 12.3 12.4 12.5 12.6 Kuziemsky, C.E.; Monkman, H.; Petersen, C. et al. (2014). "Big Data in Healthcare - Defining the Digital Persona through User Contexts from the Micro to the Macro. Contribution of the IMIA Organizational and Social Issues WG". Yearbook of Medical Informatics 2014: 82–9. doi:10.15265/IY-2014-0014. PMC PMC4287094. PMID 25123726. 
  13. Manyika, J.; Chui, M.; Brown, B. et al. (May 2011). "Big data: The next frontier for innovation, competition, and productivity". McKinsey Global Institute. 
  14. Carnicero, J.; Rojas, D.; Martínez, R. et al. (2016) (PDF). XI Informe SEIS - Las TIC y la seguridad de los pacientes: Primum non nocere. Sociedad Española de Informática de la Salud. ISBN 9788461740246. 
  15. 15.0 15.1 León, P. (2016). "Capítulo III: Bioética y explotación de grandes conjuntos de datos". In Carnicero, J.; Rojas, D.; González, A. et al. (PDF). La explotación de datos de salud: Retos, oportunidades y límites. Sociedad Española de Informática de la Salud. ISBN 9788460889472. 
  16. Andérez, A. (2016). "Capítulo IV: Disposiciones legales aplicables". In Carnicero, J.; Rojas, D.; González, A. et al. (PDF). La explotación de datos de salud: Retos, oportunidades y límites. Sociedad Española de Informática de la Salud. ISBN 9788460889472. 


This presentation is faithful to the original, with only a few minor changes to grammar, spelling, and presentation, including the addition of PMCID and DOI when they were missing from the original reference. Citation three is listed in the references of the original but inadvertently omitted from the inline citations; it has been placed in the text at what is believed to be the appropriate citation point. Additionally, the original document placed citations 11 and 12 before nine and 10; this version shows citations in order of appearance, by design.