As of early January 2025, the Centers for Disease Control and Prevention (CDC) is reporting an increase in the epidemic trend of influenza and COVID-19. As we enter the peak season for respiratory-related illnesses in the US, many healthcare leaders continue to question whether our public health data and analytics infrastructure has evolved to meet the challenges of today and beyond.
It is well known that the COVID-19 pandemic exposed various shortcomings in public health data and analytics infrastructure.
Consequently, the legacy public health data and analytics infrastructure hindered healthcare professionals and first responders from effectively collecting and sharing data with organizations across the public health ecosystem. This undermined efforts to more effectively understand and respond to the pandemic as organizations struggled with:
Since the pandemic, we have made great strides toward modernizing our data and analytics infrastructure. Below are just a few of the primary initiatives that have helped build a more reliable future for public health data and analytics:
Though considerable progress has been made we have miles to go before we come close to meeting expectations. While the CDC’s multi-billion, multi-year Data Modernization Initiative is in place, the BCHC reports that only 29% of large local health departments have received additional funding for their data modernization projects.
Estimates from HIMSS show that truly reforming the current public health data infrastructure will require a massive investment of as much as $36.7 billion over the next 10 years. Funding is needed to address a variety of issues that continue to plague our nation's public health data/analytics infrastructure today.
The Path Forward: Policymakers at the federal, state, and local levels must continue to recognize the vital importance of continuing to fund public health data/analytic infrastructure improvements. Additionally, opportunities exist with other government agencies such as Medicaid to collaborate with public health organizations and leverage policy/program initiatives such as Medicaid Enterprise Systems and 1115 Demonstration Waivers to improve Medicaid beneficiary outcomes and the effectiveness of public health initiatives.
Our current public health ecosystem continues to be highly decentralized, suffering from severe data fragmentation. The CDC, for example, operates 100+ different disease surveillance systems while many state, tribal, local, and territorial Public Health Authorities (STLT PHA) take their own technology solution approach to supporting disease surveillance and many other public health functions.
The Path Forward: While data may be fragmented, it does not mean it can’t be unified and better leveraged. Data from different sources, such as STLT PHA disease surveillance systems, CDC, EHRs, HIE’s, laboratories, homeless management information systems, and more, can prove much more vital when effectively and reliably unified.
Contemporary technology solutions use advanced analytics and machine learning/AI to quickly and efficiently ingest data from a variety of siloed sources. They clean, validate, standardize, deduplicate, and unify data to form a complete 360-degree view of citizen health. Unified data can provide more holistic transparency into public health, fuel new initiatives such as social health information exchanges and/or modernized disease management and surveillance systems, and ultimately help all stakeholders better understand and respond to emerging public health threats.
Incomplete data presented a massive challenge during the pandemic and persists today. Early pandemic reporting in 2020 showed that over 55% of race & ethnicity data was missing from COVID-19 case data. Even after the issues were widely public, a 2022 analysis showed that about 36% of a CDC COVID-19 database (~50 million cases) still lacked race and ethnicity data. Another 2023 study showed more than 60% of sexual orientation and gender identity (SOGI) data was missing in adult EHR patient records.
These issues present an incomplete picture of diseases and their impact on different population segments, resulting in avoidable negative outcomes and health disparities. Additionally, data standardization continues to be a significant issue with many healthcare providers and public health agencies using different systems and data collection/reporting standards. The United States Government Accountability Office called out data standardization as one of their top areas of opportunity to improve public health data management efforts.
The Path Forward: Public health data can be sourced from a variety of different sources, systems, and locales. Stakeholder representation across these ecosystems must continue to collaborate and drive efforts to agree on common data standards. As initiatives such as those within HTI-1, HTI-2, and TEFCA take shape, the hope is that more momentum will be gained towards standardization efforts.
There must also be a commitment to expanding/improving data collection efforts at the point of service. Professionals at the point of service are often not well-enabled to collect data such as SDoH and may not be trained on how to explain their importance in a culturally sensitive manner to patients.
The same BCHC study referenced earlier showed that a staggering 96% of its respondents indicated they were struggling to effectively exchange data with state health agencies. This could be due to incompatible systems (legacy/outdated), insufficient interoperability development support (skill levels, headcount, etc.), and a lack of support for contemporary interoperability exchange standards such as FHIR.
Interoperability challenges make it difficult to effectively track health trends and outcomes. For example, in our current system, COVID-19 and Influenza (notifiable conditions) patients who visit a healthcare provider/lab may trigger an initial notification of their condition to STLT PHA/National Surveillance systems. However, because different healthcare providers, labs, and STLT PHA’s could all be using different systems and exchanging standards, there is a chance that data may be in an incompatible format with an STLT PHA system. This may lead to an error-prone manual case capture or the possibility of data never being captured at all. Other problems may also be present such as a mismatch of data exchange standards/formats resulting in data irregularities (unreliable/misunderstood demographic data such as race/gender/SOGI) and/or poor matching efforts to track case activity across the same patient.
The Path Forward: The public health landscape includes thousands of systems with different data analytic infrastructures used by healthcare providers, labs, pharmacies, public health agencies, health systems, insurance companies, and community organizations. These systems come from many different vendors, were built in different time periods, and are tailored as per the needs of each organization.
Government-led initiatives such as CMS-9115-F, CMS-0057-F, HTI-1, HTI-2, and TEFCA will continue to provide the groundwork for much-needed interoperability progress while industry efforts to evolve FHIR and other standards that support interoperability will also prove vital. Technology vendors continue to press the edge of innovation and those who provide standards-based, open, API forward systems will act as a central nervous system for complex, multi-system environments that promote and encourage interoperability and the secure exchange of healthcare data.
Despite the potential of Artificial Intelligence (AI) to improve public health data analysis, adoption in public health departments remains limited. According to the National Association of County and City Health Officials’s (NACCHO) Public Health Informatics Profile 2024, around 76% of local health departments (LHD) hesitate to use AI. This is especially concerning for larger jurisdictions serving populations of over 500,000 or more. Also, most organizations employ AI for communication purposes only. This gap in usage prevents public health agencies from leveraging the power of AI to quickly identify patterns/trends, optimize resource allocation, and predict the impact of mitigation measures and clinical interventions.
The Path Forward: Selected examples to leverage the power of AI:
The pandemic taught us many difficult lessons. Consequently, we have done much to improve our public health data/analytic infrastructure. However, significant progress is still needed to achieve a state of modernization that can truly support proactive responses during times of new/emerging public health threats.
In efforts to coordinate funding, data unification/completeness/standardization, interoperability/exchange, and the use of artificial intelligence, collaboration will be critical across the entire public healthcare ecosystem. Public health agencies will need to work closely with sister agencies, healthcare providers, payers, community-based organizations, technology developers, and many other private partners to drive towards success and our goal of being best prepared for the next emerging public health threat.