This page, Massachusetts syndromic surveillance data, nowcast and moving epidemic methods , is offered by
Bureau of Infectious Disease and Laboratory Sciences
Department of Public Health

Massachusetts syndromic surveillance data, nowcast and moving epidemic methods

Learn how we use the Massachusetts syndromic surveillance data for nowcasting and moving epidemic thresholds as applied in the viral respiratory illness dashboards.

Skip table of contents

You skipped the table of contents section.

Background on syndromic surveillance data

The data source for this analysis is emergency department (ED) visits from the National Syndromic Surveillance Program (NSSP) ESSENCE platform.
- NSSP is a collaboration among CDC, federal partners, local and state health departments, and academic and private sector partners to collect, analyze, and share electronic patient encounter data received from multiple health care settings. For more information on NSSP, visit NSSP on cdc.gov.
Currently, 100% of emergency departments in the Commonwealth are sending data to ESSENCE, allowing for a complete picture of ED visits.
When a patient is admitted, discharged, or transferred, the hospital’s electronic medical record system triggers real-time HL7 messages, which travel through Mass HIWay to the National Syndromic Surveillance Program.
Data can then be accessed and analyzed in ESSENCE. These records contain information about the visit, patient, and reason for the visit, including diagnosis codes, but do not include the patient’s name nor the patient’s home address. There is very limited identifiable information about the patient included.
While ESSENCE receives updates to health records instantaneously, delays in test results, hospital coding, etc. result in often-substantial delays between the visit and the notification of the patient’s diagnosis. This is why we advise caution when interpreting data that are only one to two weeks old.

Data

Statewide respiratory ED visits from December 31st, 2023, to December 28th, 2024, were retrieved using the CDC Broad Acute Respiratory DD v1, CDC COVID-Specific DD v, CDC Influenza DD v1, and CDC Respiratory Syncytial Virus DD v1 queries. For more information, visit the NSSP CoP Knowledge Repository Syndrome Library.
- Data from the past year were assumed to follow similar patterns of reporting and delay as current data, so this time frame was used to train the nowcasting method for the current season. The data used to make these estimates will be refreshed periodically to preserve the efficiency of the estimate

Nowcasting method

Using data from 2024, time was measured between when a report was received to when the respiratory illness information was considered complete. The record initiation timestamp was subtracted from the message-receipt timestamp for the first reporting time of a respiratory diagnosis code (as defined by the corresponding CDC query), for each visit.
For each date of the time period sampled, (starting with 12/31/2023-12/28/2024), percent completeness was determined at 1-week intervals from date of visit, defined as the number of visits flagged by the query at each 1-week interval since visit date (1 week after, 2 weeks after, etc.), divided by the final known total count of diagnosed visits for that day.
- Completeness for Date X at N Weeks = (Count of Visits for Date X with a Respiratory Code by N Weeks from Date X) ÷ (Final Known Count of Respiratory Visits for Date X)
Reporting completeness percents for all dates were compiled for each interval. For each interval since visit date, median (50th percentile) and 2.5th and 97.5th percentile completeness measurements were determined, giving an informed point estimate and estimate range for completeness of data at the given age (i.e., the percent completeness of the counts N weeks since date of interest).
These point estimates and ranges were applied to current weekly data in order to estimate, based on reported data’s age, what respiratory reporting counts would be once the data are fully reported (nowcast).
- Example: If a 1-week-old count of visits is 1,100, and 1-week-old data are estimated to have 88% completeness, the final count (accounting for reporting delays) can be estimated to be 1,100/0.88 = 1,250.
- The 2.5th and 97.5th percentile completeness estimates are then used to create upper and lower range nowcast estimates of final count.

Assessment

A retrospective validation study of this method examined 20 snapshots of weekly data pulls of Broad Acute Respiratory visits from 2024, generating nowcast predictions for the 10 weeks prior to each snapshot. (n = 20 weeks × 10 predictions = 200). The estimate counts produced via these snapshots could be compared to known counts in order to assess the method's performance. The method produced point estimates of counts that were significantly correlated with actual values (R=1; p < 2.2e-16)
Prediction intervals were found to contain the true final value within their range (model coverage) 83% of the time. The remaining 17% of results that were outside the interval fell only slightly outside: widening the prediction interval by 1% in each direction (99% of low estimate, 101% of high estimate) brought the coverage up to 97%.
Residuals (= estimated value - actual value) were small, often demonstrating a slight underestimation of the final numbers, and were concentrated around late December and early January, a time period that often sees high respiratory visits, lower hospital staffing, and greater-than-normal reporting delays.

Conclusions

This method examines past reporting delays for respiratory visits in order to account for reporting delays of current data. The method has been found to reliably estimate the final total counts of respiratory visits from their incomplete (still updating) counts.
Through nowcasting, we are given a better and more timely sense of current trends than with counts alone. This allows us to spot changes in rates of respiratory illnesses in a timelier fashion.

Background on the Moving Epidemic Method (MEM)

The moving epidemic method is a set of steps for categorizing disease rates into activity levels, based on what was observed in past seasons.
The method was first introduced by Vega et al. in 2004, developed to categorize influenza activity. A modified version of the MEM was adopted by the CDC in 2015 in order to track seasonal influenza. In recent years, several public health departments have developed versions of the MEM for additional illness categories, such as COVID-19, RSV, and respiratory illness overall.
The MEM consists of 2 sets of calculations. Each calculation uses data from the 6 most recent waves (for non-COVID-19 syndromes, the 2020-2021 respiratory season is excluded, as no significant respiratory wave was observed):
- Baseline calculation: Determines the level of activity at which an epidemic (such as the seasonal influenza or a COVID-19 wave) has begun. Once respiratory activity has reached/exceeded this level, the seasonal wave has begun. Respiratory activity below this level is considered “Very Low”.
  - For non-COVID-19 syndromes, the very low incidence periods, the times during which respiratory activity is outside of a peak and remains very low, come regularly each year and are defined as weeks 20-39 of the year. For COVID-19, the very low incidence periods were classified using a wave-identification algorithm, as described by Vega et al.
  - This calculation uses the 5 highest weekly rates from each very low incidence period of the 6 most recent waves. (n=30)
  - Baseline = the upper limit of the one tailed 95% CI of the arithmetic mean of those rates.
- Activity levels calculation: Categorizes levels as either below baseline or within the epidemic (once activity has passed the baseline). The MA DPH Respiratory Illness Dashboard divides activity into Very Low (below Baseline), Low, Moderate, High, and Very High.
  - For non-COVID-19 syndromes, the respiratory wave was defined as weeks 40-19 of the year. For COVID-19, waves were classified using a wave-identification algorithm, as described by Vega et al.
  - This calculation uses the 5 highest weekly rates from each of the 6 most recent waves. (n=30)
  - The thresholds are calculated based on the upper bounds of the one-sided confidence intervals of the geometric mean of those rates at:
    - Very Low: Less than the Baseline calculation
    - Low: Greater than or equal to the Baseline calculation and less than 50%
    - Moderate: Greater than or equal to 50% and less than 90%
    - High: Greater than or equal to 90% and less than 97.5%
    - Very High: Greater than or equal to 97.5%

Help Us Improve Mass.gov with your feedback

Did you find what you were looking for on this webpage?

Yes

If you have any suggestions for the website, please let us know. How can we improve the page?

Please do not include personal or contact information.

The feedback will only be used for improving the website. If you need assistance, please contact the Bureau of Infectious Disease and Laboratory Sciences. Please limit your input to 500 characters.

Please remove any contact information or personal data from your feedback. You will NOT get a response.

If you need assistance, please contact the Bureau of Infectious Disease and Laboratory Sciences.

Please let us know how we can improve this page.