Profile Picture OpenData

created Jul 28 2021

updated Jun 28 2023


On 6/28/2023, data on cases by vaccination status will be archived and will no longer update.
This dataset represents San Francisco COVID-19 positive confirmed cases by vaccination status over time, starting January 1, 2021. Cases are included on the date the positive test was collected (the specimen collection date). Cases are counted in three categories: (1) all cases; (2) unvaccinated cases; and (3) completed primary series cases.
1. All cases: Includes cases among all San Francisco residents regardless of vaccination status.
2. Unvaccinated cases: Cases are considered unvaccinated if their positive COVID-19 test was before receiving any vaccine. Cases that are not matched to a COVID-19 vaccination record are considered unvaccinated.
3. Completed primary series cases: Cases are considered completed primary series if their positive COVID-19 test was 14 days or more after they received their 2nd dose in a 2-dose COVID-19 series or the single dose of a 1-dose vaccine. These are also called “breakthrough cases.”
On September 12, 2021, a new case definition of COVID-19 was introduced that includes criteria for enumerating new infections after previous probable or confirmed infections (also known as reinfections). A reinfection is defined as a confirmed positive PCR lab test more than 90 days after a positive PCR or antigen test. The first reinfection case was identified on December 7, 2021.
Data is lagged by eight days, meaning the most recent specimen collection date included is eight days prior to today. All data updates daily as more information becomes available.
Case information is based on confirmed positive laboratory tests reported to the City. The City then completes quality assurance and other data verification processes. Vaccination data comes from the California Immunization Registry (CAIR2). The California Department of Public Health runs CAIR2. Individual-level case and vaccination data are matched to identify cases by vaccination status in this dataset. Case records are matched to vaccine records using first name, last name, date of birth, phone number, and email address.
We include vaccination records from all nine Bay Area counties in order to improve matching rates. This allows us to identify breakthrough cases among people who moved to the City from other Bay Area counties after completing their vaccine series. Only cases among San Francisco residents are included.
Updates automatically at 08:00 AM Pacific Time each day.
Total San Francisco population estimates can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS). To identify total San Francisco population estimates, filter the view on “demographic_category_label” = “all ages”.
Population estimates by vaccination status are derived from our publicly reported vaccination counts, which can be found at COVID-19 Vaccinations Given to SF Residents Over Time.
The dataset includes new cases, 7-day average new cases, new case rates, 7-day average new case rates, percent of total cases, and 7-day average percent of total cases for each vaccination category.
New cases are the count of cases where the positive tests were collected on that specific specimen collection date. The 7-day rolling average shows the trend in new cases. The rolling average is calculated by averaging the new cases for a particular day with the prior 6 days.
New case rates are the count of new cases per 100,000 residents in each vaccination status group. The 7-day rolling average shows the trend in case rates. The rolling average is calculated by averaging the case rate for a particular day with the prior six days. Percent of total new cases shows the percent of all cases on each day that were among a particular vaccination status.
Here is more information on how each case rate is calculated:
1. The case rate for all cases is equal to the number of new cases among all residents divided by the estimated total resident population.
2. Unvaccinated case rates are equal to the number of new cases among unvaccinated residents divided by the estimated number of unvaccinated residents. The estimated number of unvaccinated residents is calculated by subtracting the number of residents that have received at least one dose of a vaccine from the total estimated resident population.

3. Completed primary series case rates are equal to the number of new cases among completed primary series residents divided by the estimated number of completed primary series residents. The estimated number of completed primary series residents is calculated by taking the number of residents who have completed their primary series over time and adding a 14-day delay to the “date_administered” column, to align with the definition of “Completed primary series cases” above.
  • 6/28/2023 - data on cases by vaccination status are no longer being updated. This data is currently through 6/20/2023 (as of 6/28/2023) and will not include any new data after this date.
  • 4/6/2023 - the State implemented system updates to improve the integrity of historical data.
  • 2/21/2023 - system updates to improve reliability and accuracy of cases data were implemented.
  • 1/31/2023 - updated “sf_population” column to reflect the 2020 Census Bureau American Community Survey (ACS) San Francisco Population estimates.
  • 1/31/2023 - renamed column “last_updated_at” to “data_as_of”.
  • 1/22/2022 - system updates to improve timeliness and accuracy of cases and deaths data were implemented.
  • 7/15/2022 - reinfections added to cases dataset. See section SUMMARY for more information on how reinfections are identified.
  • 7/15/2022 - references to “fully vaccinated” replaced with “completed primary series” in column “vaccination_status".
  • 7/15/2022 - rows with “partially vaccinated” in column “vaccination_status” removed from dataset.

Community Rating
Current value: 0 out of 5
Your Rating
Current value: 0 out of 5
vaccination, vaccines, cases, covid-19, covid
Row Label
SODA2 Only
Licensing and Attribution
Data Provided By
Source Link
Department Metrics
Publishing Department
Public Health
Detailed Descriptive
Geographic unit
Not applicable
Publishing Details
Publishing frequency
Not updated (historical only)
Data change frequency
Not updated (historical only)
This view cannot be displayed