
created Jun 8 2021

updated Jan 26 2023


Note: On 1/31/2023 we will be updating our data to reflect the 2020 Census Bureau American Community Survey (ACS) San Francisco Population estimates.
Note: Beginning on 1/5/2023, data for COVID-19 cases reported in Skilled Nursing Facilities (SNFs) will no longer be included in this dataset. These data were previously identified where Characteristic_Type = “Skilled Nursing Facility Occupancy”. An archived copy of SNF data through 12/31/2022 (as of 1/5/2023) is available here: ARCHIVED: COVID-19 Cases by Population Characteristics Over Time.
Note: As of March 2022, the race/ethnicity label changed from Native American to American Indian or Alaska Native to align with the Census.
Note: On January 22, 2022, system updates to improve the timeliness and accuracy of San Francisco COVID-19 cases and deaths data were implemented. You might see some fluctuations in historic data as a result of this change. Due to the changes, starting on January 22, 2022, the number of new cases reported daily will be higher than under the old system as cases that would have taken longer to process will be reported earlier.
Note: On September 12, 2021, a new case definition of COVID-19 was introduced that includes criteria for enumerating new infections after previous probable or confirmed infections (also known as reinfections). A reinfection is defined as a confirmed positive PCR lab test more than 90 days after a positive PCR or antigen test. The first reinfection case was identified on December 7, 2021. Some fluctuation in historic data may result when this change is implemented on July 15, 2022.
This dataset shows San Francisco COVID-19 cases by population characteristics and by specimen collection date. Cases are included on the date the positive test was collected.
Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how cases have been distributed among different subgroups. This information can reveal trends and disparities among groups.
Data is lagged by five days, meaning the most recent specimen collection date included is five days before today. Tests take time to process and report, so more recent data is less reliable.
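As a rough illustration of the five-day lag, the sketch below computes the latest specimen collection date a user could expect to find on a given day. The function name and the `LAG_DAYS` constant are hypothetical and are not part of the dataset or its API.

```python
from datetime import date, timedelta

# Lag stated in the dataset description; the constant name is hypothetical.
LAG_DAYS = 5


def latest_included_date(today: date, lag_days: int = LAG_DAYS) -> date:
    """Return the most recent specimen collection date expected in the data."""
    return today - timedelta(days=lag_days)


print(latest_included_date(date(2023, 1, 26)))  # 2023-01-21
```

Rows near this cutoff are the ones most likely to change in later updates.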
Data on the population characteristics of COVID-19 cases and deaths are from:
* Case interviews
* Laboratories
* Medical providers
These multiple streams of data are merged, deduplicated, and verified. This data may not be immediately available for recently reported cases because of the time needed to process tests and validate cases. Daily case totals for previous days may increase or decrease.
Data are continually updated to maximize completeness of information and reporting on San Francisco residents with COVID-19.
Data notes on each population characteristic type are listed below.
Race/ethnicity
* We include all race/ethnicity categories that are collected for COVID-19 cases.
* The population estimates for the “Other” or “Multi-racial” groups should be considered with caution. The Census definition is likely not exactly aligned with how the City collects this data. For that reason, we do not recommend calculating population rates for these groups.
Sexual orientation
* The City began asking adults 18 years old or older for their sexual orientation identification during case interviews as of April 28, 2020. Sexual orientation data prior to this date is unavailable.
* The City doesn’t collect or report information about sexual orientation for persons under 12 years of age.
* Beginning in December 2021, case investigation interviews transitioned to information gathering through the California Department of Public Health's Virtual Assistant. The Virtual Assistant is only sent to adults who are 18+ years old. Learn more about our data collection guidelines pertaining to sexual orientation.
Gender
* The City collects information on gender identity using these guidelines.
Underlying conditions
* Underlying conditions are reported when a person has one or more underlying health conditions at the time of diagnosis or death.
Transmission type
* Information on transmission of COVID-19 is based on case interviews with individuals who have a confirmed positive test. Individuals are asked if they have been in close contact with a known COVID-19 case. If they answer yes, transmission category is recorded as contact with a known case. If they report no contact with a known case, transmission category is recorded as community transmission. If the case is not interviewed or was not asked the question, they are counted as unknown.
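The classification rule described above can be sketched as a small function. This is only an illustration of the stated logic: the function name, its parameters, and the exact label strings are assumptions, and the actual values in the dataset may be spelled differently.

```python
from typing import Optional


def transmission_category(interviewed: bool,
                          contact_with_known_case: Optional[bool]) -> str:
    """Map interview answers to a transmission category.

    contact_with_known_case is None when the question was not asked.
    Label strings are hypothetical stand-ins for the dataset's values.
    """
    if not interviewed or contact_with_known_case is None:
        return "Unknown"
    if contact_with_known_case:
        return "Contact with a known case"
    return "Community transmission"


print(transmission_category(True, True))
print(transmission_category(True, False))
print(transmission_category(False, None))
```

Note that "Unknown" covers two distinct situations (no interview, or the question was skipped), so it should not be read as evidence for either of the other categories.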
Homelessness
Persons are identified as homeless based on several data sources:
* self-reported living situation
* the location at the time of testing
* Department of Public Health homelessness and health databases
Residents in Single-Room Occupancy hotels are not included in these figures.
These methods serve as an estimate of persons experiencing homelessness. They may not meet other homelessness definitions.
Single Room Occupancy (SRO) tenancy
* SRO buildings are defined by the San Francisco Housing Code as having six or more "residential guest rooms" which may be attached to shared bathrooms, kitchens, and living spaces.
* The details of a person's living arrangements are verified during case interviews.
Updates automatically at 05:00 AM Pacific Time each day. Redundant runs are scheduled at 07:00 AM and 09:00 AM in case of pipeline failure.
Dataset will not update on the business day following any federal holiday.
This dataset includes many different types of characteristics. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of cases on each date.
New cases are the count of cases within that characteristic group where the positive tests were collected on that specific specimen collection date. Cumulative cases are the running total of all San Francisco cases in that characteristic group up to the specimen collection date listed.
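The relationship between new and cumulative cases can be sketched as follows: filter to one characteristic type, then keep a running total per characteristic group in date order. The field names (`characteristic_type`, `characteristic_group`, `specimen_collection_date`, `new_cases`) are assumed spellings of the columns described above, and the rows are made-up illustrative values, not real dataset figures.

```python
# Illustrative rows only; these numbers are invented for the example.
rows = [
    {"specimen_collection_date": "2022-01-02", "characteristic_type": "Age Group",
     "characteristic_group": "18-24", "new_cases": 5},
    {"specimen_collection_date": "2022-01-01", "characteristic_type": "Age Group",
     "characteristic_group": "18-24", "new_cases": 3},
    {"specimen_collection_date": "2022-01-01", "characteristic_type": "Age Group",
     "characteristic_group": "25-34", "new_cases": 2},
    {"specimen_collection_date": "2022-01-01", "characteristic_type": "Gender",
     "characteristic_group": "Female", "new_cases": 4},
]


def cumulative_by_group(rows, characteristic_type):
    """Filter to one characteristic type and add a running total per group."""
    selected = [r for r in rows if r["characteristic_type"] == characteristic_type]
    # ISO-formatted date strings sort chronologically as plain strings.
    selected.sort(key=lambda r: (r["characteristic_group"],
                                 r["specimen_collection_date"]))
    totals = {}
    result = []
    for r in selected:
        group = r["characteristic_group"]
        totals[group] = totals.get(group, 0) + r["new_cases"]
        result.append({**r, "cumulative_cases": totals[group]})
    return result


for row in cumulative_by_group(rows, "Age Group"):
    print(row["characteristic_group"], row["specimen_collection_date"],
          row["new_cases"], row["cumulative_cases"])
```

Filtering to a single characteristic type first matters because each type covers the same underlying cases; summing across types would count cases more than once.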
Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity groups are from the 2019 5-year American Community Survey. San Francisco population estimates for age groups are from the National Center for Health Statistics.
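Where a population estimate is available, a case rate can be computed from it. The sketch below is one possible approach, not the City's published method: the per-100,000 scale, the function name, and the parameter names are assumptions. It returns no rate for the “Other” and “Multi-racial” groups, following the caution in the race/ethnicity notes, and none when no population estimate exists.

```python
from typing import Optional

# Groups for which the description does not recommend calculating rates.
EXCLUDED_GROUPS = {"Other", "Multi-racial"}


def case_rate(cumulative_cases: int,
              population_estimate: Optional[int],
              group: str,
              per: int = 100_000) -> Optional[float]:
    """Return cases per `per` residents, or None when a rate is not advisable.

    The per-100,000 default is an assumed convention, not taken from the dataset.
    """
    if group in EXCLUDED_GROUPS or not population_estimate:
        return None
    return cumulative_cases * per / population_estimate


# Made-up numbers for illustration only.
print(case_rate(50, 10_000, "18-24"))  # 500.0
print(case_rate(50, 10_000, "Other"))  # None
```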
This data may not be immediately available for recently reported cases. Data updates as more information becomes available.
To explore data on the total number of cases, use the COVID-19 Cases Over Time dataset.
Certain population characteristics that were once included in this dataset are no longer being reported publicly. An archived copy of these data is available here: ARCHIVED: COVID-19 Cases by Population Characteristics Over Time.
The archived dataset contains data on the following population characteristics that are no longer being reported publicly:
* Skilled Nursing Facility Occupancy

Department Metrics
Publishing Department: Public Health
Detailed Descriptive
Geographic unit: Not applicable