Profile Picture OpenData

created Jun 8 2021

updated Jan 22 2024


This archived dataset includes data for population characteristics that are no longer being reported publicly. The date on which each population characteristic type was archived can be found in the field “data_loaded_at”.
Data on the population characteristics of COVID-19 cases are from: 
* Case interviews 
* Laboratories 
* Medical providers 

These multiple streams of data are merged, deduplicated, and undergo data verification processes.  
* We include all race/ethnicity categories that are collected for COVID-19 cases.
* The population estimates for the "Other" or “Multi-racial” groups should be considered with caution. The Census definition is likely not exactly aligned with how the City collects this data. For that reason, we do not recommend calculating population rates for these groups.
* The City collects information on gender identity using these guidelines.
Skilled Nursing Facility (SNF) occupancy
* A Skilled Nursing Facility (SNF) is a type of long-term care facility that provides care to individuals, generally in their 60s and older, who need functional assistance in their daily lives. 
* This dataset includes data for COVID-19 cases reported in Skilled Nursing Facilities (SNFs) through 12/31/2022, archived on 1/5/2023. These data were identified where “Characteristic_Type” = ‘Skilled Nursing Facility Occupancy’.

Sexual orientation
* The City began asking adults 18 years old or older for their sexual orientation identification during case interviews as of April 28, 2020. Sexual orientation data prior to this date is unavailable.
* The City doesn’t collect or report information about sexual orientation for persons under 12 years of age.
* Case investigation interviews transitioned to the California Department of Public Health, Virtual Assistant information gathering beginning December 2021. The Virtual Assistant is only sent to adults who are 18+ years old. Learn more about our data collection guidelines pertaining to sexual orientation.

* Underlying conditions are reported when a person has one or more underlying health conditions at the time of diagnosis or death.

Persons are identified as homeless based on several data sources:
* self-reported living situation
* the location at the time of testing
* Department of Public Health homelessness and health databases
* Residents in Single-Room Occupancy hotels are not included in these figures.
These methods serve as an estimate of persons experiencing homelessness. They may not meet other homelessness definitions.

Single Room Occupancy (SRO) tenancy
* SRO buildings are defined by the San Francisco Housing Code as having six or more "residential guest rooms" which may be attached to shared bathrooms, kitchens, and living spaces.
* The details of a person's living arrangements are verified during case interviews.

Transmission Type
* Information on transmission of COVID-19 is based on case interviews with individuals who have a confirmed positive test. Individuals are asked if they have been in close contact with a known COVID-19 case. If they answer yes, transmission category is recorded as contact with a known case. If they report no contact with a known case, transmission category is recorded as community transmission. If the case is not interviewed or was not asked the question, they are counted as unknown.
This dataset has been archived and will no longer update as of 9/11/2023.
Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).
This dataset includes many different types of characteristics. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of cases on each date.
New cases are the count of cases within that characteristic group where the positive tests were collected on that specific specimen collection date. Cumulative cases are the running total of all San Francisco cases in that characteristic group up to the specimen collection date listed.
This data may not be immediately available for recently reported cases. Data updates as more information becomes available.
To explore data on the total number of cases, use the ARCHIVED: COVID-19 Cases Over Time dataset.
  • 9/11/2023 - data on COVID-19 cases by population characteristics over time are no longer being updated. The date on which each population characteristic type was archived can be found in the field “data_loaded_at”.
  • 6/6/2023 - data on cases by transmission type have been removed. See section ARCHIVED DATA for more detail.
  • 5/16/2023 - data on cases by sexual orientation, comorbidities, homelessness, and single room occupancy have been removed. See section ARCHIVED DATA for more detail.
  • 4/6/2023 - the State implemented system updates to improve the integrity of historical data.
  • 2/21/2023 - system updates to improve reliability and accuracy of cases data were implemented.
  • 1/31/2023 - updated “population_estimate” column to reflect the 2020 Census Bureau American Community Survey (ACS) San Francisco Population estimates.
  • 1/5/2023 - data on SNF cases removed. See section ARCHIVED DATA for more detail.
  • 3/23/2022 - ‘Native American’ changed to ‘American Indian or Alaska Native’ to align with the census.
  • 1/22/2022 - system updates to improve timeliness and accuracy of cases and deaths data were implemented.
  • 7/15/2022 - reinfections added to cases dataset. See section SUMMARY for more information on how reinfections are identified.

Community Rating
Current value: 0 out of 5
Your Rating
Current value: 0 out of 5
Row Label
SODA2 Only
Licensing and Attribution
Data Provided By
Source Link
Department Metrics
Publishing Department
Public Health
Detailed Descriptive
Geographic unit
Not applicable
This view cannot be displayed