Note: As of April 16, 2021, this dataset will update daily with a five-day data lag.
A. SUMMARY
This dataset represents the COVID-19 positive confirmed cases by sexual orientation for adult cases (18 or older) where the lab result was reported after April 28th.
Read more about
California's mandate that all counties report data on sexual orientation and gender identity.
Demographic data (including sexual orientation) are based on information reported from case interviews, laboratories, and providers. This data may not be immediately available for recently reported cases and data will change to reflect as information becomes available. Cumulative counts of 5 or fewer are excluded from the dataset.
B. HOW THE DATASET IS CREATED
Information on COVID-19 case demographic details (like sexual orientation) is reported from a combination of data sources including electronic and faxed laboratory reports, case interviews, medical providers, and electronic medical record systems. These multiple streams of data are merged, deduplicated, undergo quality assurance and other data verification processes, and are continually updated to maximize completeness of information and reporting on San Francisco residents with COVID-19.
C. UPDATE PROCESS
Updates automatically at 05:00 Pacific Time each day. Redundant runs are scheduled at 07:00 and 09:00 in case of pipeline failure.
D. HOW TO USE THIS DATASET
This data may undercount certain minorities who have faced stigma and discrimination, particularly in medical settings (and therefore may not disclose their gender identity or sexual orientation, for example).
Missing data (coded here as "missing") may become known over time as more information is gathered.
Sexual Orientation groups resulting in 5 or fewer cumulative cases are dropped for privacy reasons.