VACCINATIONS: two-doses and one-dose #2083
Replies: 38 comments 4 replies
-
As always thank you @lucasrodes! The vaccination README now includes a detailed explanation of how our 3 metrics ( Regarding calculations and data processing, I expect that things will get messy when some countries start mixing 1-dose vaccines and 2-dose vaccines in their vaccination program, but won't necessarily release the right data for us to calculate our metrics correctly. For now, we have this data for the UK (and England/Scotland/NI/Wales), and Israel. |
Beta Was this translation helpful? Give feedback.
-
Hi @edomt,
Had a look before, great job!
Couldn't agree more with this! Backwards compatibility is a must at this point Having said this, for now, I guess this is still a problem for the future, once one-dose vaccines are in use. This - the dosage variability in different vaccines - came to my mind today and didn't want to forget, so I noted it down as an issue 😄. |
Beta Was this translation helpful? Give feedback.
-
Question: How to show the plot of |
Beta Was this translation helpful? Give feedback.
-
@yoch The data explorer (the very big graph at the top of the page) only includes doses. But if you scroll down, you'll find a section "What share of the population has received at least one dose of the COVID-19 vaccine?" with data on people vaccinated, and "What share of the population has been fully vaccinated against COVID-19?" on people fully vaccinated. |
Beta Was this translation helpful? Give feedback.
-
Okay, thank you ! IMHO the section "What share of the population has received at least one dose of the COVID-19 vaccine?" is more relevant and deserve to be the main picture, because "doses administered per 100 people" introduce some confusion, especially when comparing with one doses vaccine in future. |
Beta Was this translation helpful? Give feedback.
-
Came here to say this. The top chart on the website is doses, and framed "as per 100 people". This sets up an expectation that when we get to 100, we're done. But this really means we need to get to 200! So the rollout is actually half as fast as we'd like. Deciding what the main graph is on the page is important and I humbly recommend it be either updated to "number of people receiving at least one dose" or "number of people fully vaccinated" so that 100 is the real target... not something somewhere between 100 and 200. See here for a Twitter thread on the confusion: https://twitter.com/mikesherov/status/1354103902875099138?s=21 thanks for the great work you all do and for the openness of this project! |
Beta Was this translation helpful? Give feedback.
-
Thanks for the feedback @yoch & @mikesherov Our choice to show total doses at the top of the page is a deliberate one (our team actually spends a fair amount of time discussing & debating how to ideally present information on each of our COVID-19 page). I recently wrote a Twitter thread here explaining our rationale for currently (i.e. not necessarily permanently) preferring total doses:
|
Beta Was this translation helpful? Give feedback.
-
While I appreciate the rationale about accurately representing velocity of roll out, it still doesn't serve as a foundation that contains a stable denominator to the ratio. "Per 100 people" will continue to belie the notion of progress. I believe most people will want to know actual progress towards "everyone has been vaccinated" or "herd immunity achieved". The natural understanding of 100 as the denominator expressing "percent of the way there" is inescapable, and will continue to mislead visitors. I suggest usability testing this because it's really counter to natural user expectation, even with the warning blurb. |
Beta Was this translation helpful? Give feedback.
-
Hi @mikesherov I understand your concern, and thank you for the time you took to voice it. As I said above, this is the current situation, and we may decide to change the layout of the page and reorder the charts. In particular, "people fully vaccinated" may become the main metric we want to highlight in the future, both from a public health point of view and in terms of public interest. On the other hand, we do get hundreds of messages per day that give us feedback on our work, both directly on the site (through our feedback button in the bottom-right corner of each page) and outside (Twitter, GitHub). And more than 50% of this feedback is currently about our vaccination page. When something looks wrong or is misinterpreted by a lot of our users, we almost instantly receive dozens of questions about it, and we act quickly to correct or clarify it. This is simply not the case about this issue. Again, we're sincerely interested in user feedback and over the last 12 months, we've made hundreds of changes to our COVID-19 pages & charts based on it. But there's very little evidence for the confusion you're describing here, and it seems that the chart's subtitle is currently enough to help people understand how this metric works. Edouard |
Beta Was this translation helpful? Give feedback.
-
Thanks for the answer, the rationale is quite convincing. But maybe it need to be more explicit (I don't know exactly how). I've seen on Twitter that even good scientists can be confused by the presentation of results. You can see for example: https://twitter.com/EricTopol/status/1354182091148152833 |
Beta Was this translation helpful? Give feedback.
-
Would it be possible to create another column indicating the total number of people with 1+ doses? This is e.g. recorded on the CDC website (https://covid.cdc.gov/covid-data-tracker/#vaccinations). The benefit of such recording is that it gives enough information to back-out the number of people receiving the 1-dose vaccine, which currently appears to have a significant efficacy difference with most two-dose vaccines. Currently as defined, the J&J vaccine would count towards both the columns "people_fully_vaccinated" and "people_vaccinated", making the statistics unidentifiable. |
Beta Was this translation helpful? Give feedback.
-
@MichaelLLi Good point! Just to check If I understood it correctly (which I am not sure), does your proposal match the example below? Currently we have:
New column addition:
where Not needed, but could also have a |
Beta Was this translation helpful? Give feedback.
-
I'm actually not sure if I fully understand your suggestion @MichaelLLi: what's given by the US CDC is "Number of People Receiving 1 or More Doses", which is already what our "people_vaccinated" variable is. |
Beta Was this translation helpful? Give feedback.
-
Thanks to the OWID team for the incredible efforts on this! Here's a suggestion on how to reconcile the various tensions between doses and people as well as the 100 vs 200% issue, which I am thinking of implementing on the site I am running on the pandemic by World Bank income classification. Data availability issues vs the need for a global indicator of coverageCountries report very differently on their vaccination efforts. In the OWID dataset we can distinguish between three cases:
One observation is that It's curious that for the countries under (2) it turns out that In light of this, I agree with the earlier statement in this thread that it makes sense (for now) to focus on Single- and double-protocol equivalentsOne drawback as noted by @mikesherov is that full coverage is achieved at 200% as opposed to 100%. As Edouard noted, that communication issue can be handled by being explicit about it. But it's still an issue and not the only one. Before the advent of single-shot protocols, we could safely assume that 200% represents full coverage of the population. With single-shot protocols, we may be able to achieve full coverage much earlier so the threshold will no longer be constant and depend on the type of vaccines deployed. This is an issue in charts like this one. A potential solution would be to construct a variable of An alternative to this would be to convert Finally, note that this is not a suggestion to drop |
Beta Was this translation helpful? Give feedback.
-
Hi @fibke Many thanks for writing down your thoughts on this (and apologies for the late reply!). The equivalence of
Until now, we've always been able to know whether shots belonged to a single- or double-shot protocol. J&J is the only 1-dose protocol that is being used, and only in two countries (US & South Africa). South Africa only uses J&J, so that keeps things simple. And the US reports the data in a very detailed way, which lets us know clearly which doses are J&J ones vs all the other 2-dose protocols. Since February, we've recentered our communication around While I like the approach you suggested of converting this information to SPE/DPE to allow for better comparison, for now I think this would introduce much more complexity than is necessary in the current "data landscape". Our (OWID's) main goal remains to make the research and data on COVID-19 vaccinations more understandable and accessible. Currently, |
Beta Was this translation helpful? Give feedback.
-
It would be extremely useful if we could access data on number of vaccinations versus type. Currently I am working on an estimate of R_0 for the Delta variant, based on the current exponential growth rate in cases in the UK. That estimate will be used to assess the vulnerability of the US to a new wave in infections. But to do that, it is necessary to estimate the susceptible fraction of the population, which includes a fraction of the people who have been vaccinated. For instance, the MNRA vaccines offer ca 90% protection, which means that 10% of those who have received those vaccines are susceptible. For the J&J vaccine, the rate of protection is ca 70%, meaning 30% are susceptible. |
Beta Was this translation helpful? Give feedback.
-
@SchiffS Are you referring to this data or is something missing from that? |
Beta Was this translation helpful? Give feedback.
-
Hello @edomt, Regarding France, the source that you use reports 49.41M fully vaccinated people but, your file only counts 45.25M. I do know that there is a gap of few days, but is there an explanation for this huge difference? For comparison, the numbers for NZ are perfectly reported (NZ numbers are way simpler to understand!!). |
Beta Was this translation helpful? Give feedback.
-
Hi @PierreKiwi :) In our data, fully-vaccinated people are those who have received all doses prescribed by their vaccination regimen (e.g. 2 doses for Pfizer/BioNTech, Moderna, Oxford/AstraZeneca, etc. and 1 dose for Johnson & Johnson, CanSino, etc.). Some countries—including France—use alternative definitions, such as having been infected with SARS-CoV-2 in the past and having received 1 dose of a two-dose regimen. We currently ignore these alternative definitions to preserve the common definition of fully vaccinated, i.e. all doses required in the vaccine regimen. This allows for an optimal comparability between countries. |
Beta Was this translation helpful? Give feedback.
-
@edomt - hi, Your approach makes sense as it is def. a thorny problem. You want to offer data which is reliable and meaningful. Nevertheless I would argue that OWID results can be misleading for two reasons: OWID does not seem to apply the same logic to each country and eligibility age/demographics are not the same between countries. Let me break them down.
At this stage, would it not be way easier for OWID to simply use the official data of each country (especially when they are reliable) for both one dose and fully vaccinated rather than trying to follow the official vaccine regimen (which is already hijacked in some ways (boosters, mixing, one dose + covid = fully vaccinated))? Regarding my second point. Because demographics between countries are very different (young African country, old Europeans ones) and there is no consistency about eligibilty (some countries vaccinate from 3 like the UAE (TIL!) while many allow it from 12+ (France, Canada, NZ), and others from 18+), what about reporting % based on the eligible population rather than the overall one? From an OWID point of view, I do understand that it is easier to use the total population as denominator because some countries do not report more detailed data. But I found the % not really representative of the vaccinal cover. For example, using their eligible population, Canada, France, and NZ jump quite significantly:
Regards |
Beta Was this translation helpful? Give feedback.
-
Hi @PierreKiwi On the contrary, we do apply the same methodology to every country! Or at least as much as we can :)
Regarding your two last suggestions, beyond the fact that this wouldn't be a good idea in terms of comparability between countries, it would also be extremely difficult for us to change either our denominators, or our method of data collection. This would involve revising scripts and historical data for ~200 countries. |
Beta Was this translation helpful? Give feedback.
-
Hi @edomt, I'm trying to calculate the number of daily vaccinations of each dose, using people_vaccinated, people_fully_vaccinated and total_vaccinations. I'm approximating missing values, then calculating a daily rate from the differnce between consecutive cumulative values. I have data on the proportion of vaccines delivered which are one- or two-shot vaccines from another source. I was wondering if you were able to help with the following: Number of booster shots should be total_vaccinations minus people_vaccinated. The number of people recieving one shot of a one-shot only vaccine is people_fully_vaccinated multiplied by the proportion of vaccines in that country which are one-shot only. The number of people recieving the second shot of a two-shot vaccine is people_fully_vaccinated multiplied by the proportion of vaccines in that country which are two-shot. The number of people receiving the first shot of a two-shot vaccine is people_vaccinated minus the number of people recieving the only shot of a one-shot vaccine. This generates negative values though so I think my calculation of those receiving only shot of one-shot is incorrect. I've read the definitions information on https://github.com/owid/covid-19-data/tree/master/public/data/vaccinations I understand that public health data can be difficult to standardise, especially under conditions of global epidemic. Any help would be greatly appreciated. |
Beta Was this translation helpful? Give feedback.
-
Hi @willgilks
This is unfortunately invalid: The best solution I can see would be:
|
Beta Was this translation helpful? Give feedback.
-
Hi all, Don't know if this is the right place to ask but I've been having issues with the The numbers I get dividing the Previous to 2021-01-29 most of the differences are insignificant and I believe could be related to rounding issues. However the gap between my calculation and the OWID's US examples: Population as is on
Am I missing any methodological steps here? |
Beta Was this translation helpful? Give feedback.
-
Thanks a lot for taking the time to answer to my comments. I was not implying that OWID should drop the relative to population aspect, but eventually try to add the relative to eligible population option. A lot of work for sure :). Regarding the differences between countries, thanks a lot for all the details. I now understand OWID approach and will look at the numbers with a different mindset. |
Beta Was this translation helpful? Give feedback.
-
I agree with Edouard that an indicator that constrains the denominator to the eligible population will be hard to construct. While it may help assessing the vaccination performance of countries assuming eligibility criteria are common across countries, a big drawback is that it runs the risk of conveying the notion that the non-eligible group doesn’t matter in measuring vaccination coverage.
|
Beta Was this translation helpful? Give feedback.
-
Hi @leonardodiegues - see #1983 |
Beta Was this translation helpful? Give feedback.
-
@edomt Thank you! |
Beta Was this translation helpful? Give feedback.
-
Hi, does the new_vaccinations columns include boosters? (Apologies if it's stated in the manual, just require confirmation) |
Beta Was this translation helpful? Give feedback.
-
if i deaccumulate people_vaccinated, people_fully_vaccinated, total_boosters, total_vaccinations, then the sum of the first three is greater than total_vaccinations on 12,030 days (91 countries involved), corresponding to 81,605,564 doses administered. In an attempt to deal with this, I maintain the total_boosters value but reduce people_vaccinated and people_fully_vaccinated proportionally to fit with the remaining (tv-total_boosters). Unfortunately, there's still 88 days where total_boosters per day exceeds total_vaccinations per day (by a total of 2,419,282 doses, 14 countries involved). lmk if my logic is wrong here. If i change total_vaccinations on these days to be the higher total_boosters value, then of course the cumulative total total_vaccinations increases. Instead, I could replace the 'bad' daily value with zero or interpolate it. Either way, what remains then is to distribute the unaccounted for cumulative doses among all days providing this doesn't increase their daily vaccinations to greater than the total vaccinations. This is very fiddly, and means that all historical data points are being changed every day. A less laborious solution might be to loess-smooth on the non-cumulative to remove likely artifacts (possibly omitting days from the model where the subgroup total exceeds total_vaccinations), then accumulate, then adjust by the reported cumulative total. So my question is, how do I deal with the 88 days where total_boosters per day is greater than total_vaccinations per day ? Is there a reasonably possibility that this can be corrected in the available data? attaching csv of problem days. Thanks for all your hard work, hugely apprieciated. |
Beta Was this translation helpful? Give feedback.
-
This is a centralized issue to discuss how to jointly handle two-dose and one-dose vaccines in the data.
As of 2021-01-12, all approved vaccinations are administered in two doses. However, some future vaccines are announced to require only one dose.
A first proposal could be as in England.csv, where a new column
people_fully_vaccinated
has been introduced. Therefore, when analyzing all doses jointly (from all providers), we might prefer to use this column instead oftotal_vaccinations
.2 Doses
Pfizer/BionTech, Moderna, AstraZeneca/Oxford university, Novavax, Medicago/GSK, CureVac, Gamaleya, Sinopharm I, Sinopharm II, SinoVac, Bharat Biotech
1 Dose
Johnson & Johnson, CanSino
Unclear
ZFSW
Reference: Covid vaccine tracker: The shots available and the doses administered, Financial Times, January 4 2021 (accessed on January 12, 2021)
Beta Was this translation helpful? Give feedback.
All reactions