Skip to Main Content

Learning from COVID-19 Data on Transmission, Health Outcomes, Interventions and Vaccination

 .

Learning from COVID-19 Data on Transmission, Health Outcomes, Interventions and Vaccination

February 24, 2021

Xihong Lin, PhD
Professor, Department of Biostatistics
Harvard T.H. Chan School of Public Health

Tuesday, February 23, 2021

ID
6223

Transcript

  • 00:03So, so first it is my honor
  • 00:05to today to welcome Doctor.
  • 00:07She only as our summer speaker,
  • 00:10Xihong Lin is a professor from Harvard,
  • 00:13jointly appointed by both as deaths.
  • 00:16Annabelle Slash Department and she has
  • 00:18received broad recognition and many
  • 00:21awards for her great contribution to the
  • 00:23field and her research has covered so
  • 00:26many different topics ranging from Mr
  • 00:28Logical work including hypothesis testing,
  • 00:30had dimension statistics
  • 00:31and color inference tools,
  • 00:33data applications.
  • 00:34And the computational statistics
  • 00:36such as Statical Genetics.
  • 00:38Scalable statical inference
  • 00:39as well as applications with
  • 00:41epidemiological and health data.
  • 00:43So today she will share with us
  • 00:46her work on analyzing large scale
  • 00:49coordinating databases from both
  • 00:50China and the US and provide several
  • 00:53takeaways and discuss priorities.
  • 00:55It's Insp endemic,
  • 00:57so I will not occupy everyones time for more.
  • 01:01I will hand it over to see home from here.
  • 01:05So should we start
  • 01:07alright? Thank you laying so
  • 01:08much and for inviting me and
  • 01:10for the very nice introduction
  • 01:12and so I will share my screen.
  • 01:141st oh, I cannot share the screen.
  • 01:16I have just met
  • 01:18your Co host so you should
  • 01:20be able to get it now.
  • 01:25Can you can you share your
  • 01:26screen I can do now? Thank you thanks.
  • 01:33Alright, can you see my screen?
  • 01:35Yes. Cool excellent alright?
  • 01:38So I'll share with you some of the
  • 01:42work and we have been doing last year
  • 01:46on the Covic 19 so this is the data
  • 01:50just downloaded earlier this this week.
  • 01:52So as you can see that the right now
  • 01:56they're over 110,000,000 cases in
  • 01:59US and also 2.4 two point 4,000,000
  • 02:02deaths were 110,000,000 cases and
  • 02:052.4 million deaths worldwide.
  • 02:07So if you look at the curve on the left,
  • 02:10that is the case curve and for
  • 02:12a few selective countries.
  • 02:14So as you can see for both the UK and
  • 02:16also United States and the number of
  • 02:18cases had been going down in January,
  • 02:21so that is a good sign.
  • 02:23And also Israel as you know that
  • 02:25Israel has been a really leader in
  • 02:27a vaccination and so their cases
  • 02:29have been going down as well.
  • 02:31But if you look at Africa I think
  • 02:33the number this particular country
  • 02:35you can see the number of cases.
  • 02:37Has been going up likely due to the new
  • 02:40violence in Africa and on the right.
  • 02:42That is the case curve with the desk curve.
  • 02:45You can see the patterns pretty similar,
  • 02:47especially you can see for this
  • 02:49African country and the number
  • 02:51of deaths has going up quickly,
  • 02:53so that is really worrisome.
  • 02:55So here's a talk outline,
  • 02:57so I'll start talking about the covid
  • 03:01transmission intervention and using the data,
  • 03:03and then they turn talk about
  • 03:06the USN word data.
  • 03:08Then I'll talk about epidemiological
  • 03:11characteristics of Mackovic.
  • 03:12Then I'll talk about the 221
  • 03:14playbooks and also the defining
  • 03:16challenges in particular about
  • 03:19the vaccine rollout and uptake.
  • 03:21Our focus more on the uptake and
  • 03:24also how can we do scalable testing?
  • 03:28And in particular,
  • 03:29I talk about this support design and
  • 03:32we call the hypergraph we called hyper.
  • 03:35That is based on the hypergraph
  • 03:39factorization.
  • 03:40So I started working on the Covic
  • 03:4219 research mainly by coincidence
  • 03:44and last February.
  • 03:46So my post,
  • 03:47our former postdoc column one
  • 03:49he is currently is professor in
  • 03:51school public Health at Wild on
  • 03:53Science and Technology University,
  • 03:56which is located in Wuhan,
  • 03:58and so last February,
  • 04:00'cause Wuhan was epicenter.
  • 04:01So I wrote him a message asking
  • 04:04how and he and his family were
  • 04:07doing and he told me that.
  • 04:10He and his colleagues were
  • 04:12analyzing the Wuhan data,
  • 04:13then at that time there was already one case,
  • 04:17and in Seattle and one case in Boston.
  • 04:20So I sense that the the the
  • 04:23virus might spread and so.
  • 04:25I decided to join them and working
  • 04:28on analyzing the Wuhan data so we
  • 04:31work the we work for several weeks
  • 04:34and in February especially child
  • 04:36when his Cody worked really hard and
  • 04:39then finish this preprint and we post
  • 04:43the preprint on March 6 and so then
  • 04:46with the hope that the findings we
  • 04:48want we want to share their findings
  • 04:51with the US and also other country
  • 04:54as soon as possible to help them.
  • 04:58The other countries,
  • 04:59and so I I did not expect this
  • 05:02preprint attract lots of attention
  • 05:04that you can see the number of
  • 05:07abstract view and download,
  • 05:09and also there lots of.
  • 05:13A lot of free trade and also the news outlet
  • 05:17coverage and also the policy documentation.
  • 05:20And after I twittered this paper in
  • 05:23the Twitter and so then this paper
  • 05:26preprint cover too much stuff and so we
  • 05:29decided to split the paper into split
  • 05:32the preprint into two paper and one
  • 05:35was published in JAMA and last April.
  • 05:38Another one was published in
  • 05:40Nature last summer and so this work
  • 05:43on the JAMA paper was led by.
  • 05:46I'm Pan and a towel,
  • 05:48and here they both were addressed.
  • 05:50pH alarm and country.
  • 05:52Who is the Dean of school?
  • 05:54Public health as well doing science
  • 05:57and technology University and the
  • 06:00Nature paper was led by Charlo.
  • 06:03And then emerge after the preprint
  • 06:05was posted them in Twitter.
  • 06:07And then I got quite a few interview request.
  • 06:10But as a station,
  • 06:12my first reaction was turned
  • 06:13down all the interviews,
  • 06:15so I turned on all the interviews
  • 06:18and in March, and then April,
  • 06:20I decided that probably is not a good idea.
  • 06:24It's good to talk with the media
  • 06:26and then they can understand the
  • 06:28scientific funding correctly.
  • 06:30Then that will help,
  • 06:31and the combat the Covic.
  • 06:33And so I decided to accept the interviews
  • 06:36and so you can see there's a quite
  • 06:39few coverage of the findings and
  • 06:41also the interviews and in the US medias.
  • 06:44And also in the UK medium and
  • 06:47also in scientific journals
  • 06:48such as Nature and Science.
  • 06:50And the one thing I I learned
  • 06:53was that 'cause when as a faculty
  • 06:56member in Academic Institute
  • 06:58and we were not trained to.
  • 07:01Speak to the media and so to the
  • 07:04scientific communication on the two.
  • 07:05The general public is very important,
  • 07:08so it's important to have more
  • 07:10training and in this area.
  • 07:12Another thing I learned was is
  • 07:14important that speak in simple
  • 07:16language and without the jargon and
  • 07:18so general public could understand.
  • 07:21And then I testified the in the
  • 07:23science and Technology University
  • 07:25$0.10 at Technology Committee
  • 07:27of UK Parliament on April 17.
  • 07:30And so this sense technology Committee
  • 07:33has about 8 to 10 Parliament
  • 07:35members like a senators on it.
  • 07:38And then they later on they also
  • 07:41invited few other witnesses.
  • 07:43And then they wrote a letter to the
  • 07:46Prime Minister Johnson and they make
  • 07:49a 10 recommendations in their letter.
  • 07:52Under so so then I I was honored
  • 07:55that several of my recommendation
  • 07:58we included in their recommendation
  • 08:01in the recommendation they made it
  • 08:04to the Prime Prime Minister Johnson.
  • 08:07And then, uh,
  • 08:08this last month and as a one year
  • 08:12anniversary of the colic outbreak and
  • 08:15nature published their third editorial
  • 08:18and focusing on the major findings,
  • 08:21a key finding from ideology.
  • 08:24And so we will.
  • 08:26We will honor that our paper was
  • 08:29featured on the in this editorial.
  • 08:33So let me first talk about
  • 08:36the Wuhan analysis.
  • 08:37Then we talk about the US and other
  • 08:39countries and the 1st I'll introduce
  • 08:42this effective reproductive number.
  • 08:44So this concept,
  • 08:45our value is right now and everybody
  • 08:48understand what that is and so
  • 08:50that measures average number
  • 08:52of infected people by one case.
  • 08:54So you can see on the right if
  • 08:57Artie called before that means one
  • 08:59person could affect to four people.
  • 09:02That means the virus spread.
  • 09:04So therefore, to in order
  • 09:05to control the pandemic,
  • 09:06the army to be less than one.
  • 09:10And so here's a woman analysis under.
  • 09:13So we found the two major features of Kovik.
  • 09:17The first feature is the virus
  • 09:19is highly transmissible,
  • 09:20and so the before January 23rd,
  • 09:23and there was no intervention.
  • 09:25As you can see,
  • 09:27the number of cases had went up quickly,
  • 09:30exponentially,
  • 09:31and so the first case was reported
  • 09:34on the in this orphan and seafood
  • 09:37market and this seafood market
  • 09:40was closed on January 1st.
  • 09:42Under then on March on January 23rd,
  • 09:45and that was a lunch of the lock down and so.
  • 09:50As you can see that before the lock down
  • 09:54the estimated RTT values is about 3.5,
  • 09:57so that means each purse case could be.
  • 10:00Like 3.5 people and so that is not good.
  • 10:04So that means this is really the
  • 10:06disease is very transmissible
  • 10:08and then after the lock down and
  • 10:11with the social distancing is
  • 10:13really how the RT value dropped to
  • 10:16a little over 1.2 and then.
  • 10:19But it's not good enough and so then
  • 10:22after February 2nd and that was a lunch
  • 10:25of the centralized isolation and quarantine.
  • 10:28So they basically the city build.
  • 10:31Two new hospital field Hospital and
  • 10:33also 16 field hospitals converted
  • 10:36from the stadium and expectation
  • 10:38Center and after that.
  • 10:40And then you can see the number of
  • 10:43cases drop down very quickly and the RT
  • 10:46values and by March 8th was about .27
  • 10:50and then the pandemic was controlled.
  • 10:55And so to estimate those RT values.
  • 10:58So basically the model we use is a person
  • 11:01partial differential equation model
  • 11:03and so this captured by the left here.
  • 11:06Then you can see here is a from them.
  • 11:10So here this is the symptom onset
  • 11:12and between exposure and symptoms
  • 11:14answer discord, incubation period.
  • 11:16So generally this is about 5 days and then
  • 11:19from the exposure to presymptomatic onsite.
  • 11:22This so this part period
  • 11:24called latent period.
  • 11:25So that means a patient is infected
  • 11:28and but the person is not transparent,
  • 11:32doesn't transmit the disease, and so.
  • 11:35Between the pre symptomatic period
  • 11:372 symptomatic period so this period,
  • 11:39the person,
  • 11:40even though a person doesn't have
  • 11:42a symptom and but very load,
  • 11:44is high enough and then the person
  • 11:48could become.
  • 11:49Infectious and so this period is
  • 11:51about two days between presymptomatic
  • 11:53two symptomatic and between
  • 11:55exposure to pre symptomatic period.
  • 11:57This is about 3 days,
  • 11:59and so we built in those components
  • 12:02and in the model,
  • 12:03and so we introduce this
  • 12:05presymptomatic compartment,
  • 12:06and also because at that time the testing
  • 12:09case will not that widely available,
  • 12:12and so therefore there were a lot
  • 12:14of cases which were uncertain,
  • 12:16and so therefore we built in this.
  • 12:20Unstained components,
  • 12:20the observed data here those are the
  • 12:23observed data and then those are.
  • 12:26Then we construct all those components and
  • 12:28drew the deep partial differential equation.
  • 12:31So this is isolation component.
  • 12:34And so so here the we
  • 12:36after you feed this model,
  • 12:39one can construct the
  • 12:41reproductive number or value.
  • 12:43So basically,
  • 12:43as I mentioned,
  • 12:45the datas are here and those are the data.
  • 12:48The rest part are basically coming
  • 12:50from the partial differential
  • 12:51equations and then we fit the
  • 12:54partial differential equation,
  • 12:55Apostle model and using MCMC and
  • 12:57the Gulf Coast parameter estimate.
  • 13:00And so here you can see that before
  • 13:03the intervention on our journey,
  • 13:05then if there was suppose there
  • 13:08were no intervention,
  • 13:09then this will the blue curve will
  • 13:12be the predicted number of infected
  • 13:15cases and so you can see that 170% of
  • 13:18the Wuhan population were infected.
  • 13:20That will reach the natural
  • 13:22herd immunity and so now has a
  • 13:2510 million population size.
  • 13:27Then that means 7,000,000 people need.
  • 13:30Need to be infected in order to
  • 13:32reach the herd immunity and so
  • 13:34this is not a good strategy and
  • 13:37many old people will die and the
  • 13:39sudden after supposed when user,
  • 13:41social distancing and lock down
  • 13:43and so you can see from Wuhan the
  • 13:46number of cases still going up
  • 13:48but not as fast as the without
  • 13:50intervention but after the.
  • 13:52Centralized isolation quality on top
  • 13:54of the social distancing and the
  • 13:56number of cases went down very quickly.
  • 13:59And so this is what what they
  • 14:02did in one hand.
  • 14:04So if the case subject was tested
  • 14:07positive and this person to patient
  • 14:09was admitted to the field hospital,
  • 14:12and so this is different from
  • 14:14the US in the sense that in the
  • 14:17last spring and US also build
  • 14:20multiple field hospital,
  • 14:22but they only admit as a severe diseases,
  • 14:25so amount and diseases cases
  • 14:27they were isolated at home.
  • 14:29And so that these notes are isolation at
  • 14:32home could still infect the family members.
  • 14:35And but in Wuhan all the mild cases and
  • 14:38were admitted to the field hospital,
  • 14:41so they were monitored.
  • 14:43If anybody became severe and the
  • 14:45patient was transferred to the ICU,
  • 14:47the regular hospital and for the
  • 14:49people who had a symptom and but who?
  • 14:53Because at that time there were
  • 14:55not enough testing kits and so
  • 14:57they were quitting the odds of
  • 14:59hotels and will University dorm.
  • 15:02If so, all the children and stay with
  • 15:05the parents so the family were together.
  • 15:07If anybody became a test positive
  • 15:10and the person was transferred to
  • 15:12the field hospital in two weeks,
  • 15:15if a person was tested negative
  • 15:17and the person went home,
  • 15:19and similarly for close contact and they
  • 15:22were quarantined as a hotel as well.
  • 15:25I see how can I ask a very basic question.
  • 15:28If I look at the Group One Group
  • 15:30two Group One is confirmed.
  • 15:32I guess if it means if you
  • 15:34perform some kind of PCR test
  • 15:36you are positive and Group 2 is
  • 15:37with symptom but not confirmed.
  • 15:39Can it go the other way?
  • 15:41Can you first be confirmed
  • 15:43but with no symptoms?
  • 15:45Oh yeah, I think there were cases.
  • 15:49Possible that would have had
  • 15:51no symptoms but test positive,
  • 15:53but at that time because they were
  • 15:55not enough testing kids so many
  • 15:58of the cases and who were able
  • 16:00to be tested at the same time,
  • 16:02so that's why there were lots
  • 16:04of undetected cases.
  • 16:05I said thanks.
  • 16:09And so this strategy worked
  • 16:11quite well in Wuhan.
  • 16:13So in less than two months they
  • 16:17reach 0 confirmed case and then by
  • 16:20March 18 and by April 8 and after.
  • 16:24To confirm the case for three weeks
  • 16:26and then the city was reopened.
  • 16:29So it's a whole thing only took
  • 16:31in two months and so the first
  • 16:34take home take home messages,
  • 16:36the social distancing and centralized
  • 16:38isolation quarantine were critical
  • 16:39for controlling the outbreak,
  • 16:41so using the social distancing
  • 16:43alone that help but was not good
  • 16:46enough so that helped make our
  • 16:49reduce around 1:00 and but did
  • 16:51not bend the curve and the reason
  • 16:53is the there were lots of the.
  • 16:57Community transmissions on the
  • 16:59social distancing help block the
  • 17:01community transmission that is
  • 17:03between household transmission but
  • 17:05within family transmission and cost.
  • 17:07Place transmission with common
  • 17:09and so so the social distance.
  • 17:12Distancing does not help block that and so,
  • 17:16especially with how many families are
  • 17:19multi generation families and they
  • 17:22live in apartment so it compared to
  • 17:24US as even harder under to isolate.
  • 17:28At home.
  • 17:29And the idea that centralized isolation,
  • 17:31creating to social distancing that help
  • 17:34bend the curve and stop the pandemic.
  • 17:38And so we validate those findings
  • 17:40and in other countries last spring.
  • 17:42So if you look at the curve on the left,
  • 17:45that is a Italy data.
  • 17:47So you can see that Italy also did
  • 17:50the social distancing that help reduce
  • 17:52the R and the R curve lingered around
  • 17:55one for over a month and did not bend
  • 17:58the curve but and also on the right
  • 18:01that Germany data in the spring.
  • 18:04The same thing under the curve did not band.
  • 18:09And the second feature from
  • 18:11analyzing the Wuhan data is the
  • 18:13Covic is highly converged on the,
  • 18:15so we estimated about 87% of
  • 18:17the cases were undetected.
  • 18:19So in other words,
  • 18:20the detected cases was only
  • 18:22the tip of the iceberg,
  • 18:24and so you can see that on the on
  • 18:28the left and the right bars are
  • 18:31the detected cases and then the.
  • 18:34Yellow bars also uncertain the cases,
  • 18:36so we estimated on the we estimate
  • 18:39entertainment rate and so we estimated
  • 18:42about 87% of the cases were uncertain,
  • 18:45and many of those cases were asymptomatic
  • 18:47or mildly symptomatic cases.
  • 18:49By adding the yellow and red that
  • 18:52can give us a prevalence estimate
  • 18:54that is about 2.5% in one hand,
  • 18:57and so this is similar to the
  • 18:59theological studies based on antibodies,
  • 19:02and that was about 3%.
  • 19:04And then US result very similar.
  • 19:07the CDC did theological study last
  • 19:10year and then the estimated about
  • 19:13862 twenty times the number of
  • 19:15cases were six to twenty time of
  • 19:19the cases which were reported.
  • 19:23And also those undetected tasted
  • 19:25post a high risk of resurgence
  • 19:27if one reopened too early,
  • 19:28lifting the controls, and so we estimate
  • 19:31the probability of the researchers.
  • 19:32Think about this is the first
  • 19:34day and one has a confirmed case.
  • 19:37When has a confirmed case.
  • 19:38It doesn't mean there is no
  • 19:40case at all because there are
  • 19:42still a lot of undetected cases.
  • 19:45And suppose when we open in 14
  • 19:47days by lifting all the control
  • 19:49measures and the first strategy is
  • 19:51after the first day observing the.
  • 19:53Zero confirmed Case No matter
  • 19:55whether the second day has the case
  • 19:58or not and when to reopen info.
  • 20:00This and the second strategy is one has
  • 20:04a confirmed 0 case for 14 consecutive days.
  • 20:08That basically means 000 our
  • 20:11way through and what is.
  • 20:13Research is probability.
  • 20:15So that is what we found that if one
  • 20:19reopen in 14 days after the first day,
  • 20:22observe 0 case.
  • 20:23So that means it can be zero and 120
  • 20:26again in this type of situation then
  • 20:29the researchers probability is 97%
  • 20:31and if one observes the zero case
  • 20:34for 14 consecutive days and then
  • 20:36the resurgence probability is 32%.
  • 20:38So what is tell us is we need to be
  • 20:41management and don't reopen too early.
  • 20:44So this is happened last.
  • 20:46May and many of the state in the
  • 20:49South do it reopened too early.
  • 20:51Then we saw those cases are searched
  • 20:53in the in the summer in the South.
  • 20:56So what's the take home away?
  • 20:59Take a take away message on the
  • 21:02number 2 is to control the pandemic.
  • 21:05A single control measures not enough
  • 21:08money to use multiple control measures,
  • 21:11and including the mask,
  • 21:13wearing social distancing and
  • 21:15massive testing,
  • 21:15contact tracing and also supported
  • 21:18isolation and quarantines and also
  • 21:20effective treatment and also the vaccine.
  • 21:23And so the in the JAMA paper
  • 21:26we call it multi faceted.
  • 21:28Intervention and then later on in
  • 21:31the summer and people give it a nice
  • 21:34name and called the Swiss cheese model.
  • 21:36So that is a nice name and so the
  • 21:40challenge is we we know those
  • 21:42in control measures,
  • 21:43but it's difficult to implement
  • 21:45those control measures and also keep
  • 21:48high compliance in many countries.
  • 21:49So the defining challenges the
  • 21:51public house control,
  • 21:53measure implementation and then
  • 21:54keep up with the compliance and
  • 21:57also the vaccine definitely is.
  • 21:59Really critical and we.
  • 22:01We know that by now there are
  • 22:04two successful ovac seen,
  • 22:06one in US1 is the face by Pfizer,
  • 22:09the other is more donor under
  • 22:11the efficacy is 95%.
  • 22:12This really really amazing
  • 22:14scientific advance.
  • 22:15Developing the vaccine in such a short time.
  • 22:20Under so we also.
  • 22:22Last spring we also developed a
  • 22:25website on that help estimate the
  • 22:27RT value as a different resolution
  • 22:30at the for different countries,
  • 22:33States and counties and so you can see that.
  • 22:38But we copies are key curve and
  • 22:42for a different.
  • 22:44Reach reaches and so this work
  • 22:46was led by Andy Xu, my student,
  • 22:49and she lucky there my poster.
  • 22:52And so this website was featured on
  • 22:55the in Nature Article last summer.
  • 22:59So how do we fit this model?
  • 23:02So because there are lots of data points,
  • 23:05so we want to estimate the curve so
  • 23:08therefore instead of using the partial
  • 23:11differential equation model and we
  • 23:13extended this epidemic model which was
  • 23:15originally proposed by query in 2013,
  • 23:18and so the model in this type of
  • 23:20epidemic model is quite different
  • 23:23from the traditional logistics
  • 23:25traditional statistical model.
  • 23:26So we need to build in the.
  • 23:30If the. An infectious component,
  • 23:33so here is supposed.
  • 23:34Why is the number of cases so think
  • 23:37about the number of cases and for
  • 23:40each day in Connecticut and then
  • 23:42one first need to calculate this
  • 23:44Lambda T and this Lambda T is called.
  • 23:47Basically calculates the number
  • 23:49of people at risk,
  • 23:50so that is calculated using the products
  • 23:52of the serial interval distribution.
  • 23:55Multiply the number of cases in the
  • 23:57previous period said like 7 days and
  • 24:00then the Ark is a parameter one moment.
  • 24:03Estimate so in the original model,
  • 24:05the estimate RT at each time point
  • 24:07that estimate a lot of parameters,
  • 24:09and then when building this
  • 24:11person model and with Lambda T as
  • 24:14offset and RT as a parameter.
  • 24:16But they asked me lots of parameters
  • 24:18and one also account for the
  • 24:20reporting deley by using a lag.
  • 24:22So we what we did here with we try to.
  • 24:27Accommodate on the Covic features
  • 24:29and so we estimate us zero interval
  • 24:32distribution and from this comma
  • 24:34distribution using the paper in
  • 24:36publishing in nature method and
  • 24:38then in order to estimate RT as many
  • 24:41many values and we assume a curve
  • 24:44and the estimate by using a spline.
  • 24:47So there are few angle in work
  • 24:49and so we want to estimate RT as a
  • 24:52function but cover it and also the
  • 24:56in the traditional epidemic model.
  • 24:58One assumed answer.
  • 24:59Him and trade is a constant
  • 25:02overtime and so the,
  • 25:03but in practice the entertainment
  • 25:05rate is not constant,
  • 25:07especially when the number
  • 25:08of tests are goes up.
  • 25:10What number of positive test
  • 25:12rate goes up and then uncertain
  • 25:15manner it will get better and.
  • 25:17So we want to answer payment way to
  • 25:20be a function of the coverage and
  • 25:23also we want to instead of fixing the
  • 25:26reporting deal if we want to use the
  • 25:29data to model the reporting deley
  • 25:31and using all those met component,
  • 25:33we can estimate the prevalence.
  • 25:35So here are some preliminary result and so.
  • 25:39The the code so you this is for the US
  • 25:42data you can see right now and many
  • 25:45countries a number of cases between
  • 25:47being going down really nicely and
  • 25:49so the current USRT value is about .78,
  • 25:52and so we hear when you can see
  • 25:54we have this arty curve that
  • 25:57expanded so below 1 now.
  • 25:59And also you can see the number of new
  • 26:01cases have been going down and also
  • 26:04the number of deaths has been going down.
  • 26:07But there is a lag between the best.
  • 26:09Under the case. And also this is
  • 26:13the state level are key value.
  • 26:15So just give example like for California
  • 26:18you can see that the art in California is
  • 26:21about .67 and so does this very nice banded
  • 26:24curve for the cases and also for that.
  • 26:28So now let me talk about the what are the
  • 26:31factors associated with Covic infection.
  • 26:34So as we start from the Wuhan data,
  • 26:37then I'll move to the US data.
  • 26:40So the data we estimated the
  • 26:43attack rate on the my age.
  • 26:45So you can see that the each
  • 26:48of the period separately,
  • 26:50and so you can see that for the
  • 26:53older people that purple and yellow,
  • 26:55and then the tax rate was.
  • 26:58Much higher than the younger people,
  • 27:00and so this is a good lesson.
  • 27:03And in the spring last spring,
  • 27:05then later on, as you know,
  • 27:07like in US and there were more cases.
  • 27:11Elderly cases in the spring and but
  • 27:14then the elderly is become very careful
  • 27:17and try to protect themselves and
  • 27:20most of the cases in the summer and
  • 27:22also in the fall were younger people.
  • 27:25And then on the right that shows that
  • 27:28the male and female that you from one
  • 27:31day to the attack rate was similar.
  • 27:34But health care worker,
  • 27:35the purple bar has much higher
  • 27:38infection rate,
  • 27:39especially before the intervention,
  • 27:40and then after interventions.
  • 27:42Acrid among the health care worker,
  • 27:45and was better and so that calls
  • 27:47for the importance of the PPS
  • 27:50and before the intervention.
  • 27:52People were not aware of the Covic,
  • 27:55and so, therefore, is this not many people,
  • 27:58not many health care workers.
  • 28:00Hard to pee pees.
  • 28:02So I give a talk on the Wuhan finding on
  • 28:06the March just before the school public.
  • 28:09Just before Harvard started the spring
  • 28:12break and throw in one of the slides,
  • 28:14I showed that the on the day
  • 28:17before the ABC News,
  • 28:19there's one picture of
  • 28:20the health care workers.
  • 28:22And so I showed up there so the
  • 28:24the the health care worker will
  • 28:26not properly protected in US and
  • 28:29so they had no protection suit
  • 28:31and no face shell for example.
  • 28:34And then the infection could
  • 28:36be go through eyes.
  • 28:38I did not realize that those three
  • 28:40slides on the showing the health care
  • 28:42workers not properly protecting US were
  • 28:45widely distributed during the weekend.
  • 28:48So the March 13 was a Friday.
  • 28:50Then on March 16, that was a Monday,
  • 28:53and so there was a national campaign on
  • 28:57the protection of health care worker,
  • 28:59which comprehensive PP is.
  • 29:01And so the in short time and there were.
  • 29:04Over 1.7 million Xan signatures
  • 29:07and sold in this.
  • 29:09No, the this picture was taken from my talk.
  • 29:12So during that period I got to
  • 29:14know a lot of health care workers
  • 29:17and many of them and wrote to me
  • 29:20and so it's it's kind of like a.
  • 29:22Nice to see,
  • 29:23like a little statistical
  • 29:25analysis and could
  • 29:26help the community.
  • 29:27And also in the spring on the so
  • 29:30I did something that station are
  • 29:33supposed to do and so that we spend
  • 29:37quite a bit time how working with
  • 29:41the state of Massachusetts and also
  • 29:43with abroad and so helping shifting
  • 29:46the PPE under swap on from China.
  • 29:49And so I was on the state, Massachusetts.
  • 29:53The task force in the spring and then like.
  • 29:57One thing I was really touched
  • 29:59last spring was.
  • 30:00Many, many peoples and step in to help
  • 30:03without asking expecting any credit.
  • 30:05So they really a wonderful experience
  • 30:08and by working with so many peoples
  • 30:11and who stepped in to help and so
  • 30:13like in the screen 'cause there
  • 30:16were not many flight from China to
  • 30:18US so was difficult to shift under
  • 30:21those medical supplies and two US
  • 30:23and then so was really wonderful.
  • 30:25Many people help out and so you can
  • 30:29see that there were four flight.
  • 30:32Shifting the usapyon swap watered
  • 30:34by the state of Massachusetts
  • 30:36from Shanghai to Boston under the
  • 30:38first was the flight of the first
  • 30:41flight leaving could own,
  • 30:42and because there were not many
  • 30:45commercial flight available
  • 30:46and travel flight available,
  • 30:48so this flight was converted from the
  • 30:50from the Air Canada Flight and the
  • 30:53passenger flight to a charter flight
  • 30:56and then the picture on the right is
  • 30:59the first flight arriving Boston.
  • 31:03And also I the innerspring or we
  • 31:06launch how we feel up and so this
  • 31:10app collects the information about
  • 31:12the Covic 19 symptoms and behaviors
  • 31:16and also testing a result.
  • 31:18And so this was in collaboration
  • 31:21with some junk.
  • 31:23Many of you probably know fun by
  • 31:26his work in CRISPR editing and
  • 31:29gene editing and CRISPR and also
  • 31:32also banned Superman.
  • 31:34Who is the CEO of country?
  • 31:36So this is really a great.
  • 31:39Collaboration between academia
  • 31:40and industry 'cause we are not
  • 31:43very good at developing up,
  • 31:45but people in industry.
  • 31:47They're much better developing app.
  • 31:49So so many volunteer helping with
  • 31:51this how we feel project and we
  • 31:54build a nonprofit organization and
  • 31:56with so many volunteers and then
  • 31:59this app has over 750,000 users
  • 32:02and also 50 million responses.
  • 32:04And so I'll present some of those
  • 32:07results and this is the first paper.
  • 32:10Out of this,
  • 32:11how we feel project was published in Nature.
  • 32:14Human behavior last summer.
  • 32:17So here last spring,
  • 32:18who were more likely to be tested?
  • 32:21And it turns out that people
  • 32:23who had symptoms,
  • 32:24CDC symptoms or health care workers and
  • 32:27essential workers and people of color,
  • 32:29they were more likely to be tested.
  • 32:31So that makes sense,
  • 32:32because in the spring the testing
  • 32:35kids were not as widely available,
  • 32:37so the vulnerable group should
  • 32:39have priority to be tested.
  • 32:40And so this also present analysis challenge,
  • 32:43because the people who were
  • 32:45tested or likely to be sicker.
  • 32:47And so therefore this is not a random sample,
  • 32:50so when we studies Association between the
  • 32:53factors associated with the infection,
  • 32:55we have taken into account
  • 32:57that people who were tested
  • 32:59was not a random sample and so therefore
  • 33:01in the analysis we use the inverse
  • 33:04probability weighted procedures and
  • 33:06to account for the selection bias.
  • 33:08So we found that male with a higher
  • 33:11risk of infection than females.
  • 33:13And also we found that people of color
  • 33:16were at higher risk of infection.
  • 33:19And also the essential workers
  • 33:21and health care worker and these
  • 33:24were at higher risk of infection.
  • 33:26Also, we found another household
  • 33:29exposures and and also community
  • 33:31exposure are significant risk factor
  • 33:33for infection and so you can see that
  • 33:37for the household exposures after
  • 33:38the show is almost 17 for Community
  • 33:41exposures as we show almost three.
  • 33:44So So what that mean is we need
  • 33:47to break the within household
  • 33:50and close place transmission and.
  • 33:53Cluding, the nursing home,
  • 33:55homeless shelters and prisons,
  • 33:57and so and also we need to control
  • 34:00the community transmission and so
  • 34:02this finding was supported by the
  • 34:05Massachusetts data that Massachusetts
  • 34:08last year reported that almost 90%
  • 34:11of covid cluster were household.
  • 34:13So what that mean is household
  • 34:16transmission is dominant is prevalent.
  • 34:18Dominant lots of transmissions.
  • 34:20And also we found the most important
  • 34:23symptoms and was not the fever and
  • 34:26cough was lots of peace and smell.
  • 34:29So in particular we found out
  • 34:31ratio is almost 33 associated with
  • 34:34loss of taste and smell.
  • 34:36About 40% of those who were past
  • 34:39positive had lost of taste buds,
  • 34:41taste and smell.
  • 34:42Among those who are not testing about
  • 34:466.6% among those who are test negative.
  • 34:49That was about 5%.
  • 34:50So this is an important symptom is also
  • 34:54distinguished from the flu symptom.
  • 34:56Then we also build a prediction
  • 34:59model giving there were not enough
  • 35:01tests available and then can we
  • 35:04use the screening on two and two?
  • 35:07Predict whether a person is
  • 35:09likely to be infected or not.
  • 35:12So by using the CDC symptom,
  • 35:14you can see that the RC
  • 35:16curve is AOC is about 70%.
  • 35:19Using all the variables and it's
  • 35:22about 80% if we use a simpler
  • 35:25model only used for variable,
  • 35:27including the three exposure
  • 35:28variable and also the loss of taste,
  • 35:31smell,
  • 35:32the symptom variables and then
  • 35:34you can see the AOC is also 80%.
  • 35:37And so this is very simple model but has
  • 35:40very good predictability for infection.
  • 35:42And when we build this model we use.
  • 35:45This actually proves a boost that is a
  • 35:48scalable gradient tree boosting method.
  • 35:51So now let me talk about the
  • 35:54defending challenge on the in 2021.
  • 35:57So first is the vaccine rollout and optic,
  • 36:00so the science was really
  • 36:02wonderful last year,
  • 36:04so developing the vaccine such as
  • 36:06short time with such high efficacy,
  • 36:09that's really amazing and so.
  • 36:12So the challenge is the vaccination program.
  • 36:15So basically,
  • 36:16how can we get the vaccine into people's arm?
  • 36:20And so so that basically includes the
  • 36:23distribution and also the administration.
  • 36:26Also,
  • 36:27it's important to have equitable
  • 36:29and scalable vaccination,
  • 36:31and also is important to
  • 36:33overcome vaccine hesitancy.
  • 36:34I'm going to focus on this one.
  • 36:39And the second defining challenge is
  • 36:43the massive scalable testing and so.
  • 36:47PCR test yes, a gold standard,
  • 36:50but it is expensive and to do
  • 36:53the massive regular testing.
  • 36:55So I'm going to talk about
  • 36:58efficient testing strategy using
  • 37:00the pooled testing and also the
  • 37:03other strategies rapid testing.
  • 37:05And the third component is the
  • 37:07implementation and compliance of
  • 37:09public health control measures.
  • 37:10So if you look at a quick job
  • 37:13of the cases in January is not
  • 37:16likely to do to the vaccine,
  • 37:19because only less than 10% of the
  • 37:22US population had been vaccinated.
  • 37:24I seem like the last last month and
  • 37:27the the implementation and compliance
  • 37:29and control measures and became
  • 37:32better and people pay more attention
  • 37:35to the behavior changes so that.
  • 37:37Definitely is an important message.
  • 37:41So let's look at the vaccine rate and so.
  • 37:45Overlap and this is from the one word data.
  • 37:48You can see.
  • 37:49Israel is definitely the role model
  • 37:51and so the right now they have an
  • 37:54average 70 doses and per 100 people.
  • 37:57And so after we account that some people
  • 38:00have two doses on average about 40%
  • 38:03people in Israel had been vaccinated
  • 38:05and with so that's really amazing.
  • 38:07And you have this less than 10%
  • 38:10if on the right you can see we
  • 38:12have a serious equity issue.
  • 38:15And in particular,
  • 38:16you can see basically nobody
  • 38:18in Africa has been vaccinated,
  • 38:20so that's really not good.
  • 38:23So the another defining challenges
  • 38:26vaccine hesitation.
  • 38:27So in order to achieve the
  • 38:29vaccine induced herd immunity,
  • 38:31we need to overcome vaccine hesitation.
  • 38:34So I'm going to present the
  • 38:36findings and from the how we feel
  • 38:39data show McCabe is my Postal.
  • 38:42He take a lead in this work in
  • 38:45collaboration with many colleagues.
  • 38:48So here is a way, a lunch,
  • 38:51the Maxim question in how we
  • 38:53fill up in early December.
  • 38:55So with here the result of analyzing the
  • 38:58first month data about 30,000 people.
  • 39:01So we develop a partnership with
  • 39:04Kinetica and last spring and so
  • 39:06that's why you can see we have more
  • 39:09respondents and in the kinetica,
  • 39:11and also because the countries
  • 39:13is located in California.
  • 39:15So we had more respondent respondent.
  • 39:18In California,
  • 39:19so if you look at overall vaccine hesitancy,
  • 39:23hesitancy read,
  • 39:24you can see like thoughts are more hesitant,
  • 39:27and so overall the vaccine hasn't
  • 39:30hesitancy rate is about 1818% from the
  • 39:33hallway field data and 82% on the.
  • 39:37What said they are likely were
  • 39:39more likely to take the vaccine.
  • 39:42So if you look hard, um,
  • 39:44vaccine hesitancy rate by race and ethnicity,
  • 39:47then you can see that people of color
  • 39:50are much more likely to be vaccine hesitant.
  • 39:53So in particular,
  • 39:55if you look at a black for example,
  • 39:58the vaccine hesitancy is is all.
  • 40:0046%, almost 50% so so compared
  • 40:03to white is about 15%,
  • 40:06but compared to Hispanic,
  • 40:08about 30% you can see a large
  • 40:11fraction of them are undecided group.
  • 40:14So what that mean is that a
  • 40:18community engagement through
  • 40:19the education of outreach is important.
  • 40:23To overcome vaccine hesitancy,
  • 40:25so here are the results.
  • 40:27Who are more likely to be vaccine
  • 40:30hesitant and so we found the
  • 40:32younger people are more likely to
  • 40:34be a vaccine hesitant and females,
  • 40:37and also health care worker essential
  • 40:40workers and also the people of color.
  • 40:43In particular,
  • 40:44black people are 3.5 times more likely
  • 40:47to be vaccine hesitant than white,
  • 40:49and people with pre existing
  • 40:52conditions and low income.
  • 40:54And also rural areas and also the
  • 40:57thoughts and also places with high kufic
  • 41:01burden and also the people who they are.
  • 41:05So those are more likely to be vaccine
  • 41:09hesitant people who wear masks and
  • 41:12also use the protective measures.
  • 41:15They are less likely to be vaccine hesitant.
  • 41:20Talk to us in summary.
  • 41:22So the the vulnerable group are
  • 41:24more likely to be vaccine hesitant,
  • 41:27and so they include people of
  • 41:30color health care worker,
  • 41:32essential worker and the young people female
  • 41:35and the regions with high kovik burdens.
  • 41:38And also the people with
  • 41:40pre existing conditions,
  • 41:41parents and low income.
  • 41:44And also people not using
  • 41:46the protective measures.
  • 41:48So an Irish last late last year,
  • 41:51the lunch, a community engagement alliance.
  • 41:54And so this is,
  • 41:56uh,
  • 41:56involved multiple centers and the one
  • 41:59of the goal is to do the Community
  • 42:03engagement to help with participation
  • 42:06in clinical trial and also.
  • 42:09Overcome the vaccine hesitancy.
  • 42:13So what this tells us is community
  • 42:17engagement for vaccination,
  • 42:18of which an education is important,
  • 42:21so that Pic home number 5 is important,
  • 42:24remained bigil and to scale up scale
  • 42:27up the control measure and vaccination
  • 42:30by protecting the vulnerable group,
  • 42:33including the health care workers
  • 42:35and essential workers and elderly.
  • 42:37And also it's important
  • 42:39to reach the zero kovik.
  • 42:42So what that mean is.
  • 42:44We need to be careful and reopen
  • 42:46slowly when the number of cases are
  • 42:49sufficiently small and also with
  • 42:51the control measures are so if when
  • 42:54we opened too early and we slipped
  • 42:57in the control measure like what
  • 42:59happened last summer and in the South,
  • 43:02and is likely to see the researchers
  • 43:04and also is important to pay
  • 43:06attention to the long color,
  • 43:09the long term effect especially among the
  • 43:12young people and then also the to a build.
  • 43:15I've seen uptick and it's important
  • 43:18to have community engagement and
  • 43:20outreach and build public trust.
  • 43:23So basically, how can we implement the?
  • 43:27Control measures and also implement
  • 43:30vaccination and ensure high compliance
  • 43:33is the defining challenge this year.
  • 43:36And the truth,
  • 43:38the other component is for this year is
  • 43:41how can we boost the testing capacity
  • 43:44and buy a cover by doing more test.
  • 43:47And so because it's uh if one needs
  • 43:50to do the test frequently and to do
  • 43:53the PCR test is difficult to to do
  • 43:56that for many institution because
  • 43:59it's costly and so they put the
  • 44:01testing provide an alternative.
  • 44:03So I'm going to talk about this
  • 44:06efficient put testing.
  • 44:07A design using the hyper
  • 44:09graph factorization first.
  • 44:11What is the protesting?
  • 44:12The goal is that would put testing is
  • 44:16to screen a large population with a few
  • 44:19tests and giving the limited resources.
  • 44:22So this will help reopen the school
  • 44:25safely and the simple idea is used.
  • 44:28This uh document design sofa.
  • 44:30Suppose we have 100 people and we
  • 44:33do 20 tests and then so we pulled
  • 44:37the people sample into different.
  • 44:39Pools and suppose there's only one case,
  • 44:42and then we test each pool support.
  • 44:44Each pool has a 10 people.
  • 44:46And then we tested each put do 10 pull
  • 44:49test and how we found this cool is costing.
  • 44:53Then we test every individual
  • 44:55in this pool so in.
  • 44:57Therefore instead of doing 100
  • 44:58test you only do 20 tests and
  • 45:01so this is the basic idea.
  • 45:03Put testing.
  • 45:04So what is the limitation of this
  • 45:07simple of protesting design?
  • 45:09And so this document design
  • 45:11allow one individual go to
  • 45:13one pool that is Q equal to 1,
  • 45:16then cycle through the pool until
  • 45:18all individuals are assigned.
  • 45:20So if you look at this example
  • 45:23with eight subjects and six pool,
  • 45:25then you can see that we assign
  • 45:27the face first six subject to the
  • 45:306 four ABCDEF and then recycle
  • 45:33the segments and each subject.
  • 45:35And do the pull A&B so only one person
  • 45:38per pool, so this is not optimal,
  • 45:41only one pool,
  • 45:42only one pool per person and this
  • 45:44could lead to a non redundancy
  • 45:46that also reduce the sensitivity.
  • 45:48So the question is can we do better?
  • 45:51Can we assign each individual
  • 45:53to more than one pool?
  • 45:54That basically makes a Q equal to two.
  • 45:57So let's start from something
  • 45:59like if there's a safe assign
  • 46:01one person to two pools.
  • 46:03So for example I assigned
  • 46:05the first person to.
  • 46:062A B second person to put a C and
  • 46:09third person to pull busy and so on
  • 46:12and then cycle through the order.
  • 46:15So that basically this idea
  • 46:17assign each person to two pools.
  • 46:19What is the problem?
  • 46:20The problem is by doing this simple
  • 46:22way the design is not balanced.
  • 46:25You can see that pull it has
  • 46:27five subjects and puppy has four
  • 46:29and pull up as only one subject.
  • 46:32Because when when does them the
  • 46:34pulling and by assigning one
  • 46:36person to more than one pool
  • 46:38while need to dilute the sample.
  • 46:40So if one has a different solution for
  • 46:43different pools that will affect the.
  • 46:45Accuracy under then the sensitivity.
  • 46:47So can we do better so that is
  • 46:50a basic idea of a more balanced
  • 46:52design we call the hyper design.
  • 46:55So this using the hyper graph factorization.
  • 46:58So the basic idea is we want to
  • 47:01make the spell is that possible?
  • 47:03So for example like here you can see
  • 47:07that assigned person A to pull a BE
  • 47:10person B person to pull CD person
  • 47:123 two pull ENF person four to pull.
  • 47:15PNC Person 5 to pull the D&F and so
  • 47:19on and so this. This idea is after you.
  • 47:22Each pool has four samples and
  • 47:25so you can see for the 1st pool
  • 47:28and the test is negative.
  • 47:30The second pull the test is positive.
  • 47:323rd pool passes positive and so on.
  • 47:35Then afterwards we do the pool testing
  • 47:38and then we can decode to see that
  • 47:41which person is likely to be a case.
  • 47:44And here you can see that.
  • 47:46After do the decoding person 3, four,
  • 47:49and seven are likely to be a positive,
  • 47:53and then we test each of them
  • 47:55individually and find out persons.
  • 47:581 Seven are the cases and so why
  • 48:00it is called hyper graph design.
  • 48:04And that because this is related
  • 48:06to the hypergraph,
  • 48:07and in complete awe metrics
  • 48:10and so you can think about this
  • 48:13as the six pools are the six.
  • 48:16Vertex is under the edges are the
  • 48:19people and soap example like a person.
  • 48:22One will assign this.
  • 48:23This is edge person one and so
  • 48:26that's assigned the pool at A&B
  • 48:29and then person two assigned to
  • 48:31C&D and so so this is so that's
  • 48:34why it's called a hypergraph.
  • 48:36So basically what we do is we
  • 48:38need to assign the individuals
  • 48:40and in the right sequence to make
  • 48:43them as balanced as possible and
  • 48:45not overlap as much as possible.
  • 48:47And so by doing this design,
  • 48:50when we kill equal to 216 pool,
  • 48:53we have 5 factorizations and
  • 48:55so you can see for
  • 48:57each factorization there's no overlap and
  • 49:00and also between every two consecutive.
  • 49:03Assignment under then there
  • 49:05is no overlap as well.
  • 49:08And by doing this hypergraph designs
  • 49:10and so you can see that we can
  • 49:13have a balanced pool and also is
  • 49:16very easy to implement and so this
  • 49:19Calculator and so and also very easy
  • 49:22to decode and so this is based on
  • 49:26the company company Atomic Comics.
  • 49:29Population so we can do this calculations
  • 49:32and for Q equal two and three,
  • 49:34but for Q equals greater than three,
  • 49:37the calculations much more challenging.
  • 49:39And so by doing that then you can see that.
  • 49:43And here we plot out the.
  • 49:46Efficiency against the prevalence,
  • 49:47so only if the prevalence is
  • 49:50low is worthwhile to do.
  • 49:51Put testing if the prevalence is high,
  • 49:53there's no need to do put testing so you can
  • 49:57see that doing the hyper design and it is.
  • 50:01Efficient and then the efficiency is
  • 50:03almost 6 compared to individual design
  • 50:06and also expect her than a redesign.
  • 50:09That efficiency is 4.6 and when the
  • 50:12preference become higher and then you
  • 50:14can see that the efficiency goes down
  • 50:17and then also comparing the hyper
  • 50:20design with a radius and efficient,
  • 50:22the sensitivity is pretty similar and
  • 50:25also when we have 384 subject per batch
  • 50:28and you can see that the hyper design
  • 50:31still outperformed the other design.
  • 50:34And for the Pytest peoples design.
  • 50:36And so it.
  • 50:39Especially when the prevalence become
  • 50:41higher and then you can see that uh,
  • 50:44sensitivity almost reach to 0.
  • 50:46And so this also thought we look at a
  • 50:49different design in different scenarios,
  • 50:52and we showed that is hyper design
  • 50:54is optimal and in terms of allocating
  • 50:57resources and so here we plot out
  • 51:00the X axis is the total number
  • 51:02of sample collect each day.
  • 51:04Suppose each day we collect 3000 samples.
  • 51:07Suppose we only have the resources
  • 51:09to do 12 foot tests.
  • 51:11Then you can see that efficiency screening
  • 51:14capacity using this Q equal to two is 122.
  • 51:17That is much.
  • 51:18A better and so so then also
  • 51:21if one has a Q equal to three,
  • 51:24that means a law allowing assigning
  • 51:26one person to three pools.
  • 51:28Then in that situation we need to
  • 51:32use the hypergraph and with the.
  • 51:35Those kind of. 20 different hyperedges.
  • 51:43So in summary,
  • 51:44to scale up widespread testing,
  • 51:47hyper this is based on hypergraph
  • 51:49factor factorization design provide
  • 51:51efficient pool design to maximize
  • 51:53the balance and efficiency,
  • 51:55and the protesting is useful when the
  • 51:57prevalence is low when the preferences highs,
  • 52:01there's no need for protesting,
  • 52:03just do the individual testing and we build
  • 52:06a website that allows the investigator
  • 52:09and the two design their own study.
  • 52:13And so to combat kovik and so
  • 52:15we are really in this together.
  • 52:18And so we have to be together
  • 52:21and be stronger.
  • 52:22And so it's important to let the data speak
  • 52:26and also develop evidence based strategy.
  • 52:29And we show that there are
  • 52:31two feature of the Covic.
  • 52:33One is is highly transmissible,
  • 52:36second is highly convert.
  • 52:37And also it's important to remain vigilant
  • 52:40and to use the multifaceted interventions.
  • 52:43And to combat Covid,
  • 52:44and so the there are multiple
  • 52:47defining challenges this year.
  • 52:49One is a Black Max magazine,
  • 52:51distribution, uptake and education.
  • 52:53The other is a scalable testing,
  • 52:55so we talk about the put testing and
  • 52:58so I want to thank the many of the
  • 53:02collaborators and so who made many
  • 53:04contributions to help with the project.
  • 53:07And also there's a quick announcement
  • 53:10and the cops and less at lunch.
  • 53:13This Covic 19 data.
  • 53:14Lisa Weaponer last December and
  • 53:17so this is every two weeks and
  • 53:19on Thursday from 12:00 to 1:00.
  • 53:22And so these are we have the
  • 53:24last few two month. Last month.
  • 53:27We have a wonderful speaker.
  • 53:29Great turn out.
  • 53:30So those are the speaker in the
  • 53:32coming weeks and from Denmark
  • 53:34Mukherjee who will talk about Covic
  • 53:37in Indian and Harvey Fineberg.
  • 53:39Many of you know and he's a
  • 53:42former president of National
  • 53:43Academy of Medicine and also.
  • 53:46Jim Young Kim is a former president
  • 53:48of World Bank and so they're going to
  • 53:51give the next week talks and thank you.
  • 53:55Thanks young for this
  • 53:57wonderful talk is very useful.
  • 53:59I want to weather the audience
  • 54:02have any questions for she home.
  • 54:05Yeah I have a question.
  • 54:07Yes, please song.
  • 54:08So I'm wondering,
  • 54:09will people who are willing to
  • 54:11respond to the how we feel study be
  • 54:14more likely to have lower hesitancy?
  • 54:20Um? I would think the how we feel.
  • 54:24We study people probably.
  • 54:27I would think that
  • 54:29probably likely to be true,
  • 54:32and so the how we feel samples
  • 54:35the because of people use the app,
  • 54:38so at least that they are coping
  • 54:41aware and they think a quickly is
  • 54:44problem and so it's possible that
  • 54:47in the national samples when we have
  • 54:51a more representative samples and
  • 54:53the hesitancy rate may be higher.
  • 55:01Donna has a question.
  • 55:03Yeah, I see her name was
  • 55:05incredible work you've done.
  • 55:07It's just absolutely
  • 55:09phenomenal and breathtaking.
  • 55:10How you've addressed
  • 55:11each issue arising in the kobid epidemic,
  • 55:15one by one and come up with such clarity.
  • 55:20To guide us. So my question
  • 55:22is about the hyper designs.
  • 55:23I've been aware of pool testing,
  • 55:25which I you know.
  • 55:26We all know it's been around for awhile,
  • 55:29but I'm just wondering, you know,
  • 55:31is there like a rule of thumb like
  • 55:34safe the prevalence rate is like 5%?
  • 55:36How many digit
  • 55:37in your graph like how many fewer tests
  • 55:40would you have to use using a hyper
  • 55:42design versus like the standard approach
  • 55:44that you know people would tend to use
  • 55:47which is to just test everybody. Yeah,
  • 55:49so that is if you can see that from here.
  • 55:53Yeah, it's a little hard to
  • 55:56see it's a little small.
  • 55:57Oh this hyper yeah I can the the
  • 56:00so you can see the efficiency
  • 56:02that is about hyper design.
  • 56:04Yes almost six so that means that we
  • 56:07can each task and have 6 people and
  • 56:10by individual design so you can see
  • 56:13that suppose you have 100 people.
  • 56:15This is 96 so 96 / 6 and then then
  • 56:19you can see that that is. How many?
  • 56:23How many fewer tasks it less than 20?
  • 56:26Yes, I think about it.
  • 56:28If you do individual test that is 100.
  • 56:32What is the?
  • 56:33What is the?
  • 56:34I didn't understand really what
  • 56:35the 96 and the 3:50.
  • 56:40Yeah batch, so there are 96 so in the
  • 56:42so if you think about when you win
  • 56:45you do test and then basically you
  • 56:47need to layout the sample in a batch.
  • 56:50If you think about it already
  • 56:52then think about that.
  • 56:53Basically they have eight.
  • 56:54You have to think about.
  • 56:56The Matrix is 8 by 12 matrix.
  • 56:58You put all the samples and
  • 57:00in this 8 by 12 array.
  • 57:05OK, thank you yeah.
  • 57:08And also if you look at the capacity
  • 57:11here you can see the capacity is
  • 57:13much better so you can see that.
  • 57:15Suppose I need to test 3000 people a day.
  • 57:19Execution can only afford half 12 tests
  • 57:22and then you can see the efficiency.
  • 57:25Screening capacity is almost 120.
  • 57:30So that is really good.
  • 57:31That's very, very good, yeah?
  • 57:39So I also have.
  • 57:43You still have a question
  • 57:44or say thank you very much.
  • 57:47OK, so I also have a related question.
  • 57:50So she how you mentioned that before.
  • 57:53A future work you want to perform a
  • 57:57regarding the reproduction number
  • 57:59estimation and this intervention
  • 58:01work is to consider different
  • 58:04other covariates when you are
  • 58:06modeling the reproduction rate.
  • 58:08So I wonder,
  • 58:09have you also considered like trying
  • 58:12to take into consideration different
  • 58:14type of various the mutation of
  • 58:17different various and then maybe certain
  • 58:21various various high reproduction rate?
  • 58:23And perhaps others.
  • 58:25This process?
  • 58:25Yeah, that
  • 58:26is excellent suggestions on the so yeah,
  • 58:29if we could have those data will be great
  • 58:32that we could include those in the model
  • 58:36besides the different type of variance.
  • 58:38And also like the vaccination rate.
  • 58:41That would be a very good variable included.
  • 58:44And so the challenge for us right now is,
  • 58:48as you know UK has been doing
  • 58:50a great job in the sequencing,
  • 58:53viral sequencing and so in other words the
  • 58:56surveillance and sequencing surveillance.
  • 58:58But not US, and so we have not doing a great
  • 59:02job in sequencing and so so therefore the
  • 59:06UK could monitor the new virus and well,
  • 59:09but I think with the one of the things
  • 59:12we need to do this year is to increase
  • 59:16the various viral sequencing capacity
  • 59:18so we could monitor the new variants.
  • 59:21So then also make the data available and to
  • 59:25the public and then that can be included.
  • 59:28In the analysis.
  • 59:30So what I've found that last year and
  • 59:33during the Covic people were much more
  • 59:35willing to share the data computer for an,
  • 59:39though this is really,
  • 59:40really wonderful and also the much
  • 59:43more preprint and compared to before,
  • 59:45and that were posted in about
  • 59:47archive and made archive,
  • 59:49and people were really willing
  • 59:51and to share their findings to
  • 59:54the Community as soon as possible.
  • 59:56So these are really wonderful
  • 59:58spirit about open science and.
  • 60:00And is.
  • 01:00:01Fired on the by many researchers last year.
  • 01:00:05Thanks, that's really informed him.
  • 01:00:10Sorry, go ahead who is
  • 01:00:11trying to ask a question.
  • 01:00:14Me, I mean yeah, I see who I have a
  • 01:00:19question regarding to this pulling pulling
  • 01:00:23test. You said the pulling test and
  • 01:00:26compare with the individual test.
  • 01:00:29The sensitivities are similar
  • 01:00:31and right now I'm thinking
  • 01:00:34if each individual does sensitivity they
  • 01:00:37can be test by individual test one this
  • 01:00:42individual mix with five other. Cure
  • 01:00:45lung disease samples.
  • 01:00:47Basically, the concentration is diluted,
  • 01:00:50so how does sensitivity will be keep
  • 01:00:54the same and how the next ways how
  • 01:00:59to compare if there's one positive
  • 01:01:02case with five individual their
  • 01:01:05normal cases an A normal situation
  • 01:01:08controls and compare with all six.
  • 01:01:12There just get exposure
  • 01:01:14with low concentration.
  • 01:01:16So there will be probably have
  • 01:01:19some sensitivity issue if
  • 01:01:22pulling together compared to
  • 01:01:24individual tests, then the error for
  • 01:01:27measurement error testing error for
  • 01:01:29the two different types of tests.
  • 01:01:33How did you consider them additional
  • 01:01:36to the hyper structured testing?
  • 01:01:39Yes, I think this is a great question.
  • 01:01:42Sorry I did not make that clear.
  • 01:01:44What I meant was that hyper design and are
  • 01:01:47ready that they had a similar sensitivity,
  • 01:01:49but the sensitivity is lower
  • 01:01:51than the individual tests.
  • 01:01:52If you look at the curve in about I see.
  • 01:01:57At the green and red they have
  • 01:01:59a similar sensitivity by the
  • 01:02:01compared to the individual test.
  • 01:02:03That is, this black line and it has
  • 01:02:05higher sensitivity and so then they
  • 01:02:08as you are definitely right when one
  • 01:02:10do one month does the pooled testing
  • 01:02:12because the sample needs to be diluted,
  • 01:02:15so therefore we need to pay a price
  • 01:02:17and then sensitivity will be lower.
  • 01:02:20Yeah, so the overall one the population.
  • 01:02:25Pilots large scale testing.
  • 01:02:27We may have more undetectable test.
  • 01:02:30Think about if six samples always happen
  • 01:02:33is 1 sample has a positive positive case,
  • 01:02:37so we may have some testing error
  • 01:02:40here. Yeah yeah, so yes.
  • 01:02:43Yes, I'm sorry.
  • 01:02:44Can I just jump in for a second?
  • 01:02:47So 'cause I think the comparison as
  • 01:02:50you could compare this hyper design
  • 01:02:52to just testing everybody or the
  • 01:02:54hyper design to the traditional
  • 01:02:56pool testing approach where you
  • 01:02:58just divide 100 people in each one
  • 01:03:00is in a single batch and I think
  • 01:03:03she hung what you're saying and it
  • 01:03:05makes sense to me intuitively is by
  • 01:03:08repeating people in multiple batches
  • 01:03:10were increasing the chances of having
  • 01:03:12doubles and triples in the same batch.
  • 01:03:14And then lowering the chance of having
  • 01:03:17false negatives as opposed to the
  • 01:03:20traditional design where you take the
  • 01:03:23100 people and they're only in one match.
  • 01:03:27Is that right?
  • 01:03:28Yeah, that's so that's right.
  • 01:03:30So you don't want to put a
  • 01:03:32hundred 100 people in one batch,
  • 01:03:34because if you do that,
  • 01:03:36then the sample need to diluted a lot and
  • 01:03:39then you will sacrifice the sensitivity.
  • 01:03:42So that's why when want to
  • 01:03:44do the optimal design,
  • 01:03:46want to account for both the
  • 01:03:48balance and also sensitivity and
  • 01:03:50including both of them and so then.
  • 01:03:52So that's why when we build this when we
  • 01:03:55define this efficient screening capacity,
  • 01:03:57this calculation.
  • 01:03:58That incorporated sensitivity
  • 01:04:00in the calculation as well.
  • 01:04:03I see thanks yeah.
  • 01:04:05So the the pool design.
  • 01:04:07So if you look at the traditional design,
  • 01:04:10so here you can see that each person,
  • 01:04:13the traditional design.
  • 01:04:14Basically each person is assigned
  • 01:04:16to a single pool and so this
  • 01:04:19is this called document design,
  • 01:04:21and so this design so you can
  • 01:04:24see that the six for six people.
  • 01:04:28In this example, like the person, one and.
  • 01:04:32Assigned to a person to assign to P,
  • 01:04:35and so this is not efficient design.
  • 01:04:38And so if we assign each person to
  • 01:04:40multiple pools and after decoding
  • 01:04:42that will improve the efficiency.
  • 01:04:47So generally the cute does
  • 01:04:48should not be too big.
  • 01:04:50So here you can secure equal to 1.
  • 01:04:53That means one person assigned
  • 01:04:54to one pool to equal to 2 means
  • 01:04:57a person sent to two pools and
  • 01:04:59just think about this is very
  • 01:05:01interesting and so you can see the
  • 01:05:03basically using using the graph and
  • 01:05:05the ABCD basically means the pool
  • 01:05:07and each edge indicated person.
  • 01:05:09So you can see this person one
  • 01:05:11is assigned to pull A&B and then
  • 01:05:13says that's why there's edge here
  • 01:05:15and person to assign to C&D.
  • 01:05:17So this person too.
  • 01:05:19And then so on.
  • 01:05:24Thanks young, I have one.
  • 01:05:26I have one last question.
  • 01:05:28If other people do not have
  • 01:05:30more question so I wonder is
  • 01:05:32also related to the sensitivity.
  • 01:05:34I wonder how we considered too instead
  • 01:05:36of using on the testing directly but
  • 01:05:39construct some posterior for each person.
  • 01:05:42Use other covariates including
  • 01:05:43your past history.
  • 01:05:44Whether you have higher risk an.
  • 01:05:48I wonder if we use such
  • 01:05:50personalized information combined
  • 01:05:52with this testing results,
  • 01:05:53can we have better sensitivity?
  • 01:05:57Very good question.
  • 01:05:59Yeah, I can see the potential.
  • 01:06:02I can see the potential for doing that.
  • 01:06:05Yeah, I think they're right now in
  • 01:06:10the screening program and the no.
  • 01:06:13Demographic information is collected,
  • 01:06:15and so only the sample collected.
  • 01:06:18So for example, like abroad they.
  • 01:06:22But bro, the dead on.
  • 01:06:25The spring when we first started
  • 01:06:28it was about maybe a 3000 of
  • 01:06:32sample per day and so right now.
  • 01:06:34As you know,
  • 01:06:36broad cover almost 9025% of the testing,
  • 01:06:39and in the New England areas almost
  • 01:06:43cover like 3,000,000 and test and so
  • 01:06:46the event of data were lots of data.
  • 01:06:50Sand were collected and.
  • 01:06:53In the testing,
  • 01:06:53but those data cannot be used for research.
  • 01:07:00Thanks, I don't know if the
  • 01:07:02audience have further question.
  • 01:07:04Maybe you can also email see how afterwards.
  • 01:07:08So we're running a little bit overtime,
  • 01:07:11but it's very. This wonderful talk.
  • 01:07:14Can we have learned so much
  • 01:07:16from Seahawks services talk?
  • 01:07:18Thank you again. Thank
  • 01:07:20you very much.