Learn-As-You-Go (LAGO) to Adapt the Intervention in an Ongoing Trial to Prevent Trial Failure

Name: Learn-As-You-Go (LAGO) to Adapt the Intervention in an Ongoing Trial to Prevent Trial Failure
Uploaded: 2025-01-28T18:48:16.85Z
Duration: 44 min 58 s
Description: Speaker: Judith Lok November 11, 2024

January 28, 2025

Speaker: Judith Lok

November 11, 2024

Information

ID: 12682
To Cite: DCA Citation Guide

Download Transcript

00:00Okay.
00:06This meeting is being recorded.
00:07So if you make comments,
00:08they would be here forever.
00:11So
00:12they are
00:14they'll study among statisticians
00:17and they're increasingly also applied
00:19in in public health and
00:21medical
00:23science. So the goal of
00:26of an adaptive design is
00:28to be able to modify
00:29and sometimes also stop a
00:31clinical trial
00:33after
00:34not in unplanned
00:36yeah even planned but unplanned.
00:38So the rules the rules
00:40to
00:41carry out in a depth
00:42of variable trial,
00:44they have to be specified
00:46in advance. The FDA is
00:47very clear about that if
00:48you won't register anything of
00:50your medicine
00:51with the FDA, you have
00:53to prepend everything.
00:55That doesn't mean that you
00:56have to keep the sample
00:58sizes the same and everything
00:59the same. It's just that
01:01you have to say how
01:02you will adapt it and
01:03you also, in those cases,
01:05you will probably have to,
01:07have a conversation with the
01:08FDA in advance
01:10of starting your study.
01:12So everything has to be
01:13pre planned when we adapt
01:15a clinical trial.
01:17Examples are sample size adjustments,
01:19removing treatment arms and changing
01:22the target population.
01:24But in public health intervention
01:25trials, sometimes there there there
01:27is more that people want
01:29to adapt.
01:31Let's try this out the
01:33notice.
01:47We have too much too
01:48much on the screen. And,
01:50so there is more that
01:52people want to want to
01:53adapt because in these large
01:55scale implementation trials,
01:57there's often more than one
01:59intervention. There could be
02:01a roll out of the
02:03intervention
02:03that may take one or
02:05may take three days. There
02:06could be coaching visits that
02:08that take place during over
02:10the course of time. And
02:11all these intervention
02:13components,
02:14if you would have to
02:15specify exactly how many coaching
02:17days, exactly all these things
02:19and the of these intervention
02:20components,
02:21that might just be too
02:23much to ask for. In
02:24in most of the trial,
02:25people typically have an idea
02:27about what would be the
02:28optimal intervention
02:29package,
02:30but then this this center
02:32does something else. This other
02:33thing sets about something else.
02:35And maybe over the course
02:36of time, it seems like
02:37one of the intervention
02:39components is more
02:41useful and people focus more
02:43on that. So what statisticians
02:45have often done is say,
02:47okay. If you're in an
02:47ethnic intervention about the trials
02:49ongoing,
02:50we don't wanna
02:51touch the data because that's
02:53not planned and there could
02:54be all kinds of issues
02:56with that.
02:57And Donna found out through
02:59her,
03:00sounding board people that it
03:03would be a good idea
03:04to also figure out if
03:06we can formalize
03:07this learning from prior data
03:10because that's what, LAGO is
03:12about. It's a at least
03:14two stage trial
03:16where
03:17the data from the first
03:18stage are in are analyzed,
03:21and they are used to
03:22figure out which would be
03:23a good intervention to do
03:25in the next step.
03:28So the intervention could be
03:29a treatment, a device, a
03:31new way to organize care
03:33or a combination of all
03:34that.
03:35And
03:37the Lago trial, an inherent
03:38part of a Lago trial
03:40is to optimize and prevent
03:42it. And that means maybe
03:43then you want to search
03:44an outcome or maybe you
03:45want a probability of success
03:47of at least ninety percent
03:49or maybe forty percent,
03:51Whatever but some kind of
03:52goal or maybe you want
03:54a mean blood pressure in
03:56those who have suffered from
03:59who have who have suffered
04:00from high blood pressure
04:01of say eighty
04:03or maybe nine
04:05depending on your aim. So
04:07maybe you have an idea
04:08about how you want to
04:09have the outcome
04:10and that an
04:11important part of a lab
04:13coat trial is to figure
04:14out what is the optimal
04:16way to achieve this goal
04:18where we not only look
04:20at efficacy, we also look
04:22at cost.
04:24Because,
04:24of course, if you look
04:25at efficacy, then you will
04:27just max out all the
04:28effect effective components,
04:30but then you may add
04:31end up with a implementation
04:33package
04:34that is very expensive
04:36and maybe also not very
04:37practical.
04:38So we also look at
04:40first
04:41to, to optimize here the
04:42batch.
04:47In the beta burst study,
04:50the intervention
04:51was the encouragement of the
04:52use of a checklist to
04:54improve
04:55safety of mother and the
04:57newborn
04:58child, and it goes to
05:00a WHO
05:01checklist of a lot of
05:02interventions
05:03that are known to help,
05:06prevent
05:07complications during birth, complications of
05:09the child and complications of
05:12the mother.
05:13And how they
05:15implemented this this this, enforcement
05:18of the checklist changed over
05:20a course of time. They
05:21first had a trial with
05:23with limited number of centers
05:24then then the next phase
05:25where they had they had
05:26another number of centers and
05:28then the big trial where
05:29they actually randomized the intervention.
05:33The VEDAMAR study took place
05:34in Uttar Pradesh in India
05:37and the outcomes,
05:39are the outcomes of a
05:40coaching based WHO
05:42safe burn safe childbirth
05:45checklist
05:46program
05:47in India.
05:48And this,
05:49this was published in the
05:50New England Journal of Medicine.
05:53One of the aspects of
05:54the better bird trial, it
05:55did provide useful information on
05:58how to improve
06:00process outcomes,
06:01but it failed to show
06:03a significant result on the
06:05on the primary outcome.
06:07So, technically, it was a
06:09failed trial, although still,
06:11they found effects on process
06:13outcomes.
06:15Perfect.
06:16So just a nine to
06:17twenty graph, did this check
06:18this is for,
06:20practitioners that are involved with
06:22Yeah. Well, I think it's
06:23just wash hands. One of
06:25the things is,
06:27make sure that the mother
06:29gets oxytocin
06:31during delivery. That's during the
06:33Very,
06:34very specific things that happened
06:36around
06:37In and around childhood. There's,
06:39like, I think twenty seven
06:40different
06:41recommended things that should be
06:42done
06:44before, during, and right after
06:46your childbirth, and most of
06:47them are very low tech.
06:50That works in hand.
06:52That's simple.
06:54But the third time not
06:55everybody does that.
06:59We looked at package components
07:01intervention, launch units, the number
07:03of coaching visits, and a
07:05data feedback form
07:07And, so what we are
07:09going to do is figure
07:10out how we could have
07:11used Laval in order to
07:13improve
07:14the outcomes for the better
07:16start.
07:20Judith, some of the people
07:21online said that if you
07:23move away from, I guess,
07:24the computers where the mic
07:26is, they're having trouble hearing
07:27you. Oh, okay. So it's
07:29a little bit weird.
07:31And I think the AV
07:32guys coming back can help
07:34too. There's also a coating
07:35that you'd be more comfortable
07:36with. Would that be easier
07:37for you? Yes. Let's do
07:39that. Let me load. Sure.
07:45While while you're doing that,
07:47before you even talk about
07:49including the outcomes,
07:50what were the outcomes?
07:53So they looked at
07:56The primary outcome of the
07:58study was a combined outcome
08:00of maternal mortality and neonatal
08:02mortality.
08:04And it's a huge trial,
08:05like, you know, something like
08:06over a hundred and fifty
08:08thousand
08:08births or something like that.
08:10Yeah. We've heard six hundred.
08:12Based on the way they've
08:13done it done the trial?
08:15Yeah. Yeah. Exactly. Exactly.
08:18These colleagues are all evidence
08:19based interventions
08:21that are routinely all done
08:23in high income countries where
08:24we have very, very low,
08:27maternal mortality rates and neonatal
08:29mortality rates. And those are
08:31all not in the US.
08:32We have high rates.
08:35So it couldn't be down
08:36the US because we have
08:37high rates. We have high
08:38rates. Yeah. We have high
08:39rates higher than in the
08:40Dallas. Very common. Excuse me.
08:42That's right. But it is
08:43very it's very this this
08:45terrible. Somebody like me is
08:46not at risk because I
08:47go to Brigham and Women's
08:48Hospital.
08:49Other people,
08:51oh, yeah. Other other
08:53people,
08:55are not affected.
08:56You can get lucky you
08:57can get notified.
09:00It's a problem.
09:04So this randomized leopovirus had
09:06thirty six or more patients
09:08and still on the back
09:09primary outcome, they didn't have
09:10a significant result.
09:13So,
09:14the Golub,
09:15Lago study is now how
09:17the probability of success. For
09:19example, the percentage of
09:21developed birth checklist is used
09:23depends on the intervention package
09:25x. Let's say x consists
09:27of both the launch duration
09:28and the number of coaching
09:30visits. This will be, considered
09:32in the middle of study
09:33because there were more intervention
09:35components,
09:36but they were highly coordinated.
09:38So we looked only at
09:39these two.
09:41For example, finding the intervention
09:42package xopt
09:44so that, it solves that
09:46the probability of success
09:48is at least zero point
09:49nine
09:50by minimizing the cost of
09:52x
09:53and subject to natural constraints
09:55on x. When you look
09:56at launch duration, you can
09:58probably imagine that it's probably
09:59not a good idea to
10:00have a launch duration of
10:02more than three or four
10:03days.
10:06And there's also a minimum
10:07because we cannot be we
10:08we we're assuming that at
10:10least the intervention needs to
10:11be rolled out so it
10:12would be considered a launch
10:14duration also of at least
10:15one day.
10:18In March, Exoq minimizes the
10:20cost of the intervention package
10:21while keeping the probability of
10:23success to be at least
10:24ninety percent.
10:26In
10:28the present of center effects,
10:30one could look at the
10:31average log or the centers
10:33of this probability of success
10:34and that's what we have
10:35been doing in the POLISA
10:37study.
10:38So the POLISA study is
10:39currently ongoing. I cannot show
10:41you results. That's why I
10:42don't have it on the
10:43slides. But the POLISA study
10:45is one where we are
10:46currently doing level optimization.
10:48So we are looking at
10:49this, at this particular
10:52outcome goal, we call it.
10:55So in MERGE, actual minimizes
10:57the cost of the intervention
10:58package while keeping the average
11:00probability of success to be
11:01at least ninety percent.
11:06So the level, the learn
11:07as you go aspect, we
11:08will start with two stages.
11:10Like, the level can also
11:12be done for more than
11:13two stages,
11:14But the basic things of
11:15the how we roll it
11:17out and what are the
11:18things that we do is
11:20better explained to the only
11:21two stages.
11:23And we consider the stage
11:25the situation where the stage
11:27one attaches
11:28determining amount because what capability
11:30you have to play your
11:31trial. And then what happens
11:33in the first stage happens.
11:35In the second stage, we
11:36will calculate a re a
11:38recommended intervention
11:40to use in the next
11:41stage of the trial. And
11:43that recommended
11:44intervention typically
11:45depends on the stage one
11:47data. So what you get
11:48is a dependence between the
11:50stage one data and the
11:51stage two data
11:53because the recommended intervention is
11:55learned
11:56from the stage one data.
11:59And the typical reason for
12:00wanting to learn as you
12:02go is, okay, we are
12:03in this trial and if
12:05it's broken, let's try to
12:07fix it. So Donna found
12:09out through her study group
12:10that it is also actually
12:12happening in the practice
12:13that people do want to
12:14learn. They see that their
12:16trial is not working and
12:17they do want to adapt
12:19the intervention to keep to
12:20be able to keep going,
12:22hoping for trial success.
12:26So when we learn as
12:28we go and we have
12:30a stage two intervention that
12:32depends on the stage one
12:34data, then we lose one
12:36important
12:37tool that we have in
12:38statistics
12:39and that is the independence
12:41of data across centers, the
12:43independence
12:44of data,
12:46between stages.
12:48Because
12:49the outcomes in the second
12:50stage are going to depend
12:52on the outcomes in the
12:53first stage,
12:55especially if the treatment has
12:56any effect
12:58because the learning
12:59of the recommended intervention
13:02depends on the first stage.
13:04So the usual techniques that
13:06you use to prove consistency
13:08and asymptotic normality if your
13:10estimators feel
13:12miserably in this case because
13:14there's dependence between the stages.
13:16And that means that also
13:17between the different centers, even
13:19if they
13:20if they're
13:22having independent patients,
13:24their dates are going to
13:25be dependent because their interventions
13:27are going to be dependent.
13:32Here we have
13:34a long list of things.
13:36So we do two stages
13:37for now and no early
13:38stopping that we can also
13:39relax.
13:40Penny will be the notation
13:42for stage.
13:44We are assuming the same
13:45number of patients in both
13:46stages for now. This can
13:48be relaxed. It's really not
13:49big deal.
13:50X one is the recommended
13:52intervention
13:53at stage one. It's free
13:55trial, so I denote it
13:56with a little x. And
13:58x two n is the
13:59recommended multivariate intervention at stage
14:02two. And we are assuming
14:03that it's a bounded random
14:05variable that is learned from
14:06the stage one data. So
14:08that's also the reason that
14:09it's x two n. So
14:11x one doesn't need an
14:12n because it's free trial,
14:14but x two n depends
14:15on how many patients are
14:17there in stage one and
14:19who is in stage one
14:20and what were their outcomes.
14:21So in order to, stress
14:23that, we have x two
14:25m.
14:27Then we are assuming that
14:28maybe in these trials, not
14:31everybody does ex every center
14:33does exactly as the recommended
14:35intervention is recommended.
14:37So we have a different
14:38notation, a j k n,
14:40for the actual intervention in
14:42center or side j of
14:43stage k.
14:45And we are assuming that
14:47it depends on maybe on
14:48the recommended intervention, hopefully on
14:50the recommended intervention,
14:52but not further on prior
14:54data. And that means that
14:55learning is done
14:57through the recommended intervention and
14:59that's it, not also through
15:01other means. This is a
15:03restriction that we need to
15:04ensure the validity
15:06of the conference intervals and
15:08the estimators.
15:11Then we have y for
15:13outcomes. Y k n is
15:15the outcomes of the n
15:16participants
15:17in stage k.
15:19And their binary outcomes in
15:21this case, I can do
15:22it also for continuous, but
15:24let's set the the stage
15:25for binary outcomes.
15:26And,
15:27y I k n is
15:28the outcome of participant I
15:31in stage k.
15:34That means that y I
15:35one n and y I
15:36two n denote outcomes of
15:38two different participants.
15:40This comes from stage one,
15:42and this is a participant
15:43in stage two.
15:46The n superscript is needed
15:47in stages after stage one
15:49because the intervention
15:50in those stages depends on
15:52the prior outcomes from stage
15:54one,
15:57under the councilman. Luke. Yeah.
15:58Just wanna,
15:59clarification.
16:01Stop if this is a
16:02silly question. But you said
16:03that we're assuming that we're
16:04only learning from the intervention,
16:07I think you meant, in
16:08this study. But I think
16:09that, often we might be
16:11learning
16:12at the same time from
16:13other studies that are not
16:14involved. Is that, a problem?
16:17So this might be a
16:17problem for the FDA
16:19because they want things preplanned.
16:22It might not be as
16:23much of a problem in
16:25other settings where the the
16:27important thing with LAGO is
16:29that we can somehow assume
16:31that the intervention,
16:33in the second stage
16:35converges in probability.
16:38So
16:41if we look at the
16:42stage one
16:43outcomes, we can learn from
16:45an MLE. We can learn
16:47from averages. We can learn
16:48from average purchase outcomes. Doesn't
16:50need to be the bias.
16:51It can be something else,
16:53but it needs to converge
16:54in probability.
16:55So if you have these
16:56outside things happening,
16:58we also have to at
16:59least conceptualize
17:00that they could converge to
17:01something.
17:10So the recommended intervention x
17:11two n is random determined
17:13by some function of the
17:14prior data. It depends on
17:16what researchers decide upon looking
17:18at the stage one results.
17:20And if at all possible,
17:21it should be preplanned.
17:23And that means in terms
17:24of random variables,
17:26x two n is not
17:28independent of the stage one
17:29data, and so outcomes in
17:31different stages are going to
17:33be dependent.
17:36We are now going to
17:37assume that x two and
17:38the recommended intervention depends deterministically
17:41on the stage one data
17:42in a prespecified
17:44rule,
17:45in order to not jeopardize
17:46the validity of the trial.
17:48For example, the decision goal
17:50could be based based, as
17:51I said, about means and
17:53averages,
17:54in the entire sample
17:56in a smooth and differentiable
17:58way. The decision rule should
18:00be based on an MLE
18:01calculated from the stage one
18:03data in a smooth
18:04differentiable data would be the
18:05other option.
18:09We are looking at asymptotic
18:11inference. In the meta birth
18:12study, that is not a
18:13big deal. In the POLISA
18:14trial, that is also not
18:15a big deal because in
18:16typically, these are large scale
18:18intervention trials,
18:20implementation trials.
18:22And we assume that the
18:23number of observations in each
18:25of the stages gets large
18:26at the same rate.
18:30So the main,
18:32the main assumption in LAGO
18:33is that given the recommended
18:35intervention, the stage two data
18:37are independent of the stage
18:38one data. So conditional on
18:40the recommended state intervention for
18:42stage two,
18:43the treatment and the outcomes
18:45in stage
18:47two are independent of the
18:48stage one data, both the
18:50treatment and the outcomes.
18:53So learning is only through
18:55the recommended intervention.
18:59And then the second assumption
19:01is that the recommended intervention
19:02converges in probability to a
19:04constant as the number of
19:05observations
19:06in stage one goes to
19:08infinity.
19:11The actual intervention
19:13is assumed to be a
19:15fixed
19:16function,
19:17maybe center specific,
19:19of the recommended intervention,
19:21and this makes sure that
19:24this function is also deterministically
19:26continuous and bounded
19:28that the actual intervention in
19:29the center also converges in
19:32probability to something fixed. And
19:34that's something that we really
19:35use in their proofs.
19:39The
19:40binary outcome label typically assumes
19:43a logistic regression for the
19:44outcome given the intervention.
19:47So if this is the
19:48intervention in center j,
19:50then an outcome of a
19:51patient in center j follows
19:53a logistic regression model with
19:56parameters that depend that that
19:58determine how
20:00the
20:01probability of success
20:02depends on the different intervention
20:04components.
20:06And then this case, the
20:08outcome goal would be that
20:10the,
20:12average probability of success,
20:15that is is,
20:17at least
20:18point nine where we look
20:20at the probability of set
20:22of success
20:23under the estimated beta, where
20:25beta is estimated based on
20:27the stage one data that
20:28are known
20:29at the time that this
20:30level optimization
20:31takes place.
20:33And this while minimizing the
20:35cost. If it was not
20:36minimizing the cost, we would
20:38not find a unique solution.
20:39We would just max out
20:40all the effective components, and
20:42that is not, believable.
20:47So stage one, we have
20:48a recommending settling intervention, collects
20:51outcomes, recommend intervention. We we
20:53calculate that. That's what we
20:55are currently doing for POLISA
20:57and then collects outcomes.
20:59And then we are going
21:00to estimate
21:02the parameters in this logistic
21:04regression model
21:06in order to figure out
21:07at the end of the
21:08study what is a good
21:11intervention or the optimal intervention,
21:13so the one that has
21:14success probability of at least
21:16ninety
21:17percent by minimizing costs.
21:20And we would do that
21:21final estimate of beta or
21:24the and and the final
21:25optimization
21:26based on the data from
21:27all stages combined.
21:29And this is different from
21:30a design that is called
21:32MOST,
21:33where they optimize the intervention
21:34in the first stage
21:36and then
21:37implement it in the second
21:38stage and then only the
21:39estimate use the second
21:42stage data to look at
21:44the effect of the intervention.
21:45In Lago,
21:46the idea is to use
21:47all data
21:48combined
21:49at the final analysis to
21:51estimate this logistic regression model
21:53and also to determine the
21:55optimal intervention.
22:00Here, I'm going to go
22:01fast
22:02because I think that is
22:03more for tomorrow.
22:04We have a likelihood here.
22:05So we have the probability
22:07of the first
22:09data by one, second stage,
22:12outcomes by two, and then
22:14the both interventions.
22:16And this is a fixed,
22:19intervention because it's pretrial determined.
22:21And this is the beta
22:22this is the beta that
22:23comes from.
22:26Where is
22:27where is my beta? Here's
22:29my beta.
22:30That beta.
22:34So first, the actual interventions
22:36come based on x one,
22:38and we're assuming that, it's
22:40not depending on beta, so
22:41this this this goes out.
22:44And then we have the
22:45first stage outcomes. They depend
22:47on beta.
22:49And then we have the
22:52the second stage interventions.
22:55They depend on the prior
22:56outcomes, but they typically don't
22:58depend on beta. And then
23:00you get the second stage
23:01outcomes and those do the.
23:04If you look at the
23:05MLE or whatever you wanna
23:07call this, then it will
23:08solve the same score equations
23:11that you would solve
23:12had the intervention not been
23:14learned.
23:15So this would be maybe
23:16the end of story
23:17if it wasn't the case
23:19that, okay, this tells the
23:20same score equations, that's great.
23:23But these score equations
23:24are highly
23:25unusual
23:27because what would usually happen
23:29is that these two terms
23:30are independent. They involve different
23:32patients who are treated in
23:34different ways, so why bother?
23:36But we bother because these
23:38two terms are actually dependent
23:40and this term is highly
23:42dependent on what happens here
23:44through the learning of the
23:45intervention.
23:50So, we can show that
23:51they're unbiased estimating equations. That's
23:53not a big deal. But
23:55then,
23:56consistently
23:57consistency doesn't immediately follow
24:00from theory about,
24:02estimating equations because that's mostly
24:04on IID observations.
24:06So we had to use
24:07tricks.
24:08If you wanna know those
24:09tricks, you have to come
24:11to the talk
24:12of the of the of
24:14Tuna's course tomorrow because then
24:15I will show how that
24:16works.
24:19As I told you, normality,
24:21so
24:21I think you have seen
24:22things like this if you
24:23are a little bit, familiar
24:25with
24:26the estimating equations.
24:28You would solve the score
24:29equations. So you set the
24:31derivative of the log likelihood
24:32to zero. So you solve
24:34the score equations, and then
24:35you do this standard Taylor
24:37expansion
24:37around beta star.
24:40And you'll find that, okay.
24:41This is great. So I
24:42can put this to the
24:43other side, and then I
24:45divide by the inverse of
24:46this thing, and it will
24:47all be fine.
24:48The problem is going to
24:50be that even though this
24:52could be fine,
24:54this is not the sum
24:55of IID terms.
24:57So even here, we are
24:59not at home yet because
25:01typically you would say this
25:02goes nicely to a normal
25:03random variable because it's IID
25:05terms. No. They're not IID
25:08terms. There's the two stages.
25:10So what we did
25:11was do a trick.
25:14We have been doing we
25:15have been replacing the outcomes
25:18in the second stage
25:20by outcomes that are coupled
25:22with the fixed design.
25:24And the fixed design is
25:26where the limiting you have
25:28this this this
25:29stage two interventions.
25:31We have been assuming through
25:33different stages
25:34that the this this,
25:37this intervention in the second
25:39stage conversion probability
25:41to a constant intervention. Let's
25:43call it little a.
25:44Under the little a intervention,
25:47we have that's a fixed
25:48intervention, so we lose all
25:50these dependencies.
25:52But we don't have data
25:53from the limiting intervention. We
25:55have from the actual intervention.
25:56So what we don't really
25:58couple the outcomes.
25:59And,
26:00you have all probably, if
26:02you have that one simulation,
26:03you have probably a couple
26:04of outcomes.
26:05Because what you do is
26:07you have these these outcomes
26:09that have similar probabilities.
26:11You can say, okay. The
26:12probability is a little bit
26:13higher, a little bit lower,
26:14but not too much difference.
26:16We can make these variables
26:18almost the same
26:20by using the same uniform
26:23to decide on success versus
26:24failure
26:25because you can have a
26:26success probability that depends on
26:29a uniform. If you have
26:30p is, p is point
26:31six, you draw a uniform,
26:33you say, okay, if it's
26:34less than point six, it's
26:35a success. If it's over
26:36point six, it's a it's
26:38a failure. Right?
26:39Now you have a different
26:40probability,
26:42but it's similar. If you
26:43use the same uniform,
26:45these outcomes are going to
26:46be almost the same.
26:47And the beauty of asymptotic
26:49normality is that you don't
26:51need to show
26:52that the original converges
26:54except because
26:56asymptotic normality is a distributional
26:58result.
26:59So you can replace
27:01these outcomes that you have
27:03under the actual design by
27:06outcomes that are coupled
27:08with the limiting
27:09design without changing the distribution
27:12of what you have.
27:14It's a trick, just a
27:15dirty trick.
27:18So
27:19you change them you the
27:20you change
27:22the
27:23things that you have by
27:24something that has the same
27:26distribution, and that doesn't affect
27:28the limiting results. So that's
27:30what we did. If you
27:30wanna know more about that,
27:32again, you can come tomorrow.
27:35K.
27:36Twenty nine.
27:40Because of the asymptotic normality,
27:42Michelle, the Volta likelihood ratio
27:44test for beta one is
27:45beta one star are asymptotically
27:47valid for any beta one
27:49star.
27:50And now typically you are
27:52interested not so much in
27:53any beta one star,
27:55because typically what you want
27:57to do is test the
27:58null hypothesis,
27:59maybe of no effect of
28:01any of the intervention
28:03components.
28:04So if you want to
28:05test the null hypothesis of
28:06none of the intervention components,
28:09there is a trick again.
28:11Because if none of the
28:13components
28:14has any effect on the
28:15outcome,
28:16all this learning is pretty
28:18irrelevant.
28:19And so what we did
28:20is figuring out, okay, we
28:22want to test the null
28:24and it has to remain
28:25a valid test under the
28:27null.
28:28What it does on the
28:29alternative,
28:30we hope it goes well,
28:31but we don't need to
28:32prove anything.
28:34We just have to make
28:35sure that under the now,
28:36this is a valid test
28:38with the right correct type
28:39one error.
28:41Now if there's none of
28:42the intervention components has any
28:44effect,
28:45it doesn't matter how you
28:46learn
28:47because you won't have an
28:48effect anyway. So you can
28:50learn, and you don't even
28:52need to necessarily restrict to
28:54this learning on averages and
28:55MLEs. You could even learn
28:57from donors' results in the
28:59trial.
29:00Yes. This may be a
29:01really dumb question.
29:03Totally. But
29:04but it it it it
29:05seems to be the assumption
29:06you're making is that each
29:07component of the intervention is
29:09independent of each other.
29:12No. I'm not making that
29:13assumption. I just the null
29:15hypothesis that I'm considering
29:17is that none of the
29:18intervention components has any effect.
29:20So it is a very
29:21sharp null.
29:23Not sharp, but but it's
29:25a very You're doing a
29:26trial of all the components
29:27together and then knowing each
29:29individual component
29:31isn't, for instance, one in
29:32one positive, one is it
29:33canceling out a negative?
29:35So that could happen,
29:37and, that's not our null
29:38hypothesis. Our null hypothesis is
29:40really that none of the
29:41intervention components has an effect
29:42and not that they cancel
29:43out. The null hypothesis that
29:45we're looking at is really
29:46none of the components has
29:47an effect.
29:48But you can't can't learn
29:50that from the data.
29:52No. But you can test
29:53it.
29:55You cannot learn it from
29:56the data. You can get
29:57an idea about whether it
29:58happens because not only are
30:00we testing, we are also
30:01estimating the effect of all
30:03the intervention components.
30:07But
30:08the the it it is
30:09a restriction that we are
30:11looking at the null of
30:12no effect of any of
30:13the intervention components because this
30:15might not be true or
30:17it's maybe they can come
30:18out. So it's a limited
30:19case. It's a limited case.
30:21Yes.
30:25But under the law on
30:26under this now, you can
30:27learn as you want, not
30:29learn.
30:30Because because none of none
30:31of what you're doing is
30:32has an effect on the
30:33distribution of the outcome. So
30:35it may affect the outcome,
30:36but not the distribution of
30:37the outcome.
30:42They have been lately also
30:44focusing on not only an
30:46outcome goal, but also a
30:48power goal.
30:49And a power goal is
30:50where after stage one, we
30:52sit around and we try
30:54to figure out, okay, what
30:55do we think? How do
30:56we think the trial is
30:57going?
30:58Are we achieve achieving the
31:00power
31:01that,
31:02that we hoped for?
31:04So there's two ways to
31:05do this. We can look
31:06at conditional or unconditional
31:08power.
31:09At first, I thought we
31:10should just be looking at
31:12unconditional power. But in conversations
31:14with Donna, we decided we
31:15also have to look at
31:16unconditional
31:17power. And this turns out
31:19to be important, especially
31:21in cases
31:22where the where the power
31:23is not very high.
31:25If the power is is
31:27is is pretty high, when
31:29the conditional power works full,
31:32if the
31:33overall power is not very
31:35high, it may be better
31:36to look at unconditional power.
31:38So what is the difference?
31:40Unconditional
31:40power
31:41ignores that we have already,
31:45collected the stage one data,
31:46and those will be the
31:48data that they'll be using
31:50at the test.
31:51So they consider more variation
31:53in the numerator of the
31:54test statistic. Because think of
31:56this, if you look at
31:57at at, at
32:00the test at the end
32:01of the study, they compare
32:02the treated and the untreated.
32:03And maybe just that the
32:05two sample t test with
32:06a variant studies like p
32:08one one minus p one
32:09p zero one minus p
32:10dot p zero with some
32:12n's and square roots and
32:13stuff on the board.
32:14So when you look at
32:16this, you have this p
32:17one hat minus p zero
32:19hats in the numerator.
32:20And part of this p
32:22one hats and part of
32:23this p zero hats at
32:24the end of stage two
32:25we have,
32:26we have the number of
32:28successes in the stage one
32:29trial in the stage one,
32:31both in the intervention and
32:33the control group.
32:34When we look at unconditional
32:36power, we still consider variation
32:39in stage one and stage
32:40two. When we look at
32:42conditional power, we only consider
32:44the stage two variation.
32:46So this indicates that if
32:48we look at unconditional
32:50power, we recommend
32:52a more effective intervention
32:55than if we look at
32:56an at conditional power where
32:58we can do with a
32:59little bit less
33:01effective intervention.
33:03So you see that if
33:04there is cases where we
33:05cannot really reach the uncondition
33:07reach the conditional power in
33:09many of the in many
33:10of the trials,
33:12then it's good to have
33:13the unconditional
33:14power.
33:15So based on the uncondition
33:16based on the
33:18power constraints,
33:19we can estimate what is
33:21the power of the test
33:22at the end of the
33:23study.
33:24So at at stage one,
33:25you could do two things.
33:26You can estimate you can
33:28work with the
33:30outcome goal,
33:31and you can work with
33:32a power goal.
33:34So you can try to
33:35have them both be okay.
33:37So maybe a power of
33:38eighty percent
33:40and maybe an outcome
33:42goal of ninety percent. And
33:43these are two constraints,
33:45and you can,
33:47minimize
33:48the cost
33:49by the
33:51by the recommended
33:53intervention.
33:55So,
33:56adheres to both constraints, so
33:58the power goal and the
34:00outcome goal.
34:01And with these two goals,
34:03you're still adapting only the
34:05intervention, not the sample size?
34:07No. Not the sample size.
34:08We're for now, we're working
34:09with sample size fix. It's
34:11actually an interesting idea to
34:12also work on sample size
34:14adjustments for for a level.
34:16But I think they're not
34:17there yet because we have,
34:19repeated outcomes to work on
34:21and confounding by indications work.
34:23Right. So, essentially, you're just
34:25an intervention,
34:26not only to reach that
34:27outcome goal, but also there'll
34:29be different interventions that have
34:31more like, you could achieve
34:32more or less power, so
34:34you're also
34:35changing the Too long strange.
34:37Okay. We we we want
34:38an intervention
34:40that hopefully
34:41gets us the outcome goal
34:43and the power goal.
34:45And And all the models.
34:47On those among those two
34:49goals, the intervention that satisfy
34:52both goals
34:53estimated based on the stage
34:54found data. Among those, we
34:56choose the one that is
34:57least cost.
35:00Yes. Thank you.
35:04Simulation study. How about they
35:06stick that, Dona?
35:07That sounds like a good
35:08idea. We can go over
35:09tomorrow in class. Yeah. Which
35:11by the way,
35:12I told people on chat,
35:14but Judith will give a
35:15more detailed technical
35:17lecture tomorrow, two hour lecture
35:19in my course
35:21in
35:22room one zero two at
35:23sixty Public Street.
35:25And you're all welcome to
35:26join that as well if
35:27you wanna hear more about
35:28Mago and learn a bit
35:30more in in fact the
35:30mathematical details.
35:33And I can get you
35:34that information if you email
35:35me on me at j
35:36j dot com.
35:38Or we can and you
35:39can ask anyone who also
35:40knows. Sally or me.
35:47Oh, cool.
35:48Yeah. So,
35:50what we have we can
35:52do, we can use the
35:53resulting estimates and confidence intervals
35:55for the beta using the
35:56coupling argument. We can find
35:58asymptotic normality.
35:59We can also find formulas
36:01for the confidence intervals.
36:03We can use that to,
36:04create estimates and confidence regions
36:07for the optimal intervention
36:09x opt.
36:10And then we could, maybe
36:12keep only package configurations
36:14for which the confidence interval
36:14for the
36:17success probability
36:18contains zero point nine to
36:20create a confidence set
36:22for x points.
36:25They can also,
36:26do a confidence band for
36:28the probability
36:29of success based on beta
36:32hat, since Jaffe's,
36:34theorem.
36:35This is something that, Daniel
36:36was able to figure out.
36:37Once we have a subdoted
36:39normality, we can use standard
36:40results,
36:41and that gives us an
36:42option to create confidence bands
36:45for the probability of success
36:47under any possible
36:49configuration of the inter implementation
36:52package.
36:52And then we can also
36:53get a no confidence interval
36:55for success probability on the
36:57of x opt
36:58that we estimated because we
37:00can just look at the
37:01confidence bands at x opt.
37:04It's a little bit conservative,
37:05but it works.
37:10Okay.
37:12Thirty nine.
37:13No. Not thirty nine. Ah,
37:14here. Yeah.
37:16The better birth starts. We
37:18looked at one outcome. It
37:19is a process outcome. We
37:20looked at our satoshian administration
37:23immediately
37:23after delivery.
37:25This is recommended by the
37:26WHO.
37:31The intervention components we considered
37:33were the number of coaching
37:34visits and the launch duration,
37:36and we used the center
37:37characteristics. I didn't this would
37:39be one that we also
37:41included in the logistic regression
37:43model. It was the monthly
37:44birth birth volume because it
37:46was highly,
37:48highly variable among the centers
37:50in bed of birth.
37:52And then you can get
37:54the, confidence intervals and the
37:56estimate decides this is the
37:58beta zero hat, beta one
38:00hat, beta two hat.
38:02After stage one, you could
38:03get an estimate of the
38:05optimal intervention,
38:07and and the recommended intervention
38:09in this case would have
38:09been
38:10one,
38:11long day of lunch duration
38:13and five coaching visits.
38:16And then after stage two,
38:17the recommended intervention would be
38:19three lunch duration
38:21and only one coaching visit.
38:23And after the study,
38:26one could estimate all the
38:28in the effect of all
38:29the of of the both
38:30intervention components
38:32and also the recommended optimal
38:34intervention.
38:40The ninety five percent confidence
38:42set for the optimal intervention
38:43over the grid of x,
38:45we looked at,
38:46all possible numbers of coaching
38:48visits that can only be
38:49half because you have a
38:50coaching visit or you don't
38:51have it. Launchuation, you could
38:53think of having it one
38:54or one point five days.
38:56And of the three hundred
38:57and sixty potential intervention package,
39:00we included only ten point
39:02five percent in the confidence
39:04set.
39:06And they had different,
39:08different costs.
39:13We also created,
39:17I think it's right here.
39:19Sorry.
39:20We also created these conference
39:22bands for the probability of
39:24success under the different intervention
39:26component compositions. So you see
39:28the first component here and
39:30the second component here, and
39:32you can visualize this. This
39:33is pictures that Daniel never
39:35was able to create in
39:37art.
39:42And now I'm going.
39:44So the adaptive learn as
39:45you go, Labo design aims
39:47to find the optimal intervention
39:49composition
39:50and its effect.
39:52There's a dependence between stage
39:53one and stage two data,
39:55which makes it really nice
39:56for me to work on
39:57this because I am also
39:58a medical statistic from time
40:00to time. I love doing
40:01those math things. We use
40:03the coupling argument to provide
40:05theoretical
40:06justification for using existing software
40:08for estimation and inference because
40:10these estimating equations that I
40:12showed you are actually really
40:14just the usual estimating equations
40:16that logistic regression solves if
40:18you run
40:19like a PROC LOGISTIC or,
40:22or or,
40:23geom with a logic.
40:25It provided a confidence set
40:27for the optimal intervention and
40:29confidence band for the probabilities
40:31under any,
40:33package composition.
40:36Extensions, these are things we
40:38have done and I haven't
40:39shown.
40:40Replace n is n one
40:42and n two with n
40:44one and n two that
40:45go, at the same rate.
40:48Include center specific covariates
40:50in the regression in the
40:51logistic regression model,
40:53including more than two stages.
40:56And, also, we worked on
40:57LaVu for continuous outcomes. This
40:59is work with my PhD
41:01student, advisee, Amta Bing. I
41:03think he's home live if
41:04you want to ask him
41:05about details.
41:06So we looked at continuous
41:08outcomes,
41:09and we looked not only
41:10at a linear regression. So
41:12with a linear link, we
41:13also looked at different link
41:15functions.
41:16What we did assume so
41:17far is that all the
41:18errors are identically
41:20distributed.
41:22And the way he did
41:23this was to look at
41:25replacing
41:26the errors under the
41:28actual intervention
41:30by errors under the under
41:32the limiting
41:33intervention
41:34without changing the distribution. So
41:36it's a little bit like
41:36coupling,
41:37except for coupling, you cannot
41:39really copy them over because
41:40they have different
41:42different success probabilities.
41:43But here, we are showing
41:44that the errors are iid,
41:45so we can re replace
41:47one by the other. The
41:49other thing that he needed
41:50to do was, use, theory
41:52from,
41:53from empirical purposes.
42:00No. But I do.
42:04I'm not fully loading this
42:05morning.
42:06Let me go fast.
42:21Okay.
42:23Then another thing,
42:25the power constraints, this is
42:27still really work in progress.
42:31We're almost there. I expect
42:32that we can
42:34we can send the one
42:36liter power constraints
42:38within a month or so.
42:39Okay?
42:42There's future research.
42:45Xin Yu here is still
42:46is working on,
42:48including random center effects. This
42:50is joint work, Jun Jin
42:51Yu Yu, Donna Spiegelman and
42:53Xin Xu Zhou. I'm also
42:56involved. I should have put
42:57that. Oh, that's your joint
42:59work business. Xin Yu is
43:00leading this.
43:02Allow centers to contribute to
43:04more than one stage. I
43:05think that we are there
43:06now.
43:07Study design, consider help additional
43:10ways to choose the recommended
43:12intervention,
43:13but he left. So
43:15I don't remember his name,
43:16but I also would look
43:18look at.
43:19So
43:21far, we have done worked
43:22with a power goal and
43:24an outcome sorry, power goal
43:26and outcome goal, but there
43:28are other ways to do
43:29it. It could be based
43:29on process outcomes. I would
43:31be curious to see that
43:33implemented in practice.
43:35Replace the logistic regression models
43:37for the outcome with a
43:38more flexible model. One can
43:39think of a probate model.
43:41We didn't work with that.
43:42Probably that's not a big
43:43deal, but it would be
43:44interesting.
43:45Create an R package for
43:46design. This is joint work
43:48with Hunter Bing. He is
43:49at Boston University and on
43:51the Spiegelman here.
43:53Hunter has a package working
43:55for the better birth, both
43:56for continuous outcome and for,
43:59for a binary outcome, both
44:01with a logic link, but
44:02he's, extending it so that
44:04hopefully we can also use
44:06it for POLISA in the
44:07future and then other people
44:09can use it for different
44:10studies.
44:12Consider confounding by indication. This
44:14is joint work with Minh
44:16Bui,
44:17who is currently also online.
44:19So she is in, at
44:20Boston University.
44:22What if a center adopts
44:24the intervention?
44:25How they do that correlates
44:27with their potential outcomes? Maybe
44:29a very successful center Yeah.
44:31Will have a a higher
44:32intervention component on physician
44:35than a not as successful
44:36center. So we are considering
44:38this now.
44:39It seems like
44:41it's a permitting approach to
44:42include fixed center effects
44:45because the fixed center effects
44:46could pick up the effects
44:49of this successful center.
44:54And,
44:55I will
44:56answer questions.