9 Equivalence Trials

9.1 Introducing equivalence

[allowframebreaks]

Equivalence trials: aim to show the therapeutic equivalence of two treatments, typically an experimental treatment to a standard.

Absolute equivalence can never be demonstrated so trialists aim to demonstrate similarity based upon limits of equivalence (a clinically irrelevant difference).

Conventional significance test have little relevance: failure to detect a difference does not imply equivalence.

Comparison with superiority trial

  • “swap the null hypothesis and the alternative”

  • “relax former null hypothesis to an interval”

Important to specify boundaries (limits of equivalence) in advance!

Why do you think this is?

9.2 Equivalence vs non-inferiority

[allowframebreaks]

The concept of similarity may be one sided or two-sided: two-sided ’equivalence’; one-sided ‘non-inferiority’.

Instead of non-inferiority, some usage of the terms

  • one-sided equivalence

  • therapeutic equivalence.

Bioequivalence trials

typically aim to show the pharmacokinetic profile of an experimental treatment is similar to a reference product.

Pharmacokinetics

study of absorption, distribution and elimination of a pharmaceutical in the body as a function of dose and time.

Often bioequivalence based upon a single-dose cross-over on healthy volunteers.

Bioequivalence studies aim to indirectly demonstrate therapeutic equivalence!

9.3 Equivalence Tests: The Interval Inclusion Principle

[allowframebreaks]

Analysis is based upon the construction of a (1-α)% confidence interval for the parameter of interest.

Suppose interest lies in parameter θ (for example θ=μT-μC).

Hypotheses:

H0:θθ1orθθ2 vs. H1:θ1<θ<θ2

Where θ1 and θ2 are the pre-specified limits of equivalence

Compute confidence interval for θ

  • lower confidence bound for θ at level 1-α, say θL(x,1-α); upper confidence bound for θ at level 1-α, say θU(x,1-α)

  • reject H0, if (θL(x,1-α),θU(x,1-α))(θ1,θ2)

  • Example asthma trial later.

9.4 Remarks on the interval inclusion principle

[allowframebreaks]

Unnumbered Figure: Link

CI (θL(x,1-α),θU(x,1-α)) has level 1-2α
However, actual significance level of test procedure α
Procedure equivalent to two one-sided tests

9.5 Two one sided tests

[allowframebreaks]

Equivalence Test: Two one-sided test to compare the difference of two Means.

Data

(independent observations!)

  • XiTN(μT,σ2),i=1,,rn

  • XiCN(μC,σ2),i=1,,(1-r)n

Hypotheses
  • H0:μT-μCθ1orμT-μCθ2  vs.
     H1:θ1<μT-μC<θ2

  • for example θ1=-D and θ2=D with D0

Test statistic
Ti=r(1-r)nX¯T-X¯C-θiS,i=1,2

with

S2=1n-2(Xij-X¯i)2
Rejection (2 one-sided tests)
T1tn-2,1-α𝐚𝐧𝐝T2tn-2,α

Confidence interval approach preferred!:

X¯T-X¯C±tn-2,1-2α.SE(X¯T-X¯C)

9.6 Example: Bioequivalence trial of “Allopurinol” compounds

[allowframebreaks]

A bioequivalence trial using a cross-over design.

References:

  • Buehrens K et al (1991) Arzneimittel-Forschung / Drug Research 41: 250-253.

  • Bock J (1998), Section 5.2.

Objective: bioequivalence of two “Allopurinol” compounds (“Cellidrin” (T) and standard drug (R))

Endpoint

  • area under the serum concentration curve (AUC) [μg/ml h]

  • (approximate) log normal distribution.

Design

  • 2×2 crossover trial

  • wash-out phase of 14 days.

Log-transform the AUC’s to achieve approximate normality working upon the log-ratio scale.

Standard equivalence margins for bioequivalence: 0.80 and 1.25 for the ratio of the population means.

Results

  • n=12 healthy males

  • mean of the difference of the logarithms: d¯=-0.0446, sd=0.1719

  • 90% CI for the mean of the difference [-0.1337; 0.0445]

  • 90% CI for the ratio on the original scale [0.875; 1.046] equivalence.

Assumptions: no period effect, no carry-over.
Comments?

9.7 Ratio of two means

[allowframebreaks]

Sometimes interest lies in the ratio of two group means.

Hypotheses

  • H0:μT/μCθ1orμT/μCθ2  vs.  H1:θ1<μT/μC<θ2

  • for example θ1=B and θ2=1/B with 0<B<1.

Sasabuchi test statistic

Ti=X¯T-θiX¯CS1-r+rθi2r(1-r)n,i=1,2.

Rejection (2 one-sided tests)

T1tn-2,1-α𝐚𝐧𝐝T2tn-2,α.

A (1-α)% confidence interval for ratio of group means is preferable.

9.8 Appropriate sample size formulae

[allowframebreaks]

Ratio

of two means (μTμC)

n1+(B2-1)rr(1-r)(Φ-1(α)+Φ-1(β))2(μT/μC-B)2σ2μc2.
Difference

of two means (μTμC)

n1r(1-r)(Φ-1(α)+Φ-1(β))2(μT-μC-D)2σ2.

For μT=μC: β:=β/2.

Reference: Kieser M, Hauschke D (1999). Journal of Biopharmaceutical Statistics 9: 641–650.
For exact sample sizes see Hauschke D, Kieser M, Diletti E, Burke M (1999). Statistics in Medicine 18: 93–105.

9.9 Example: Equivalence trial in Asthma

[allowframebreaks]

References

  • CPMP. Replacement of Chlorofluorocarbons (CFCs) in metered dose inhalation products. Note for Guidance III/5378/93; 1993.

  • Kunkel G, Schlaefke S, Koehler S. European Respiratory Journal 1999;14 (Suppl 30):105s (Abstract).

Background

  • Montreal protocol on substances that deplete the ozon layer

  • suitable alternatives to CFCs in metered-dose inhalers

  • to prove equivalence:

    • pharmaceutical equivalence: e.g. amount of active ingredient delivered and deposition of the emitted dose

    • clinical equivalence: …to demonstrate that the change …has no adverse effect on the benefit/risk ratio to the patients in comparison with the existing CFC-containing products.

Design

controlled, randomised, double-blind trial in patients with asthma.

Objective

clinical equivalence of two inhalation devices for beclomethasone

  • experimental device: new dry powder inhaler which is CFC-free

  • control device: std metered dose inhaler using CFC-containing propellants.

Endpoint
  • forced expiratory volume in one second (FEV1)

  • measured after 4, 8, and 12 weeks of treatment

  • primary endpoint: mean of these three measurements

  • ratio of means.

Equivalence margins

(0.85;1.18).

Results

90% confidence interval of (0.94;1.11) equivalence.

Sample size calculation
  • σμC=0.28, α=0.05, 1-β=0.80, and μTμC within the interval (0.96;1.05)

  • μT/μC=1: B=0.8552 (B=1.1850)

  • μT/μC=0.96: B=0.8570 (B=1.1824)

  • μT/μC=1.05: (B=0.8521) B=1.1869

70 patients per group.

Note: μT/μC=0.95 requires 84 patients!
(using exact formula: 85 patients).

Exact sample sizes with nQuery.