throbber
STATISTICAL COMMENT
`
`A Jr EXHIBIla1Og
`DCeonent1a`
`Dátlr "Rpt
`
`W WwDEPOBOOK.COM
`
`A brief note on overlapping confidence intervals
`Peter C. Austin, PhD,a'b and Janet E. flux, MD, SM, FRCP(C),'''jd Toronto, Ontario, Canada
`
`Clinical researchers frequently assess the statistical significance of the difference between two means by examining
`whether the two 95% confidence intervals overlap. The purpose of this brief communication is to illustrate that the 95%
`confidence intervals for two means can overlap and yet the two means can be statistically significantly different from one
`another at the a = 0.05 level. (J Vase Surg 2002;36:194 -5.)
`
`During seminars in which the results of clinical research
`are presented, one frequently hears the statement that
`because the 95% confidence intervals overlap, the means of
`two different groups are not statistically significantly differ-
`entfrom each other (at the a = 0.05 level). Furthermore, in
`the literature, one occasionally observes similar asser-
`tions.1'2 The purpose of this technical note is to discuss the
`relationship between confidence intervals and hypothesis
`testing and to illustrate that 95% confidence intervals can
`overlap, yet the two means can be significantly different
`from one another at the 0.05 level.
`Rosner' describes the relationship between hypothesis
`testing and confidence intervals. In testing of the null
`hypothesis that a population mean is equal to a specific fixed
`value (ie, the international normalized ratio is 1.0), the null
`hypothesis is rejected at a significance level of 0.05 if and
`only if the 95% confidence interval for the population mean
`excludes that value. One can make this assertion because
`the value under the null hypothesis is considered to be
`fixed. The only source of variability is in the estimation of
`the population mean with the sample mean.
`In testing of the null hypothesis that a mean is equal to
`a fixed quantity, the only source of variability is in the
`estimate of the sample mean. Extreme observations are
`those that lie in the extreme tails of the sampling distribu-
`tion of the sample mean under the null hypothesis. The
`probability that a sample mean would lie in the lower 2.5th
`percentile or the upper 2.5th percentile is 5 %. However,
`
`From the Institute for Clinical Evaluative Sciences'; the Departments of
`Public Health Sciencesb and Medicine,` the University of Toronto; and
`the Division of General Internal Medicine, Clinical Epidemiology Unit
`and Health Care Research Program, Sunnybrook and Women's College
`Health Silences Centre.°
`Views expressed herein are solely those of the authors and do not represent
`the views of any of the sponsoring organizations.
`Competition of interest: nil.
`Reprint requests: Peter Austin, PhD, Institute for Clinical Evaluative Sci-
`ences, G-160,2075 Bayview Ave, Toronto, Ontario, M4N 3M5, Canada
`(e -mail: peter.austin @ices.on.ca).
`Copyright C 2002 by The Society for Vascular Surgery and The American
`Association for Vascular Surgery.
`0741- 5214/2002/S35.00 + 0 24/1/125015
`doi:10.1067/mva.2002.125015
`194
`
`when one compares two means, the probability that one
`mean would lie in the upper 2.5th percentile of that means
`sampling distribution, while the other simultaneously lies
`in the lower 2.5th percentile of its sampling distribution, is
`substantially less than 5 %. Hence, despite having overlap-
`ping 95% confidence intervals, one can reject the null
`hypothesis with a Pvalue that is substantially less than .05.
`In comparison of two groups, the confidence intervals
`may overlap yet the means may be significantly different
`from one another. This fact is known in the statistical
`community4's but bears the occasional repeating within the
`medical community. Let us assume that we have two inde-
`pendent samples, each composed ofn subjects, and that we
`measure a continuous variable on each subject. For in-
`stance, we use 200 patients with diabetes receiving two
`different drug regimens with hemoglobin Arc values as the
`outcome measure. Let R1 and 72 denote the sample means
`in the first and second groups, respectively. To simplify the
`algebra, we assume a common known population variance,
`a2, in each of the two groups. We shall use formulas from
`Rosner.' For simplicity, we assume that the first mean is less
`than the second mean. Suppose the confidence intervals
`overlap, with the proportion of the overlap being p. For
`example, we use mean hemoglobin Arc of 7.4 (7.0, 7.8)
`and 8.0 (7.6, 8.4). The width of a 95% confidence interval
`is equal to 2 x 1.96 a /Vi. Then we have that
`z1 +1.96ff /Jn =R2- 1.96a /Jn +p
`X2X1.96Xa /Jn
`Rearranging to give the difference between means, we
`have that
`
`(1)
`
`(2)
`
`R2- R1= 2X1.96Xa /Jn- 2xpx1.96a /f
`
`We can now test the hypothesis that the means are
`equal in the two groups. We will compute the two- sample
`z test for independent samples with equal and known
`variances. The test statistic z is:
`;etc = (3Z2 - x1)/a J1/n + 1/n
`
`(3)
`
`JANSSEN EXHIBIT 2184
`Mylan v. Janssen IPR2016-01332
`
`

`

`JOURNAL OF VASCULAR SURGERY
`Volume 36, Number 1
`
`Austin and Hux 195
`
`P values for testing equality of two means when two
`confidence intervals overlap
`
`Percent overlap of two confidence intervals
`
`0%
`.0056
`
`5%
`.0085
`
`10%
`.0126
`
`15%
`.0185
`
`20%
`.0266
`
`25%
`.0376
`
`Above table only refers to comparisons of groups with equal sample size and
`equal variance. Variations would give different results.
`
`(4)
`
`We reject the null hypothesis of the equality of the two
`means if z,, is more than 1.96 because the probability that
`the absolute value of zms, is greater than 1 96 is .05. We can
`now insert the definition of z2 - àt from Eq 2. This results
`in a test statistic of:
`z,,,,= ax 1.96x(1 -p)
`We will reject the null hypothesis of the equality of the
`two means when ztest is larger than 1.96. This will hold as
`long as p is less than .29. Hence, as long as the two 95%
`confidence intervals overlap by less than 29%, one will reject
`the null hypothesis of the equality of the two means with a
`P value of less than .05. The previous argument can be
`easily modified to the case in which unknown population
`variances are estimated with the sample variances. In such a
`situation, depending on the sample size, the degree of
`overlap can exceed 29 %, and the two means would still be
`significantly different from one another at the .05 level. The
`Table contains several degrees of overlap and the P values
`with which one would reject the null hypothesis that the
`means of the two groups are equal, if the two 95% confi-
`dence intervals overlap. Therefore, the fact that two confi-
`dence intervals overlap does not necessarily imply that the
`two means are not significantly different from one another.
`We have shown that two 95% confidence intervals can
`overlap and yet the two means can be statistically signifi-
`cantly different from one another at the a = 0.05 level.
`Hence, one cannot use the fact that two 95% confidence
`intervals overlap as a substitute for hypothesis testing in
`
`assessing the statistical difference between two means.
`However, one can modify the previous calculations to show
`that if one constructs 83% confidence intervals, rather than
`95% confidence intervals, then if the confidence intérvals
`abur, the Pvalue associated with testing the equality of the
`two means would be approximately .05. Therefore, one can
`use the criterion of whether or not two 83% confidence
`intervals overlap as a method for assessing whether or not
`two means are significantly different from one another at
`the a = 0.05 level.
`Returning to the diabetes example, despite the 95%
`confidence intervals overlapping by 25 %, the means differ
`with P = .0376. If the confidence intervals abutted (ie,
`[7.1, 7.7] and [7.7, 8.3]), the means would differ with P =
`.0056.
`In summary, comparing two means is different than
`comparing one mean with a constant. In comparing two
`means, there is variability on both measurements of the
`means, whereas comparing a single mean with a constant
`involves only one source of variability. Two means may be
`significantly different from one another, despite the two
`confidence intervals abutting or having a modest degree of
`overlap.
`
`REFERENCES
`I. Mancuso CA, Peterson MGE, Charlson ME. Comparing discriminative
`validity between a disease- specific and a general health scale in patients
`with moderate asthma. J Clin Epidemiol 2001;54;263 -74.
`2. Sont WN, Zielinski JM, Ashmore JP, Jiang H, Krewski D, Fair ME, et
`al. First analysis of cancer incidence and occupational radiation exposure
`based on the National Dose Registry of Canada. Am J Epidemiol
`2001;153:309 -18.
`3. Rosner B. Fundamentals of biostatistics, fourth edition. Belmont, Calif:
`Duxbury Press; 1995.
`4. Goldstein H, Healy MJR. The graphical presentation of a collection of
`means. J R Stat Soc Assoc 1995;158:175 -7.
`5. Schenker N, Gentleman JF. On judging the significance of differences
`by examining the overlap between confidence intervals. Am Statistician
`2001;55:182 -6.
`
`Submitted Feb 26, 2002; accepted Mar 14, 2002.
`
`

This document is available on Docket Alarm but you must sign up to view it.


Or .

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge
throbber

Still Working On It

This document is taking longer than usual to download. This can happen if we need to contact the court directly to obtain the document and their servers are running slowly.

Give it another minute or two to complete, and then try the refresh button.

throbber

A few More Minutes ... Still Working

It can take up to 5 minutes for us to download a document if the court servers are running slowly.

Thank you for your continued patience.

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.





Document Unreadable or Corrupt

Refresh this Document
Go to the Docket

We are unable to display this document.

Refresh this Document
Go to the Docket