澳洲幸运5官方开奖结果体彩网

T-Test: What It Is With Multiple Formulas and When to Use Them

This calcu💧lation can be used for hypothesis testing in statistics

Definition

A t-test measures the difference between the means 🔴of two samples.

What Is a T-Test?

A t-test is an inferential 澳洲幸运5官方开奖结果体彩网:statistical test used to determine if there is a s🎶ignificant difference between the means of ജtwo groups and how they are related.

T-tests are used when the data sets follow a normal distribution and have unknown variances, like the data set recorded from flip🔯ping a coin 1꧂00 times.

The t-test assists in hypothesis testing in statistics and uses the t-statistic, the 澳洲幸运5官方开奖结果体彩网:t-distribution values, and the degrees of fಞreedom to determine statistical significance.

Key Takeaways

  • A t-test can shed light on a statistically significant difference between the means of two data sets.
  • It is used for hypothesis testing in statistics.
  • Calculating a t-test requires the difference between the mean values from each data set, the standard deviation of each group, and the number of data values.
  • T-tests can be dependent or independent.
T-Test: A statistical test used to compare the means of two groups of data.

Investopedia / Sabrina Jiang

Understanding the T-Test

A t-test ♐compares the mean valu♈es of two samples to determine a statistically significant difference.

For example, the grades of students from a physics class and those of a different group of students from a writing class 🍌would 🦹not likely have the same mean and standard deviation.

Similarly, samples taken from the placebo-fed control group of a drug test and those taken from the drug-pres🦹cribed group should have a slightly different mean and standard🐽 deviation.

Four assumptions are made while using a t-test:

  1. The data collected must follow a continuous or ordinal scale, such as the scores for an IQ test.
  2. The data is collected from a randomly selected portion of the total population
  3. The data will result in a normal distribution of a bell-shaped curve.
  4. Equal or homogenous variance exists when the standard variations are equal.

Mathematically, the t-test takes a sample from each of the two sets and establishes the problem statement. It assumes a null hypothesis, which means that it assumes the two means are equal.

Using the t-test formulas, values are calculated and compared against the standard values. This comparison 澳洲幸运5官方开奖结果体彩网:helps to determine the e🌊ffect of chance on th꧃e difference, and whether the difference is outside that chance range꧂.

The💟 t-test questions whether the difference between the groups represents a true difference in the stud✅y or merely a random difference.

Based on the results, the assumed null hypothesis is accepted or rejected. If the null hypothesis is rejected, it indicates that data readings are strong and are 📖probably not due to chance.

  • Null hypothesis rejected: Differences are statistically significant
  • Null hypothesis accepted: Differences are not statistically significant

The t-test is just one of many tests used for this purpose. Others may be more appropriate deꦯpending on the number of variables or the size of the sample.

For example, statisticians use a z-test for data sets with a large sample size. Oth🧸er testing options include the chi-square test and the f-test.

Example of When a T-Test Would Be Useful

Imagine that a drug manufacturer tests a new medicine. Following standard procedure, the drug is given to one group of patients, and a placebo is given to another group called the conಌtrol group.

The placebo is a substance with no therapeutic valu💟e and serves as a benchmark to measure how the other group, administered the actual drug, responds.

After the drug trial, the members of the control group reported an increase in average life expectancy of three years. Members of the group that was prescribed the new drug reported an increase in average life expꦿectancy of four years.

Initial observation inไdicates that the drug is working. However, it is also possible that the observation may be due to chance.

A t-test could be used to determine if the results are significant and applicable to the entire population, or whether they are random and not due to the drug intervention.

Using the T-Test

Calculating a t-test requires three f🦂undamental data values:

  1. The difference between the mean values from each data set, also known as the mean difference
  2. The 澳洲幸运5官方开奖结果体彩网:standard deviation of each group
  3. The number of data values of each group

The t-test produces two values as its output: t-value and 澳洲幸运5官方开奖结果体彩网:degrees of freedom. The t-value, or t-score, is a ratio of the difference between the mean of the two sample sets and the variation that exists wi♏thin the sample seꦍts.

The numerator is the difference between the mean of th🍎e two sample sets. The denominator is the variation that exists within the sample se✱ts and is a measurement of the dispersion or variability.

This calculated t-value is then compared against a value obtained from a critical value table🌸 called the T-distribution table.

Higher values of the t-score indicate♏ that a large difference exists between the two sam🐷ple sets. The smaller the t-value, the more similarity exists between the two sample sets.

Degrees of freedom refers to the values in a study that have the freedom to varꦫy and are essential for assessing the importance and the validity of the null hypothesis.

Computation of these values usually depends upon the ༺number of data records available☂ in the sample set.

Important

A large t-score, or t-value, indicates that the groups are different while a small t-score♋ indicates t🌊hat the groups are similar.

Types of T-Tests

Paired Sample T-Test Formula

The paired t-test, or correlated t-test, is a dependent type of test and is performed when the samples consist of matched pairs of similar units, or when there are cases of repeatꦑed measure🐠s.

For example, there may be instances where the same patients are repeatedly teste꧅d before and after receiving a particular treatment. Each patient is being used as a control sample against🐎 themselves.

This method also applies to cases where the samples are related or have matching characteristics, like a comparative analysis involving ꦑchildren, parents, or siblings.

The formula to compuౠte the t-value and degrees of freedom for a paired t-test is:

T = mean 1 mean 2 s ( diff ) ( n ) where: mean 1  and  mean 2 = The average values of each of the sample sets s ( diff ) = The standard deviation of the differences of the paired data values n = The sample size (the number of paired differences) n 1 = The degrees of freedom \begin{aligned}&T=\frac{\textit{mean}1 - \textit{mean}2}{\frac{s(\text{diff})}{\sqrt{(n)}}}\\&\textbf{where:}\\&\textit{mean}1\text{ and }\textit{mean}2=\text{The average values of each of the sample sets}\\&s(\text{diff})=\text{The standard deviation of the differences of the paired data values}\\&n=\text{The sample size (the number of paired differences)}\\&n-1=\text{The degrees of freedom}\end{aligned} T=(n)s(diff)mean1mean2where:mean1 and mean2=The average values of each of&nb♏sp;the sample setss(diff)=The stanജdard deviation of the differences 🥃of the paired data valuesn=The&n🌺bsp;🐟sample size (the number of paired differences)n1=The degrees of freedom

Equal Variance or Pooled T-Test Formula

The equal variance t-test is an independent t-test and is used when the number of samples in each 🦹group is the same, or the ജvariance of the two data sets is similar.

The formula to calcul🃏ate t-value and degrees of freedom for equal variance t-test is:

T-value = mean 1 mean 2 ( n 1 1 ) × var 12 + ( n 2 1 ) × var 22 n 1 + n 2 2 × 1 n 1 + 1 n 2 where: mean 1  and  mean 2 = Average values of each of the sample sets var 1  and  var 2 = Variance of each of the sample sets n 1  and  n 2 = Number of records in each sample set \begin{aligned}&\text{T-value}=\frac{\textit{mean}1-\textit{mean}2}{\sqrt{\frac{(n1-1)\times\textit{var}1^2+(n2-1)\times\textit{var}2^2}{n1+n2-2}\times\frac{1}{n1}+\frac{1}{n2}}}\\&\textbf{where:}\\&\textit{mean}1 \text{ and } \textit{mean}2=\text{Average values of each}\\&\text{of the sample sets}\\&\textit{var}1\text{ and }\textit{var}2=\text{Variance of each of the sample sets}\\&n1\text{ and }n2=\text{Number of records in each sample set}\end{aligned} T-value=n1+n22(n11)×var12+(n21)×var22×n11+n21mean1mean2where:mean1 and mean2=Average values of eachof the sample setsvar1 and var2=Variance🌞 of each of&nbs🐠p;the sample setsn1 and n2=♒Number of records in each&n♍bsp;sample set

and,

Degrees of Freedom = n 1 + n 2 2 where: n 1  and  n 2 = Number of records in each sample set \begin{aligned} &\text{Degrees of Freedom} = n1 + n2 - 2 \\ &\textbf{where:}\\ &n1 \text{ and } n2 = \text{Number of records in each sample set} \\ \end{aligned} Degrees of Freedom=n1+n22where:n1 and n2=Number of records in&🦩𒅌nbsp;each sample set

Unequal Variance T-Test Formula

The unequal 澳洲幸运5官方开奖结果体彩网:variance t-test is an independent t-test and is used when▨ the number of samples in each group is different, and the variance of the two data sets is also different. This test ♔is also called Welch's t-test.

The formula to calculate t-value and de🃏grees of freedom for an uneq🔥ual variance t-test is:

T-value = m e a n 1 m e a n 2 ( v a r 1 n 1 + v a r 2 n 2 ) where: m e a n 1  and  m e a n 2 = Average values of each of the sample sets v a r 1  and  v a r 2 = Variance of each of the sample sets n 1  and  n 2 = Number of records in each sample set \begin{aligned}&\text{T-value}=\frac{mean1-mean2}{\sqrt{\bigg(\frac{var1}{n1}{+\frac{var2}{n2}\bigg)}}}\\&\textbf{where:}\\&mean1 \text{ and } mean2 = \text{Average values of each} \\&\text{of the sample sets} \\&var1 \text{ and } var2 = \text{Variance of each of the sample sets} \\&n1 \text{ and } n2 = \text{Number of records in each sample set} \end{aligned} T-value=(n1var1+n2var2)mean1mean2where:mean1 and mean2=Average values of eachof the sample setsvar1 and var2=Variance of🐎 each&ꦬnbsp;of the sample setsn1 and n2=Num♔ber&nb🙈sp;of records in each sample set

and,

Degrees of Freedom = ( v a r 12 n 1 + v a r 22 n 2 ) 2 ( v a r 12 n 1 ) 2 n 1 1 + ( v a r 22 n 2 ) 2 n 2 1 where: v a r 1  and  v a r 2 = Variance of each of the sample sets n 1  and  n 2 = Number of records in each sample set \begin{aligned} &\text{Degrees of Freedom} = \frac{ \left ( \frac{ var1^2 }{ n1 } + \frac{ var2^2 }{ n2 } \right )^2 }{ \frac{ \left ( \frac{ var1^2 }{ n1 } \right )^2 }{ n1 - 1 } + \frac{ \left ( \frac{ var2^2 }{ n2 } \right )^2 }{ n2 - 1}} \\ &\textbf{where:}\\ &var1 \text{ and } var2 = \text{Variance of each of the sample sets} \\ &n1 \text{ and } n2 = \text{Number of records in each sample set} \\ \end{aligned} Degrees of Freedom=n11(n1var12)2+n21(n2var22)2(n1var12+n2var22)2where:var1 and var2=Variance of each of th♑e sampl๊e setsn1 and n2=Number of records in ♍;each sample set

Which T-Test to Use

The following flowchart can determine which t-test to use based on the characteristics🥂 of the samꦚple sets. The key items to consider include:

  • The similarity of the sample records
  • The number of data records in each sample set
  • The variance of each sample set
T-test

Image by Julie Bang © Investopedia 2019

Example of an Unequal Variance T-Test

Assume that the diagonal measurement of paintings received in an art gallery is taken. One group of samples includes 10 paintings, while the other includes 20 paintings. The data sets, with the corresponding mean and variance values, are as follows:

  Set 1 Set 2
  19.7 28.3
  20.4 26.7
  19.6 20.1
  17.8 23.3
  18.5 25.2
  18.9 22.1
  18.3 17.7
  18.9 27.6
  19.5 20.6
  21.95 13.7
    23.2
    17.5
    20.6
    18
    23.9
    21.6
    24.3
    20.4
    23.9
    13.3
Mean 19.4 21.6
Variance 1.4 17.1

Is the difference from 19.4 to 21.6 due to 𓆏chance alone, or do difference꧅s exist in the overall populations of all the paintings received in the art gallery?

We establish the problem by assuming the null hypothesis that the mean is the same between the two sample sets and conduct a t-test to test if the hypothesiꩵs is plaus🐼ible.

Since the number of data records is different (n1 = 10 and n2 = 20) and the variance is also differentཧ, the t-value and degrees of freedom are computed for the above data set using the formula for the Unequal Variance T-Test.

The t-value is -2.24787. Since the minus sign can be ignored when comparing the 𓆏two t-values, the computed value is 2.24787.

The degrees of 🔜freedom value is 24.38 and is reduced to 24 (the formula definition requires rounding down the value to the least possible integer value).

One can specify a level of probability (alpha level, level of significance, p) as a cr☂iterion for acceptance. In most cases, a 5% value ca꧒n be assumed.

Using the degree of freedom value as 24 and a 5% level of signific༺ance, a look at the t-value distribution table ꦍgives a value of 2.064.

Comparing this value against the computed value of 2.247 indicates that the calculated t-value is🍰 greater than the table value at a significance level of 5%. 

Theওrefore, it is sa𒈔fe to reject the null hypothesis that there is no difference between means.

Rejecting the null hypothesis means the population set has intri🎃nsic di൲fferences, and they are not by chance.

How Is the T-Distribution Table Used?

The T-Distribution Table is available in one-tail and 澳洲幸运5官方开奖结果体彩网:two-tails formats. The one-tail format is used for assessing cases that have a fixed value or range with a clear direction, either positive or negative. For instance, what is the probability of the output value remaining below -3, or getting more than seven when rolling a pair of dice? The two-tails format is used for range-bound analysis, such as asking if the coordinates fall between -2 and +2.

What Is an Independent T-Test?

The samples of independent t-tests are selected independent of each other where the data sets in the two groups don’t refer to the same values. They may i꧒nclude a group of♏ 100 randomly unrelated patients split into two groups of 50 patients each. One of the groups becomes the control group and is administered a placebo, while the other group receives a prescribed treatment. This constitutes two independent sample groups that are unpaired and unrelated to each other.

What Does a T-Test Explain and How Is It Used?

A t-test is a statistical test that is used to c🤡ompare the means of two groups. It is often used in hypothesis testing to determine whether a process or treatment has an effect on the population of interest, or whether two groups are different from one another.

The Bottom Line

A t-test is used to determine if there is a statistically significant difference between the means of two population samples.꧙ It is used in statistics for hypothesis testing and can indicate whether differences between two populations are meaning꧑ful or random.

The t-test calculation uses three data: the difference between the mean values from each 💃data set, the standard deviation of each group, an🤡d the number of data values.

There are different variations of the t-test formula. Which one to use depends on different factors. However, each variation is used to investigate the same statisꦇtical question.

Compare Accounts
The offers that appear in this table are from partnerships from which Investopedia receives compensation. This compensation may impact how and where listings appear. Investopedia does not include all offers available in the marketplace.
Take the Next Step to Invest
The offers that appear in this table are from partnerships from which Investopedia receives compensation. This compensation may impact how and where listings appear. Investopedia does not include all offers available in the marketplace.

Related Articles