澳洲幸运5开奖号码历史查询

How Stratified Random Sampling Works, With Examples

Definition

Stratified random sampling is the process of creating subgro♏ups in a dataset according to various factors such as age, gender, income level, or education.

What Is Stratified Random Sampling?

Stratified random sampling involves the division of a populatꩲion into smaller subgroups known as strata. The strata are formed based on members’ shared attributes or characteristics in stratified random sampling or stratificat🤪ion, such as income or educational attainment.

Stratified random sampling has numerous applications and benefits including the study population demographics and 澳洲幸运5开奖号码历史查询:life expectancy. It's also 🐎referred to as proportional random sampling or quota random 🐲sampling.

Key Takeaways

  • Sampling involves statistical inference that's made using a subset of a population.
  • A population is divided into groups that share characteristics called strata for stratified random sampling.
  • Proportional stratified random sampling involves taking random samples from stratified groups in proportion to the population.
  • The strata aren't proportional to their occurrence in the population in disproportionate sampling.
  • Stratified random sampling differs from simple random sampling which involves the random selection of data from an entire population.
Stratified Random Sampling Definition

Investopedia / Xiaojie Liu

How Stratified Random Sampling Works

A researcher might find that the population size is too large to complete research on it when beginning an analysis of a group of entities with similar characteristics. They may select a small group from the overall population to save time and moneওy and to make the research more feasible💙. This is known as sampling.

The small group is referred to as a 澳洲幸运5开奖号码历史查询:sample size which is a subset of the popul🐲ation that's used to represent the en♛tire population. A sample might be selected from a population in many ways. One is the stratified random sampling method.

Stratified random sampling involves dividing the entire population into homogeneous groups called strata, the plural of stratum. Random samples are then selected from each stratum to analyze the different experiences or outcomes associated with each of the demographic groups.

Important

Stratified sampling is used to highlight differences among groups in a population. This is different from simple random sampling which treats all members of a population as equal with an equal likelihood of being sam🥀pled.

Consider an academic researcher who would like to know the number of MBA students in a specific graduating year who received a job offer within three months of graduation. The researcher would find that there were almost 200,000 MBA graduates that year. They could take a simple ran🌌dom sample of 50,000 graduates and run a survey. They could divide the population into strata and take a random sample from the strata, however, to learn more.

They would create population groups based on gender, age range, race, country of nationality, and career background. A random sample from each stratum would be taken in a number proportional to the stratum’s size compared with the population. These subsets of the strata would then be pooled to form a random sample which the researcher would analyze for differences in groups that receive job offers after graduation.

Simple vs. Stratified Random Samples

Simple random samples and stratified random samples are both statistical measurement tools. A simple random sample is used to represent the entire data population. A stratified random sample divides the population into smaller groups or strata based on shared characteristics. Stratified sampling is more complicated, time-consuming, and potentially more expensive to carry out than simplified random sampling.

The simple random sample is often used when:

  • There's little information available about the data population.
  • The data population has too many differences to divide into clean subsets.
  • There's only one distinct characteristic among the data population.

A candy company may want to study the buying habits of ಞits customers to determine the future of its product line. The company might choose 100 customers as a random sample if there are 10,000 of them. It can then apply what it finds from those 100 customers to the rest of its base.

Those 100 customers would be divided into strata based on age, income, or other characteristics with stratified random sampling. There won't be many in each stratum with only 100 people in the sample, however, or there might not be many differences between the strata. It would make more sense to use simple random sampling in this case and sample 100 members purely at random without defining their individual characteristics.

Proportionate vs. Dispropor𒅌tionate Stratification

Stratified random sampling ensures that each subgroup of a given population is adequately represented within the whole sample pop๊ulation of a research study. Stratification can be proportionate or disproportionate.

The sample size of each stratum is proportionate to the population size of the stratum with proportionate stratification. This type of stratified random sampling is often a more precise metric because it’s a b❀etter representat🍸ion of the overall population.

Suppose a researcher is looking at a population of 180,000 people and wants to use a sample of 50,000 stratified by using age range. The researcher would use the formula:

Proportionate stratified random sample = (Sample size / Population size) × Stratum size
Age group 24–28 29–33 34–37 Total
Number of people in stratum 90,000 60,000 30,000 180,000
Strata sample size 25,000 16,667 8,333 50,000

The strat🍌a 🌌sample size in the age range of 24 to 28 years old is calculated as:

(50,000/180,000) × 90,000 = 25,000

The same m🦹ethod is used for the other age-range groups. The researcher can perform simple random sampling in each stratum to select their survey participants now that the strata sample size is known: 25,000 people ages 24 to 28 will be selected randomly from the entire population, 16,667 people ages 29 to 33, and 8,333 people ages 34 to 37.

The size of each stratum isn't proportional to its size in the population in a disproportional stratified sample. The researcher might decide to sample half the graduates within the 34 to 37 age group and one-third of the graduates within the 29 to 33 age group.

One person can't fit into multiple strata. Each entity can only be included in one stratum. Having overlapping subgroups means that some individuals will have higher chances of being selected for the survey and this negates the concept of stratified sampling as a type of probability sampling.

Fast Fact

Portfo♋lio managers can use stratified random samplingཧ to create portfolios by replicating an index such as a bond index.

Advantages of Stratified Random Sampling

The main advantage of stratified random sampling is that it captures key population characteristics. This method of sampling produces characteristics in the sample that are proportional to the overall population similar to a weighted average. Stratified random sampling works well for populations with a variety of attributes in which🎉 subgroups can be formed.

Stratification gives a smaller 澳洲幸运5开奖号码历史查询:error in estimation and greater precision than the s𓄧imple random sampling method. The greater the differences among the strata, the greater the gain🦂 in precision.

Disadvantages of Stratified Random Sampling

Unfortunately, this method of research can't be used in every study. Researchers must be able to identify every member of a population being studied to use it. They must classify each of them into one and only one subpopulation. Stratified random sampling can't be used if researchers can’t confidently classify every member of the population into a subgroup. This can be especially difficult if a definitive list of an entire 澳洲幸运5开奖号码历史查询:population isn't available.

Overlapping can be an issue if there are subjects that fall into multiple subgroups. Those who are in multiple subgroups are more likely to be chosen when simple random sampling is ꦛ𒈔performed. The result could be a misrepresentation or inaccurate reflection of the population. Stratified random sampling becomes ineffective if the sorting process is too difficult.

Example of Stratified Random Sampling

Supp🐲ose a research team wants to determine the grade point average (GPA) of college students across the United States. The team has difficulty collecting data from all 21 million college students and de𝓀cides to take a random sample of the population by using 4,000 students.

The team looks at the different attributes of the sample𝄹 participants and wonders if there are any differences in GPAs relative to the students’ majors. Suppose it finds that 560 students are English majors, 1,135 are science majors, 800 are computer science majors, 1,090 are engineering majors, and 415 are math majors. The team wants to use a proportional stratified random sample w🔴here the stratum of the sample is proportional to the random sample in the population.

Now assume that the team researches the 澳洲幸运5开奖号码历史查询:demographics of college students in the U.S. and finds that 12% major in English, 28% major in science, 24% major in computer science, 21% major in engineering, and 15% major in mathematics. Fi🌺ve strata are created from the stratified random sampling process.

The team must then confirm that the stratum of the population is in proportion to the stratum in the sample. They find that the proportions aren't equal, however. The team would have to resample 4,000 students from the population and randomly select 480 English, 1,120 science, 960 computer science, 840 engineering, and 600 mathematics students.

The research team has a proportionate stratified random sample of college students with these groups. It provides a better representation of students’ college majors in the U.S. The researchers can then highlight specific strata and investigate GPA with added information about the students' majors.

When Would I Use Stratified Random Sampling?

Stratified random samplingꦗ is often used when researchers want to know about different subgroups or strata based on the entire population being studied. Researc💮hers might want to explore outcomes for groups based on differences in race, gender, or education.

Which Sampling Method Is Best?

The best method of sampling will depend on the nature of the analysis and the data being used. Simple random sampling is generally the easiest and cheapest to use but stratified sampling can produce a more accurate sample relative to the population that's under study.

What Are the Two Types of Stratified Random Sampling?

There are two main types of stratified random samꦏpling: proportion𝓰ate and disproportionate. Proportionate sampling takes each stratum as proportionate to the population size of the stratum. The analyst will over- or under-sample certain strata based on the research question or study design being employed in disproportionate sampling.

How Are Strata Chosen for Stratified Random Sampling?

The strata will depend on the subgroups in which you're interested that appear in your population. These subgroups are based on shared characteristics among participants such as gender, race, educational attainment, geographic location, or age group.

The Bottom Line

Stratified random sampling is the process of creating subgroups in a dataset according to various factors such as age, gender, income level, or education. A random sample is then taken from each of the strata allowing researchers to obtꦺain samples from various subgroups including those that may be under-represented.

A stratified random sample may provide a more comprehensive picture of a broader dataset in this way. Using this method may not be possible across all studies, however, depending on the population or sample size, the⛄ level of information available about the population, and the time and resources available. The benefit of stratified random sampling is that it allows for a more accurate and nuanced representation of a population overall compared with a simple sampling method.

Article Sources
Investopedia requires writers to use primary sources to support their work. These include white papers, government data, original reporting, and interviews with industry experts. We also reference original research from other reputable publishers where appropriate. You can learn more about the standards we follow in producing accurate, unbiased content in our editorial policy.
  1. Researcher.Life. "."

  2. Yale University Department of Statistics and Data Science. "."

  3. Entropik Technologies. "."

Open a New Bank Account
The offers that appear in this table are from partnerships from which Investopedia receives compensation. This compensation may impact how and where listings appear. Investopedia does not include all offers available in the marketplace.

Related Articles