Statistics

  1. Descriptive Statistics
    • cosists of metods for organizing and summarizing information
    • ex. construction of graphs, charts, tables...
  2. population
    The collection of all individuals or items under cosideration in a statistical study
  3. Sample
    That part of the population from which inforamtion is obtained
  4. Inferential Statistics
    consists of methods for drawing and measuring the reliability of conclusions about a population based on information obtained from a sample of the population
  5. Classifying Statistical Studies (descr or statis)?
    • The study is descriptive - If the purpose of the study is to examine and explore information for its own intrinsic interest only. ( can be performed on both sample and pupulation)
    • The study is inferential - if information is obtained from a sample of a population and the purpose of the study is to use that information to draw conclusions about the population.
  6. Observational Study
    observing of characteristics and take measurements, as ina sample survey.Can reveal only association.
  7. Designed Experiments
    researchers impose treatments and controls and then observe characteristics and take measurements. Can help establish causation
  8. Census
    Obtaining information for the entire pupulation of interest. However, it is time consuming and might be impossible.
  9. The two methods for obtaining infromation other than census
    Sampling and experimentation
  10. Simple Random Sampling(sample)
    • Sim. Ran. Sampling - A sampling procedure for which each possible sample of a given size is equally likely to be the one obtained.
    • S.. R. Sample - A sample obtained by simple random sampling
    • There are 2 types of Simple Random Sampling
    • 1. Simple random sampling with replacement - a member of pup. can be selected more, than once
    • 2. ----without the replacement - can be selected at most once.
  11. Systemic Random Samping
    • Step 1: Divide the populati1on size by the sample size and round the result down to the nearest whole number, m.
    • Step 2: Use a random - number table (or a similar devise) to obtain a number, k, between 1-m.
    • Step 3: Select for the sample those members of the population that are numbered, k, k+m, k+2m, k+3m..
  12. Cluster Sampling
    • useful when the members of the population are widely scattered geographically.
    • Step1: Divide the population into groups (clusters)
    • Step 2: Obtain a simple random sample of the clusters
    • Step 3: Use all the members of the clusters obtained in step 2 as the sample.
  13. Stratified Sampling
    • more reliable than cluster sampling. First divided into subpopulations, called strata. Then sampling is done from each sratum. Uses proportions called stratifeid random Sampling with proportional allocation
    • Step 1: Divide the population into subpopulations (srata)
    • Step 2: From each sratum, obtain a simple random sample of size proportional to the size of the stratum; that is, the sample size for a stratum equals the total sample size times the stratum size divided by the populations size.
    • Step 3: Use all the members obtained in Step 2 as the sample.
  14. Principles of Experimental Design
    • -Experimental Units or subjects - those on whom performed the experiment. IF on humans -experimental units, if non humans - subjects.
    • -Control - who is given the placebo
    • -treatment group - who is given the treatment
    • - Randomization - The exper. units shold be randomly divided into groups to avoid unintentional selection bias in constituting the groups
    • - Replication - A sufficient number of experimental units should be used to ensure that randomization creates groups that resemble each other closely and to increase the chances of detecting any differences among the treatments.
    • Treatment - experimental condition
  15. Terminology of Experimental Design
    • response variable - experimental outcome that is to be measured or observed.
    • Factor - types of variables whose effect on the response variable in the experiment (ex. Ph and irrigation regime)
    • Levels - The possible values of a factor ( ex. with p4, no P4, light, none, med...)
    • Treatment - Each experimental condition. For one-factor experiments, the treatments are the levels of the single factor. (ex. 10 treatments, no p4/none, no p4/light, with p4/none....)
  16. Completely Randomized Design
    all the experimental units are assigned randomly amon all the treatments.
  17. Randomized Block Design
    experiemtanl units that are similar in ways that are expected to affect the response variable are grouped in blocks. Then the random assignment of experimetanl units to the treatments is made block by block. The experimental units are assigned randomly among all the treatments separately withing each block.
  18. Variable
    A characteristic that varies from one person or thing to another
  19. Qualitative variables
    A nonnumerically vallued variable
  20. Quantitative variable
    • A numerical valued variable
    • Two types:
    • 1. Discrete
    • 2.Continuous
  21. Discrete variable (prerivistiy)
    a quantitative variable whose possible values can be listed (ex. # of siblings,)
  22. Continuous variable
    A quantitative variable whose possible values form some einterval of numbers. A measurement of something, height of a person, weight of a newborn, lengh of time a battery lasts.
  23. Data
    values of variable (info collected, organized, and analyzed by statisticians is data)
  24. Observation
    each individual piece of data
  25. Data set
    collection of all observations fora paritcular variable.
  26. Classes
    • categories of grouping methods.
    • ex. group data by 10s, 1st class 30<- 40 from 30 days, but not including 40 days.
    • <- simble means "up to, but not including"
    • number of classes should be between 5-20
  27. Frequency
    The number of observations that fall in a particular class. Ex. the frequency of the class 50<-60 is 8.
  28. Frequency distribution
    a listing of all classes and their frequencies.
  29. Relative frequency
    • the ratio of the frequency of a class to the total number of observations. To find the pecentage, frequency of a class devide by total number of observations and then multiply the result by 100.
    • 8/40=0.20 or 20% . The relative frequencies should add up to 1 (100%)
  30. Relative-frequency distribution
    a list of all classes and their relative frequencies.
  31. Lower cutpoint
    the smallest value that could go into a class
  32. Upper cutpoint
    the smallest value that could go in the next higher class(or the lower cutpoint of the next higher class)
  33. Midpoint
    the middle of a class, found by averaging its cutpoints ex. 50<-60 (50+60)/2=55
  34. Width
    The difference between the cutpoints of a class. 60-50=10
  35. Grouped-data table
    A table that provides the classes, frequencies, relative frequencies and mid-points of a data set.
  36. How to make curtain width to make particular amount of classes
    • Make 8 classes
    • 1. Find max value (45)
    • 2. Min value ( 155)
    • 3. Range between Max and min =Max-Min (155-45) =110
    • 4. Width - 110/8=13.75~14 is the width between lower and upper cutpoints 45<-59
  37. Frequency Histogram
    A graph that displays the classes on the horizonal axis and the frequencies of the classes on the vertical axis. The frequency of each class is represented by a vertical bar whose height is qual to the frequency of the class.
  38. Relative-frequency histogram
    A graph that displays the classes on the horizontal axis and the relative-frequencies of the classes on the vertical axis. The relative frequency of each class is represented by a vertical bar whose height is equal to the relative frequency of the class.
Author
khonka
ID
36972
Card Set
Statistics
Description
Chapter 1-2.3
Updated