PSY 364

  1. Statistics
    Branch of mathematics that focuses on the organization, analysis, and interpretation of a group of numbers.

    • *how to prove a point
    • *numbers to use to advance a cause
    • *data are not theory neutral
    • *numbers don't mean anything out of a particular context
    • *Designed to advance a particular cause and supported by particular backgrounds
  2. Descriptive Statistics
    Procedures for summarizing a group of scores or otherwise making them more comprehensible

    • *used to summarize and describe data
    • *data is succinct and clear
    • *a way to characterize an overall opinion in one number, ways to get around the mounds of data
    • *Describing what you actually collected
  3. Inferential Statistics
    Procedures for drawing conclusions based on the scores collected in a research study but going beyond them.

    *includes methods for generalizing beyond the actual sample data to infer the properties of population data that you as a researcher did not actually collect

    Example- effects of drug on memory performance

    • * is a step beyond descriptive
    • *consider the assumptions for "generalizability"
    • * what applies to a smaller group can actually apply to a larger group
  4. Variable
    characteristic that can have different values

    Example: Stress level, age, gender, religion
  5. Values
    possible number or category that a score can have
  6. Score
    particular person's value on a variable
  7. Data
    This is a generic term for whatever is being studied, is a pleural term (data are). It could be social groups (Rugby team). It could be events (basketball games), It could be organisms (two-tied tree sloth).

    **a set of measurements that are made from the observations you make or the research you conduct.

    **anything you are interested in and ask questions that you have collected data. But that does not mean you can analyze what you have collected.
  8. Raw Data:
    The original measurements, not things that have been derived.

    • Example-
    • Raw: Number of suicide attempts (reported)

    Derived, transformed: Severity of depression
  9. Sets of Data
    • *Samples
    • *Populations
    • *Parameters
  10. Samples
    measure the most deals with statistics. Subsets of populations

    Part of a population, a set of data from which we draw conclusions about the population of interest.

    *A sample can be larger than a population

    • *Samples are often more convenient and practical to use than populations are
    • *Limited Time
    • *Limited resources
    • *Limited accessibility to subjects
  11. Populations
    the group that we are interested in; can be any size 5 to an entire country. Not size but interest. Will not generalize. The species as a whole.

    "Everybody"- but have to make a distinction of what we are looking for.

    *the complete set of data that we want to draw inferences from or make conclusions about.

    • Examples-
    • **all people between the ages of 12 and 15 who smoke cigarettes.

    **all Drexel freshman from Zimbabwe
  12. Parameters
    Quantitative summary characteristics of populations. Deals with population

    Greek symbols are used to specify parameters
  13. Mean (Parameters)
    Mean- µ
  14. Standard Deviation (Parameters)
    standard deviation- o
  15. Regression weights (Parameters)
    regression weights- B
  16. Correlation Coefficients (Parameters)
    Correlation Coefficients - p
  17. Mean Differences (Parameters)
    Mean Differences- ∆
  18. Statistics symbols
    these symbols are American (Latin) symbols are used to specify statistics.
  19. Mean (Statistics)
    Mean - M
  20. Standard Deviation (Statistics)
    • Standard Deviation- s
    • One and only one thing
  21. Correlation coefficient (Statistics)
    Correlation coefficient- r
  22. Mean difference (Statistics)
    Mean difference - d
  23. Finite Populations
    sometimes a small set of data is of interest for its own sake

    Example: Drexel freshmen from Zimbabwe

    *here if only 10 exist and all 10 are participating in your study you are working with a finite population

    *NOTE: you would use parameters to summarize the data of this group.
  24. Ways of obtaining Parameters
    • *Census
    • *The Random Sample
  25. Census
    • A case where the entire population is measured via a survey. Measuring everybody in a population like a country, city or state.
    • *can be completed on a large population

    Example: The US Census, The Drexel Men's Basketball Team
  26. The Random Sample
    although it may seem that there is no relation and or connection, doesn't mean that they aren't related. There very well could be a relation to each stimuli.

    *every observation in the population has an equal chance of being includes

    *the choice of any one observation does not change the likelihood of the choice of any other observation.
  27. Random samples are generally .....
    *Not identical to each other

    *Not identical to the population

    However, random samples are more like the population the larger the samples are.
  28. Variable
    any attribute, property, or characteristic of some organism, object, or samples are

    *A variable is not a constant. There should be a possibility of difference.

    *For a variable of interest, not all members of a population or sample will have the same scores or values on that variable.

    Examples- eye color, number of classes attended, score on the first exam, etc.
  29. Categorical variable
    if y (our variable) represents an observation on some category.

    Example- y = mental health status

    y1= depressed, y2 = depressed (diff level of severity), y3 = normal
  30. Numeric
    if z (our variable) is something that we can count or measure.

    Example: z= number of arrests

    z1=4, z2=1, z3= 5
  31. Two kinds of Variables
    *Dependent variable

    *Independent variable
  32. Independent Variable (IV)
    the variable that is controlled or manipulated

    • Examples
    • *the number of cigarettes smoked per day
    • *number of hours studying for exam 1
    • *Gender* (cant change your sex)
    • *Handedness* (can make you switch what hand you use to write but it would be uncomfortable)
  33. Organismic Variable
    Type of variable, this is a characteristic of an organism. Also called demographic variable. Typically used as an independent variable.

    Examples- Gender, height, religion, beliefs about smoking
  34. Dependent Variable (DV)
    the measured variable that is believed to result from manipulation of the independent variable. Something controlled. A consequence of the IV
  35. Examples of IV vs DV
    • IV-number of hours studying ,
    • mental health status, amount of exercise per week

    DV- score on Exam 1, Number of suicide attempts, Average weight loss per week

    **Whatever the dependent variable is depends on what the independent variable is
  36. Discrete Variable
    can be exactly measured by counting. It takes on a finite number of values, usually whole numbers. A mean can involve a decimal (we are concerned with groups as a whole not individuals)


    • *Number correct on first exam- 20
    • *Number of parking tickets- 5
  37. Discrete equals
    whole numbers
  38. Continuous Variable
    takes on an infinity of values within some interval, where each value requires an infinite number of numeric characters to specify.

    Examples- time, weight
  39. Constants-
    - the same value exists for all measured (in the sense of your observations that could have been variables became constants)
  40. Variables
    multiple values exists across measured
  41. Qualitative variables
    • levels differ by category, quality, characteristics (one
    • kind of eye color or two kinds of eye color)
  42. Quantitative variables
    - variables differ by amount or quantity (the amount it took you to react to a certain stimuli).
  43. Discrete vs. Continuous
    *Discrete variables can be accurately measured exactly

    *Continuous variables are refined ad infinitum

    *Materialism and reductionism
  44. Nominal Data
    classification into mutually exclusive categories

    -No logical order is needed, only that the categories differ.(male to female or female to male, there is nothing in between)

    -Numbers may be Used, but only to identify categories.

    -distinguishing things by kind (male or female, blue eyes or brown eyes)

    • NOTE- counting is the only operation you can perform on the data, cant really average these number is these cases. There are “one” more of
    • that category or name.
  45. Ordinal Data
    *Classification using numbers (though not always) where the numbers:

    -represent mutually exclusive quantities

    -have ordering based on the relationships of > and <
  46. Interval data
    numbers represent mutually exclusive quantites that have an ordering and have equal steps along the measured variable.

    In other words, a 1-point difference in any location along the measured variable is the same as a 1-point difference at any other location.

    EXAMples- Fahrenheit or Celsius
  47. RATIO Data
    Numbers represent mutually exclusive quantities that have an ordering, with equal intervals along the measured variable and have the property that a true zero point exists.

    *This zero point indicates the total absence of the measured attribute.

    *Negative numbers do not exist

    Examples- *Temperature in Kelvin, drug dosage, time elapsed
  48. Central Tendency
    The central value toward which scored tend. Trying to describe a distribution distinctly.

    *measures of central tendency provide us with a single summary figure that describes the central location of an entire distribution of observations

    *measures of central tendency help us to simplify the comparison of two or more groups tested under different conditions.

    Most common: Mode, Median, Arithmetic Mean
  49. MODE-
    The most frequent score in the distribution- the score with the highest frequency

    In ungrouped distributions: mode is the score that appears with the greatest frequency

    In grouped distributions: mode is taken as the midpoint of the class interval that contains the greatest number of scores
  50. Properties of the Modes:
    the mode is easy to obtain, but is not very stable from sample to sample.

    *in grouped data, the mode may be strongly affected by the width and the location of the class intervals.

    There may be more than one mode for a set of scores

    With numerical data, the mean or the median is often preferred to the mode
  51. Remember the mode (Mo ) is the only
    measure of central tendency
  52. The Median (Mdn)-
    the middle

    The Median of the distribution is the point along the scale of possible scores below which 50% of the scores fall

    In other words: Median is the value that divides the distribution in two halves
  53. How to find the Mdn
    *Put scores in rank from lowest to highest

    *Make sure to include zero (if it is an actual score)

    *if n (or N) is an odd number, the median will be the score that has an equal number of scores below and above it.

    • * if n is an even number, the median is taken as the point halfway between he two scores that bracket the middle position
    • 12, 14, 15, 18, 19 ,20
  54. Two interpretations of the mean:-
    • “The mean can be viewed as the amount that each
    • person would get if the total amount (not frequency) of the variable being measured were divided up equally” (p.110)
    • *****Income for faculty

    the sum of all deviations around the mean=0

    Use: can be used with any quantitative level of measurment.
  55. Qualitative data
    ways of labeling information (eye color brown eyes vs. blue eyes). Qualities that you have
  56. Quantitative data
    people vary in terms of an amount of something that you could posses
  57. Mode you use for??
    for qualitative
  58. Median you use for ??
    For quantitative

    *only characterizes a distribution by a single score. Does not care about an extreme score. Only interested in the middle number. The middle most x. if your looking at a distribution with extreme scores.
  59. Variability
    a measure of variability is a single summary figure thatdescribes the spread of observations within a distribution (eye color and thereare different types of eye color that occur in our distribution). If everybody has the same eye color than that is a constant.
  60. Measures of variability: What are they?
    *the measures of variability express quantitatively the extent to which the scores in a distribution scatter about or cluster together.

    • *Measures of variability describe the spread of
    • an entire set of scores:

    o They do not specify how far a particular score diverges from the center of a group

    o They do not provide information about the shape of the distribution or the performance of a group.
  61. *Nomothetic approach to research
    - is my measure representative of anything or anyone?

    Concerned with measuring variables
  62. Range
    *difference between the highest and lowest scores

    *Two types: Exclusive and Inclusive
  63. -Exclusive range
    distance between the midpoints of the intervals containing the two most extreme scores (highest score minus the lowest score)
  64. -Inclusive range:
    distance between the upper limit of the highest score and the lower limit of the lowest score.
  65. Properties of the Range
    • 1- the range is ideal for preliminary work or in
    • other circumstances where precision is not an important requirement.

    2- The range is very sensitive to outliers

    3- The range is not sensitive to the total condition of the distribution

    4- The range is of little use beyond the descriptive level

    5- The range depends on sample size: greater sample size means grater range
  66. *Negative feature of the Range
    -highly sensitive to extreme scores (outliers)

    -Sampling fluctuation is extreme

    -Magnitude depends on sample size

    -Virtually useless in advanced statistics
  67. The Variance
    (a kind of mean a typical way in which scores differ/deviate)

    *if deviation scores provide the distance of each raw score from the mean, the mean of the deviation scores might be an attractive measure of variability

    BUT: Remember!

    *The sum of all deviations from the mean equals zero
  68. UBE
    *The unbiased estimate formula for the variance corrects for the tendency of the traditional formula to underestimate the population variance
  69. Properties of the Standard Deviation
    The SD is closely related to the arithmetic mean

    The SD is the most important of the measures of variability

    The SD is responsive to the exact position of every score in the distribution

    • The SD is very sensitive to the presence of a few extreme scores (thus, for skewed
    • distributions it may not be the best..)
Card Set
PSY 364