Is the operational definition testworthy?
Example: you could use a measurement of your cake-baking skill as an operational definition of your intelligence. Clearly this is not a good definition: it lacks reliability and validity.

Face validity
Whether something looks valid on the surface.

The three statistical elements of testing
Mnemonic: Really Very Nasty. R = Reliability, V = Validity, N = Norms.

Norms
Give you a reference point.

Validity
Establishes that our measurement measures what it is supposed to measure.

Reliability
Establishes that our measurement is stable or consistent (reliable). It is the correlation between two measurements of the same thing. 1 means perfect reliability. We want it to be high, around .6 to .8.

Operational definition
Turns our abstract construct into something we can measure.

Abstract construct
The thing that we are really interested in but cannot measure directly.

Why are longer tests more reliable?
Each person answers questions in a variety of ways. If you ask only one question, you might not get a true picture of that person. Each item tests what you want it to, plus some other nonsense. When you average the items, the nonsense gets averaged out, leaving only the construct of interest.

Two kinds of reliability
Intrarater reliability: test the same person more than once. Interrater reliability: compare different people (raters) judging the same thing.

Intrarater reliability
Retest the person on different occasions. Problems: memory effects and practice effects. Internal consistency is conceptually different, because we are not retesting in any way.

Internal consistency
Unidimensionality (a single dimension). Only makes sense when all of the items are supposed to be measuring the same thing.

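The point about longer tests can be checked with a small simulation. This is only a sketch under stated assumptions: each item score is modeled as a person's true score plus random "nonsense", reliability is taken as the correlation between two administrations of the same-length test, and the names `pearson` and `simulated_reliability`, the noise level, and the sample size are all made up for illustration.

```python
import random

def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5

def simulated_reliability(n_items, n_people=2000, noise_sd=2.0, seed=0):
    """Correlate two simulated administrations of an n_items-long test.

    Assumption: item score = true score + independent random noise.
    """
    rng = random.Random(seed)
    true_scores = [rng.gauss(0, 1) for _ in range(n_people)]

    def administer():
        # Each person's test score is the average of their noisy item scores.
        return [
            sum(t + rng.gauss(0, noise_sd) for _ in range(n_items)) / n_items
            for t in true_scores
        ]

    return pearson(administer(), administer())

# Reliability should climb as the test gets longer.
for k in (1, 5, 20, 80):
    print(k, "items -> reliability", round(simulated_reliability(k), 2))
```

With only one item the noise dominates, so the two administrations barely agree. As more items are averaged, the noise cancels and the correlation rises toward 1.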
Split-half reliability
Split the test in half and correlate one half with the other half. This is like treating one test as two. Dividing the test in half reduces the reliability, because each half is shorter than the full test. Problem: you could get many different scores, depending on how you split it.

Cronbach's alpha
Conceptually, correlate each item on the test with the total test score and average the results (the actual formula works from the item variances and the total-score variance). Unlike split-half, there is only one value for a given set of data. Alpha (α) is the symbol you will always see reported for reliability.

Kuder-Richardson
Like alpha, but only applies to a test where there are right and wrong answers.

Spearman-Brown reliability
An adjustment formula that increases the split-half reliability back to what it would be for the full-length test.

Interrater reliability
Only applies when we expect people to agree with each other. Correlate the judges with each other. Example: raters of videotapes. Does not apply to paper-and-pencil (objective) tests, but does apply where tests have subjective elements, such as projective tests.
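The reliability measures in these cards can be sketched in a few lines. This is an illustrative sketch, not a reference implementation: the score table and judge ratings are invented, the function names are hypothetical, the split used is odd versus even items (one of many possible splits), and alpha is computed with the standard variance-based formula, k/(k-1) × (1 − sum of item variances / variance of total scores).

```python
import statistics

def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5

def spearman_brown(r_half):
    """Step a half-test correlation up to the full-length estimate."""
    return 2 * r_half / (1 + r_half)

def split_half_reliability(scores):
    """Odd/even split. Returns (half correlation, Spearman-Brown corrected)."""
    half_a = [sum(row[0::2]) for row in scores]  # items 1, 3, ...
    half_b = [sum(row[1::2]) for row in scores]  # items 2, 4, ...
    r_half = pearson(half_a, half_b)
    return r_half, spearman_brown(r_half)

def cronbach_alpha(scores):
    """Cronbach's alpha from a table with one row per person, one column per item."""
    k = len(scores[0])
    item_vars = [statistics.variance([row[i] for row in scores]) for i in range(k)]
    total_var = statistics.variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Invented data: 6 people answering 4 items on a 1-5 scale.
scores = [
    [4, 5, 4, 5],
    [2, 3, 2, 2],
    [5, 5, 4, 4],
    [1, 2, 1, 2],
    [3, 3, 4, 3],
    [4, 4, 5, 5],
]

r_half, r_full = split_half_reliability(scores)
print("split-half r:", round(r_half, 3))
print("Spearman-Brown corrected:", round(r_full, 3))
print("Cronbach's alpha:", round(cronbach_alpha(scores), 3))

# Interrater reliability: correlate two judges rating the same five videotapes.
judge_a = [3, 4, 2, 5, 1]
judge_b = [3, 5, 2, 4, 1]
print("interrater r:", round(pearson(judge_a, judge_b), 3))
```

Changing the slices in `split_half_reliability` to a different split gives a different split-half value, while alpha stays the same for the same data. If every item is scored 0 (wrong) or 1 (right), the same alpha formula gives the Kuder-Richardson (KR-20) value.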