Big Data: week 5

  1. What function is helpful for creating categorical predictors?
    • get_dummies() from pandas
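A minimal sketch of the call, assuming a small made-up DataFrame (the column names "color" and "price" are illustrative and not from the course material):

```python
import pandas as pd

# Hypothetical toy data: one categorical column and one numeric column.
df = pd.DataFrame({
    "color": ["red", "green", "blue", "green"],
    "price": [10, 12, 9, 11],
})

# Expand "color" into one indicator column per category level.
dummies = pd.get_dummies(df, columns=["color"])

print(dummies.columns.tolist())
# ['price', 'color_blue', 'color_green', 'color_red']
```

Each new column marks whether a row belongs to that category; the numeric column is passed through unchanged.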
  2. What is the idea behind cross-validation?
    Instead of splitting the data into training and test sets only once, split it several different ways and average the results.
  3. What is a common way to use cross-validation?
    • 5-fold or 10-fold cross-validation is commonly used to assess a predictive model
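The idea of splitting several ways and averaging can be sketched by hand. This is only an illustration on simulated data, with a simple linear model fitted by least squares; the data, the model, and the choice of mean squared error as the score are all assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated data (assumption): y depends linearly on x plus noise.
n = 100
x = rng.uniform(0, 10, size=n)
y = 2.0 + 3.0 * x + rng.normal(0, 1, size=n)
X = np.column_stack([np.ones(n), x])  # intercept column + predictor

# Shuffle the row indices and cut them into k roughly equal folds.
k = 5
indices = rng.permutation(n)
folds = np.array_split(indices, k)

fold_mse = []
for i in range(k):
    test_idx = folds[i]
    train_idx = np.concatenate([folds[j] for j in range(k) if j != i])

    # Fit on the k-1 training folds, score on the held-out fold.
    beta, *_ = np.linalg.lstsq(X[train_idx], y[train_idx], rcond=None)
    pred = X[test_idx] @ beta
    fold_mse.append(np.mean((y[test_idx] - pred) ** 2))

# The cross-validated estimate is the average over the k held-out scores.
cv_mse = float(np.mean(fold_mse))
print(cv_mse)
```

Every observation is used for testing exactly once, so the averaged score depends less on one lucky or unlucky split.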
  4. How can you model (and plot) a different slope for each category of a categorical variable?
    • Create an interaction between the categorical variable and the numeric predictor
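A rough sketch of how an interaction produces a separate slope per group, using simulated data with a two-level group coded 0/1. The variable names and true coefficients are made up for the example.

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulated data (assumption): group 1 has a steeper slope than group 0.
n = 200
x = rng.uniform(0, 10, size=n)
g = rng.integers(0, 2, size=n)          # 0/1 indicator for the group
y = 1.0 + 2.0 * x + 0.5 * g + 1.5 * g * x + rng.normal(0, 0.5, size=n)

# Design matrix: intercept, x, group indicator, and the interaction g * x.
X = np.column_stack([np.ones(n), x, g, g * x])
b0, b1, b2, b3 = np.linalg.lstsq(X, y, rcond=None)[0]

# The interaction coefficient is the difference in slopes between groups.
slope_group0 = b1
slope_group1 = b1 + b3
print(slope_group0, slope_group1)
```

Plotting the two lines `b0 + b1 * x` and `(b0 + b2) + (b1 + b3) * x` over the scatter then shows one fitted line per group, each with its own slope.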
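  5. If your categorical variable has more than two categories, how do you define the additional variables?
    With k categories, define k - 1 dummy (indicator) variables, one for each category other than the reference category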
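One common convention, sketched here as an assumption rather than as the card's original answer, is to keep one category as the reference level and create an indicator for each remaining category. In pandas this corresponds to `drop_first=True`; the "region" column is hypothetical.

```python
import pandas as pd

# Hypothetical variable with three categories.
region = pd.Series(["north", "south", "west", "south", "north"], name="region")

# drop_first=True drops the first level ("north"), which becomes the
# reference category; the other levels each get an indicator column.
indicators = pd.get_dummies(region, drop_first=True)

print(indicators.columns.tolist())
# ['south', 'west']
```

A row with zeros in every indicator column belongs to the reference category.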
  6. Main effect
    • A main-effects model with two predictors (no interaction term) fits the best plane through the data
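To make the "best plane" concrete, here is a small sketch on simulated data: with two predictors and no interaction, least squares estimates an intercept and one slope per predictor, which together define a plane. The true coefficients below are invented for the example.

```python
import numpy as np

rng = np.random.default_rng(2)

# Simulated data (assumption): y lies near the plane 1 + 2*x1 - 3*x2.
n = 200
x1 = rng.uniform(0, 10, size=n)
x2 = rng.uniform(0, 10, size=n)
y = 1.0 + 2.0 * x1 - 3.0 * x2 + rng.normal(0, 0.5, size=n)

# Main effects only: intercept, x1, x2 (no x1 * x2 column).
X = np.column_stack([np.ones(n), x1, x2])
coef = np.linalg.lstsq(X, y, rcond=None)[0]

print(coef)  # estimated [intercept, slope for x1, slope for x2]
```

Because there is no interaction column, the slope for x1 is the same at every value of x2, which is what makes the fitted surface flat.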
  7. How do you include categorical variables in a multiple regression model?
    Categorical variables can be included through dummy (indicator) variables
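As an illustration, indicator columns can sit next to a numeric predictor in the design matrix. The sketch below uses simulated data and plain least squares; the column names, group labels, and true effects are assumptions made for the example.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(3)

# Simulated data (assumption): three groups with different intercept shifts.
n = 300
df = pd.DataFrame({
    "x": rng.uniform(0, 10, size=n),
    "group": rng.choice(["a", "b", "c"], size=n),
})
shift = df["group"].map({"a": 0.0, "b": 2.0, "c": -1.0})
y = 1.0 + 0.5 * df["x"] + shift + rng.normal(0, 0.3, size=n)

# Indicators for "b" and "c"; "a" is the reference category.
D = pd.get_dummies(df["group"], drop_first=True).astype(float)

# Design matrix: intercept, numeric predictor, then the indicator columns.
X = np.column_stack([np.ones(n), df["x"].to_numpy(), D.to_numpy()])
beta = np.linalg.lstsq(X, y.to_numpy(), rcond=None)[0]

print(dict(zip(["intercept", "x"] + list(D.columns), beta.round(2))))
```

Each indicator coefficient estimates how far that group's intercept sits from the reference group's, holding x fixed.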
  8. Interaction effects
    Interaction effects allow the effect of one variable to depend on the value of another
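Written out for two predictors, with notation assumed here rather than taken from the cards, the interaction term makes the slope for x_1 a function of x_2:

```latex
y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2 + \varepsilon
\quad\Longrightarrow\quad
\frac{\partial\, \mathbb{E}[y]}{\partial x_1} = \beta_1 + \beta_3 x_2
```

When \beta_3 = 0 this reduces to the main-effects model, where the effect of x_1 is the constant \beta_1.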
Author
saucyocelot
ID
361297
Card Set
Big Data: week 5
Description
Week 5: modeling