Data Management, EDA, and Regression Analysis with 1969-2000 Major League Baseball Attendance

This article by James J. Cochran of Louisiana Tech University describes a dataset of Major League Baseball data from the 1969 through 2000 seasons and illustrates how the data can be used in a course-long project covering basic data management, the use of exploratory data analysis (EDA) to "clean" data, and the construction of regression models. The dataset includes variables such as runs scored, runs allowed, wins, losses, number of games behind the division leader, and attendance. The lesson is well suited to anyone interested in baseball statistics. The data is provided in .dat format.