Awesome Open Source
Awesome Open Source

CRAN_Status_Badge status Travis AppVeyor Build Status

wooldridge: 111 Data Sets for Econometrics

Economics students new to both econometrics and R may find the introduction to both challenging. However, if their text is "Introductory Econometrics: A Modern Approach, 6e" by Jeffrey M. Wooldridge, they are in luck!

The wooldridge package aims to lighten the task by easily loading any data set from the text. The package contains full documentation for each set and all data have been compressed to a fraction of their original size. Just install the package, load it, and call the data you wish to work with.

But wait...there's more! A vignette, Introductory Econometrics Examples✨, illustrates solutions to examples from each chapter of the text, offering a relevant introduction to econometric modelling with R. The vignette also includes an Appendix of helpful resources, such as Using R for Introductory Econometrics by Florian Hess.

While the original course companion site provides publicly available data sets for Eviews, Excel, and Stata commercial software, this package is the official R open source option. Using R while building a foundation in econometric modeling, not only saves learners a few units of currency, but also introduces them to software capable of scaling with the demands of modern statistical computing.

Note: All data sets are from the 6th edition (Wooldridge 2016, ISBN-13: 978-1-305-27010-7), which is compatible with all other editions.


One can Install wooldridge directly from The Comprehensive R Archive Network (CRAN), which depends on R >= 3.2.0.



It's always recommended that one read supporting documentation for data sets of interest. This becomes trivial with the wooldridge package:


Documentation includes Wooldridge's original source, data format, variable names and their descriptions, as well as page numbers where each set is referenced in the text. Some sets even contain additional notes suggesting related research projects or exploration.


Load the wooldridge package and use the data() function to bring the desired data set into the working environment. Data set names match those in the text. Once present in the working environment, modelling data is quick and easy, leaving learners with more time to focus on interpretation of results and general diagnostics.



wageModel <- lm(lwage ~ educ + exper + tenure, data = wage1)


Get A Weekly Email With Trending Projects For These Topics
No Spam. Unsubscribe easily at any time.
r (2,383
r-package (181
cran (99
econometrics (24