The National Survey on Drug Use and Health (NSDUH), a product of the Substance Abuse and Mental Health Services Administration (SAMHSA) under the U.S. Department of Health and Human Services, measures the use of illegal substances, the use and misuse of prescribed substances, substance use disorder and treatment, and mental health outcomes (Substance Abuse and Mental Health Services Administration 2022).
Downloadable data and documentation are freely available from SAMHSA (U.S. Department of Health and Human Services, Substance Abuse and Mental Health Services Administration, Center for Behavioral Health Statistics and Quality 2019) for research and statistical purposes. Documentation for the 2019 data can be found in the 2019 NSDUH Public Use File Codebook. See also Policies.
SAMHSA bears no responsibility for use of the data or for interpretations or inferences based upon such uses. Any analyses, interpretations, or conclusions reached herein are are only for the purpose of illustrating regression methods and are credited to the author, not to SAMHSA. The author makes no claim or implication that any inferences derived from these teaching datasets are valid estimates.
The teaching dataset
nsduh2019_adult_sub_rmph.RData includes a random subset of 1000 observations of adults, and variables that have been renamed for clarity. Sampling was done with replacement using sampling weights in order to approximate a nationally representative distribution. This sampling method is solely for the purpose of creating a teaching dataset to illustrate regression methods. Chapter 8 discusses analyzing data using the survey weights appropriately using the full dataset (
nsduh2019_rmph.RData). Chapter 9 uses the dataset
nsduh_mar_rmph.RData, derived from
nsduh2019_adult_sub_rmph.RData with some cases removed and some data values randomly set to missing, in an illustration of multiple imputation.
Creating the Teaching Datasets
To create the teaching datasets, do the following.
- Download the .zip file containing the 2019 R dataset found at 2019 Population Data.
- Extract the .RData file
NSDUH_2019.RDatafrom the .zip file.
- Download the R script files
NSDUH_2019 MI Simulation.Rfrom RMPH Resources.
- Run the R script file
NSDUH_2019 Process.Rto process the raw data and create the following teaching datasets:
- Place these
.Rdatafiles in your “Data” folder.
- Run the R script file
NSDUH_2019 MI Simulation.Rto process the raw data and create the following teaching datasets:
- Place this
.Rdatafile in your “Data” folder.
Rows and columns
These files have the following numbers of rows and columns:
##  56136 57
##  1000 54
##  843 5