This paper gives an introduction to some basic but critically important concepts of statistics and data analysis for the SAS programmer who pulls or manipulates data, but who might not understand what goes into a proper data analysis. We first introduce some basic ideas of descriptive statistics for one-variable data and then expand those ideas into many variables. We then introduce the idea of statistical significance and then conclude with how all these ideas can be used to answer questions about the data. Examples and SAS® code are provided.
Analyzing rare events like disease incidents, natural disasters, or component failures requires specialized statistical techniques since common methods like linear regression (PROC REG) are inappropriate. In this paper, we’ll first explain what it means to use a statistical model, then explain why the most common one (linear regression) is inappropriate for rare events. Then we’ll introduce the most basic statistical model for rare events: Poisson regression (using PROC GENMOD or PROC COUNTREG).