SASFrequency DistributionSAS frequency distribution is the most commonly used statistical procedure in SAS programming. Descriptive statistics such as mean and standard deviation can be used for continuous variables to summarize the data, but in the case of Categorical variables, Descriptive statistic is not appropriate. We can summarize Categorical variables by using a frequency table, but Before proceeding, let's first understand the categorical variable. Categorical Variable:Purely Categorical variables are those that allow us to assign categories, but not let to arrange them in a particular sequence. In other words, a Categorical variable is one which has two or more categories, but there is no sequence of these categories. For example, gender is a Categorical variable, which holds two categories i.e., male and female, but there is no sequence of both categories. Another example is hair color, which is also considered as a Categorical variable as it holds several categories, i.e., black, brown, brunette, red, etc. again, there is no sequence in which we can arrange it. Frequency Table:Frequency table displays the number and percentage of cases observed for each category of a variable. Each entry in the frequency table contains the count of the occurrences of values in a particular interval. Now, let's discover how we can implement frequency distribution operations in SAS programming language. PROC FREQWe can calculate the frequency distribution of observations of a variable by using the PROC FREQ procedure. Syntax: Where,
Use of SAS PROC FREQThe primary purpose of PROC FREQ is that we can calculate the frequency of any userdefined or already available or SAS inbuilt help library data set. In order to calculate the frequency distribution, we only need to specify the name of the dataset. Example Execute the following code in SAS Studio: Output: Single Variable Frequency DistributionFrequency distribution of a single variable can be determined by using the PROC FREQ procedure. After implementing the PROC FREQ procedure, the result will show the frequency of each contained observation of the variable. It will also show the percentage distribution, cumulative frequency, and cumulative percentage. Example:In this example, we are going to calculate the frequency distribution of variable "horsepower" for the dataset CARS1, which is created from already available data set CARS of SAS help Library. Execute the following code in SAS Studio: Output: Multiple Variable Frequency DistributionFrequency distribution of multiple variables can be determined by using the PROC FREQ procedure. After implementing the PROC FREQ procedure, the result will show the frequency of each contained observation of all variables. It will also show the percentage distribution, cumulative frequency, and cumulative percentage for all variables. Let's understand the implementation of this process through an example. Example:In this example, we are going to calculate the frequency distribution of two variables, the first variable is make, and another is type. Both variables belong to the dataset CARS1, which is created from already available data set CARS of SAS help Library. Execute the following code in SAS Studio: Output: Frequency Distribution with WeightIn the frequency distribution, Weight option is used to calculate the biased frequency distribution with the weight of the variable. ExampleIn the below example, we are going to calculate the frequency distribution of the variable make and type with the weight assigned in horsepower. Let's understand through the code: When we execute the above code in SAS Studio, we will get the following output:
Next TopicSASStandard deviation
