What is SAS ?

What is SAS ?


18 August 2020

What is SAS? 

  • Originally an acronym for Statistical Analysis System 
  • Provided by SAS institute since the 1970s 
  • A software used for statistical analysis, graphing, and presenting data

SAS is the leading business analytics software used in a variety of business domains such as Insurance, HealthCare, Pharmacy, Telecom etc. Very often, business analysts and other professionals with little or no programming experience are required to learn SAS. This paper will explain the key concepts and the minimum knowledge required to get started with SAS. The information presented in this paper will enable new SAS users to get up and running in a very short amount of time.

What can we do with SAS?
 
SAS is an integrated software suite for advanced analytics, business intelligence, data management, and predictive analytics. You can use SAS software through both a graphical interface and the SAS programming language, or Base SAS.
With SAS software, you can complete these tasks:
 
access data in almost any format, including SAS tables, Microsoft Excel tables, and database files.
manage and manipulate your existing data to get the data that you need. For example, you can subset your data, combine it with other data, and create new columns.
analyze your data using statistical techniques ranging from descriptive measures like correlations to logistic regression and mixed models to sophisticated methods such as modern model selection and Bayesian hierarchical models.
present the results of your analyses in a meaningful report that you can share with others. The reports that you create can be saved in a wide variety of formats, including HTML, PDF, and RTF.
 
 
 
1. SAS AND ITS RULES
1.1 WHAT IS SAS? SAS stands for Statistical Analysis System.
Useful for the following types of task:
  •  Data entry, retrieval, and management
  •  Report writing and graphics
  •  Statistical and mathematical analysis
1.2 UNDERSTANDING TERMS IN A SAS DATA STRUCTURE
  •  DATA VALUE: The basic unit of information.
  •  VARIABLE: A set of data values that describes a given attribute. 2 main types of variable types: numeric and character
  •  OBSERVATION: All the data values associated with a particular record.
  •  DATA SET: It is a collection of observation. 
  1. Figure: 

1.3 RULES FOR SAS NAMES

  •  SAS variable/variable names must be between 1 and 32 characters long.
  •  The first character must be a letter or an underscore.
  •  Characters after the first may be letters, digits or underscores. Example: NAME, _NAME_, FILE1, _NULL_ etc. 1.4 RULES FOR SAS STATEMENTS
  •  SAS statements may begin in any column of the line.
  •  SAS statements end with a semicolon (;)
  •  Some SAS statements consist of more than one line of commands.
  •  A SAS statement may continue over more than one line. One or more blanks should be placed between items in SAS statements. If the items are special characters such as '=', '+', '$', the blanks are not necessary.

There are two major building blocks in SAS.

DATA step

SAS built in procedures 2. DATA STEP

Used for: Names the SAS data set and creates the dataset. In a Data step the following can be performed.

◦ defining the variables ◦ read input files

◦ assign values to the variables,

◦ creating new variables,

◦ merging two data sets

◦ formatting and labeling variables

◦ assignment of missing values.

If a variable is used in a SAS program but not initialized then SAS automatically assign a missing value to it. Numeric missing values are represented by a single period (.). Character missing values are represented by a single blank enclosed in quotes (' ').

Syntax: DATA ; The name should be 1-32 characters and must begin with a letter or underscore. Example: DATA EMPDAT; The following SAS statements can be used in a DATA step.