Vous êtes sur la page 1sur 35

SAS Analytics, Data analysis using SAS

SAS Analytics, Data analysis Duration:60 hours course fee :Rs 30,000/More detail visit www.iisastr.com phone no:9312506496 Course content 1. Introduction to the SAS Language 1.1 Introduction 1.2 Basic Language: Rules and Syntax 1.3 Creating SAS Data Sets 1.4 The INPUT Statement 1.5 SAS Data Step Programming Statements Their Uses 1.6 Data Step Processing 1.7 More on INPUT Statement 1.7.1 Use of Pointer controls 1.7.2 The trailing@ line-hold specifier 1.7.3 The trailing@ @ line-hold specifier 1.7.4 Uses of RETAIN statement 1.7.5 The use of line pointer controls 1.8 Using SAS Procedures using SAS

2. More on SAS Programming and some Applications 2.1 More on the DATA and PROC STEPS 2.1.1 Reading data from files 2.1.2 Combing SAS data sets 2.1.3 Saving and retrieving permanent SAS data Sets 2.1.4 User- defined in formats and formats 2.1.5 Creating SAS data sets in procedure steps 2.2 SAS Procedures for Computing Statistics 2.2.1 The UNIVARIATE procedures 2.2.2 The FREQ procedure 2.3 Some Useful Base SAS Procedures 2.3.1 The PLOT procedures 2.3.2 The CHART procedures 2.3.3 The TABULATE procedure 3. Statistical Graphics Using SAS / GRAPH 3.1 INTRODUCTINS

3.2 An INTRODUCTINS to SAS / GRAPH 3.2.1 Useful SAS / GRAPH procedures GPLOT procedure GCHART procedure 3.2.2 Writing SAS/Graph programs 3.3 Quantile Plots 3.4 Empirical Quantile- Quantile Plots 3.5 Theoretical Quantile- Quantile Plots or probability Plots 3.6 Profile Plots of Means or Interaction Plots 3.7 Two Dimensional Scatter Plots and Scatter Plot Matrices 3.7.1 Two Dimensional Scatter Plots 3.7.2 Scatter plots Matrices 3.8 Histograms Bar Charts and Pie Charts 3.9 Other SAS Procedures for High- resolutions Graphics 4. Statistical Analysis of Regression Models 1. An Introduction to Simple Linear Regression 4.1.1 Simple linear regression using PROC REG 4.1.2 Lack of fit test using PROC ANOVA 4.1.3 Diagnostics use of case statistics 4.1.4 Predictions of new y values using regressions 4.2 An Introduction to Multiple Regression Analysis 4.2.1 Multiple regression analysis using PROC REG 4.2.2 Case Statistics and residual analysis 4.2.3 Residual Plots 4.2.4 Examining relationships among regression variables 4.3 Types of Sums of Squares Computed in PROC REG and PROC GLM 4.3.1 Model comparison technique and extra sum of squares 4.3.2 Types of sums of squares in SAS 4.4 Subset selection using PROC REG for Model selection 4.4.1 Subset selection using PROC REG 4.4.2 Other options available in PROC REG for model selection 4.5 Inclusion of squared Terms Product terms in Regression Models 4.5.1 Including interaction terms in the model 4.5.2 Comparing slopes of regression lines using interaction 4.5.3 Analysis of models with higher-order terms with PROC REG 5. Analysis of Variance Model 5.1 Introduction 5.1.1 Treatment Structure

5.1.2 Experimental Designs 5.1.3 Linear Models 5.2 One-way Classification 5.2.1 Using PROC ANOVA to analyze one-way Classifications 5.2.2 Making preplanned (or a priori) comparisons using PROC GLM 5.2.3 Testing orthogonal polynomials using contrasts 5.3 One-Way Analysis of Covariance 5.3.1 Using PROC GLM to perform one-way covariance analysis 5.3.2 One-way covariance analysis: Testing for equal slopes 5.4 A two Factorial in a Completely Randomized Design 5.4.1 Analysis of a two-way factorial using PROC GLM 5.4.2 Residual Analysis of Interaction 5.5 Two-Way Factorial: Analysis of Interaction 5.6 Two-Way Factorial: Unequal Sample sizes 5.7 Two way Classification: Randomized Complete Block Design 5.7.1 Using PROC GLM to analyze a RCBD 5.7.2 Using PROC GLM to test for non additivity 6. Analysis of Variance: Random and Mixed Effects Models 6.1 Introduction 6.2 One-way Random Effects Model 6.2.1 Using PROC GLM to analyze one-way Random Effects Models 6.2.2 Using PROC MIXED to analyze one-way Random Effects Models 6.3 Two way Crossed Random Effects Model 6.3.1 Using PROC GLM and PROC MIXED to analyze two way Crossed Random Effects Model 6.3.2 Randomized complete block design: Blocking when treatment factors are random 6.4 Two-Way Nested Random Effects Model 6.4.1 Using PROC GLM to analyze two-way nested random effects models 6.4.2 Using PROC MIXED to analyze two-way Nested Random Effects Models 6.5 Two-way Mixed Effects Models 6.5.1 Two-way Mixed Effects Models: Randomized Complete Blocks Design 6.5.2 Two-way Mixed Effects Models: Crossed Classification 6.5.3 Two-way Mixed Effects Models: Nested Classification 6.6 Models with Random and Nested Effects for More Complex Experiments 6.6.1 Models for nested factorials 6.6.2 Models for split-plot experiments 6.6.3 Analysis of split-plot experiments using PROC GLM 6.6.4 Analysis of split-plot experiments using PROC MIXED Logistic Regression

Factor Analysis(Principal component)

Fees: INR 11,000


SAS is one of the worlds most widely used statistical software. With our SAS training learn how to read databases, conduct variable transformations and simple statistical analyses, and how to integrate SAS with other databases. One of our most popular courses, ATI has trained in excess of 4000 students in SAS over the last 5 years

o o o o o o o o o o o o

Introduction to Analytics Introduction to SAS, GUI Types of Libraries, Creating Variable Attributes Name, Type, Format, Informat, Label Introduction to Data steps and Proc steps DATA Understanding Reading, Importing, Exporting and Copying Data Conditional Statements (Where, If, If then Else) Appending, Merging and Sorting Datasets Proc steps like " Proc Means, Proc Freq, Proc Sort Output Delivery System (ODS) SAS Functions and Options List Input, Delimiters, Reading missing Values, and non standard values Do loops Generating Data Execution Output Statements Nesting Do loops Do While and Do Until Statement Arrays Dimensions Array elements and Range Proc report Introduction to Data base, Relational Data base concepts Proc SQL, Data integrity Constraints, Creating table and Inserting Values Proc SQL codes to Retrieve & Summarize data Group, Sort & Filter Using Joins

o o o o

Indexes Macros Defining and calling a macro Macro Parameters and Variables Global and Local Variables

Analyst
Progressive Digital Media Group
October 2011 February 2013 (1 year 5 months) Hyderabad Research: To identify and monitor competitors and research market conditions by constantly tracking the changes in the industry. To analyze and incorporate customer preferences, demographics and buying habits, that influences the market dynamics. To undertake secondary research where data is collected from trade magazines, trade associations, statistical offices, company statements and trade departments, amongst other sources. Interpret findings and draw conclusions from preliminary research. To analyze the key market metrics through data sanity checks and verifies if it is in trend with the market. To initiate and complete the permission process for sourcing and attribution of information whenever necessary. To conduct cross-country comparison and also check if the data is in line. Authoring reports address client centric requirements. Analytics: Running descriptive statistics and statistical analysis including T-Test, ANOVA, Time Series Forecasting, Regression and Cluster Analysis by using SAS and SPSS Import and Export of Data, Variable creation, String and Date function, keep and Drop variables. Conditional execution (if then, Else if, Else, do loop, where conditions), Macro, Array, Sorting, Merging by using SAS and SPSS Comprehensive knowledge of Advanced Excel including conditional Formatting, Advance filtering, Cell Referencing, Text functions, What if Analysis, Vlookup, If statements, Data validation, Pivot Table, Solver, Charts, Macro, VBA, etc. Comprehensive knowledge of MS PowerPoint including building blocks of a presentation, themes and styles, tables, charts, graphs and smart art, animation Gained comprehensive knowledge of SQL including Basic structures SQL, basic commands, Grouping, Ordering, Union, Intersection, Different types of Joining (natural join, outer join) etc Sound understanding of strategic tools like Porters 5 forces, SWOT, PEST, BCG, GE, Value chain analysis, etc

Adeptness in Analytics using Traditional and Advanced Statistical applications in Banking, Insurance, Market Research, CPG/Retail, Hi-Technology. International Exposure: Interacting with Top Executive Management Worked with popular Marketing scientists around the Globe. Trained in R programming & Analytics in Minneapolis, MN, USA. Presenter/Co-Author in multiple International Conferences Traditional/Advanced Analytic Methods: Linear, Non Linear, Logistic, Probit, Multinomial Logit, Log Linear Regression, Pricing Analysis (with/out competitive context), Ranking Analysis/Preference scores, Text Perception, Image Perception/Advertising Research, Shapley Value/Game Theory, different Product Optimization Techniques which suits to real marketing decisions, Cannibalisation of Products, Random Forest, Decision Trees, CART, Key Driver Analysis, Derived Importance , Brand Association & Mapping techniques. Analytics services and Product Development Technical Expertise: R, hadoop, mapreduce, SSRS, SPSS, SAS/SQL, SawTooth, Excel, Quantum, Quanvert. Extensive knowledge in Programming, Data Analytics, Data Mining & Weighting using different programming tools Client Interaction: Interaction with Top Executive Management Global client interaction, Analysis findings, recommendations and preparing Executive summaries, explaining complex findings in layman terms Management & Training: Key driver in motivating team towards the success of Organisation People Management Basic and Advanced Analytic Methods Trainer

R Programming Trainer Cross Functional Trainer

SAS/BASE

HISTORY OF SAS --------------------------------------------------------------------------------------------1 BASIC THEORY & RULES ETC...--------------------------------------------------------------------------1 SAS WINDOWING ENVIRONMENT-----------------------------------------------------------------------2 CREATION OF LIBRARIES-----------------------------------------------------------------------------------2 SAS PROGRAMMING (BRIEFLY-----------------------------------------------------------------------------2 - DATASTEP - PROC STEP WAYS TO READ DATA (BRIEFLY) ----------------------------------------------------------------------2 BACK END PROCESS------------------------------------------------------------------------------------------2 INSTALLATION PROCESS ----------------------------------------------------------------------------------3
DATASTEP

INFORMATS & FORMATS------------------------------------------------------------------------------------4 DATA & OPTIONS---------------------------------------------------------------------------------------------4 INFILE & OPTIONS--------------------------------------------------------------------------------------------5 INPUT & OPTIONS--------------------------------------------------------------------------------------------6, 7 DATALINES & OPTIONS-------------------------------------------------------------------------------------8 DATES-------------------------------------------------------------------------------------------------------------8 SUM STATEMENT----------------------------------------------------------------------------------------------9 ATTRIB STATEMENT & INDIVIDUAL ATTRIBUTES--------------------------------------------------9 RETAIN STATEMENT----------------------------------------------------------------------------------------9 IF, IF/THEN/ELSE STATEMENT--------------------------------------------------------------------------10 WHERE STATEMENT----------------------------------------------------------------------------------------10 DO, DO ITERATIVE, DO UNTIL & DO WHILE -------------------------------------------------------11 CONCATENATION & INTEREAVING---------------------------------------------------------------------12 MERGE & UPDATE--------------------------------------------------------------------------------------------12
FUNCTIONS

- DATE FUNCTIONS-----------------------------------------------------------------------------------------13, 14 - CHARACTER FUNCTIONS -------------------------------------------------------------------------------15, 16 - NUMERIC FUNCTIONS------------------------------------------------------------------------------------17, 18 ARRAYS---------------------------------------------------------------------------------------------------------19, 20 OTHERS MISSING CONCEPTS----------------------------------------------------------------------------21 REVIEW OF DATASTEP------------------------------------------------------------------------------------22 EXAM & DISCUSSION---------------------------------------------------------------------------------------23
PROC STEP UTILITY PROCEDURES

- PROC IMPORT-----------------------------------------------------------------------------------------------24

- PROC EXPORT-----------------------------------------------------------------------------------------------24 - PROC CONTENTS-------------------------------------------------------------------------------------------24 - PROC APPEND-----------------------------------------------------------------------------------------------25 - PROC SORT--------------------------------------------------------------------------------------------------25 - PROC COPY--------------------------------------------------------------------------------------------------26 - PROC DATASETS-------------------------------------------------------------------------------------------26 - PROC COMPARE---------------------------------------------------------------------------------------------26 - PROC TRANSPOSE------------------------------------------------------------------------------------------27 - PROC FORMAT-----------------------------------------------------------------------------------------------27 - PROC SETINIT-----------------------------------------------------------------------------------------------27 - PROC PWENCODE -----------------------------------------------------------------------------------------28 - PROC OPTIONS --------------------------------------------------------------------------------------------28 - PROC RANK -------------------------------------------------------------------------------------------------28 - PROC CPORT------------------------------------------------------------------------------------------------29 - PROC CIMPORT---------------------------------------------------------------------------------------------29 - PROC ----------------------------------------------------------------------------------------REPORTING PROCEDURES

- PROC PRINT-------------------------------------------------------------------------------------------------30 - PROC TABULATE------------------------------------------------------------------------------------------30 - PROC REPORT----------------------------------------------------------------------------------------------31 - _NULL_ REPORT-------------------------------------------------------------------------------------------32


SAS/GRAPHS & ODS CONCEPTS

- PROC GPLOT------------------------------------------------------------------------------------------------33 - PROC GCHART----------------------------------------------------------------------------------------------33 - ODS------------------------------------------------------------------------------------------------------------34


SAS/STAT

- BASICS OF STATISTICS---------------------------------------------------------------------------------35
STATISTICAL PROCEDURES

- PROC MEANS -----------------------------------------------------------------------------------------------36 - PROC SUMMARY--------------------------------------------------------------------------------------------36 - PROC UNIVARIATE ---------------------------------------------------------------------------------------37 - PROC FREQ -------------------------------------------------------------------------------------------------37 - PROC CORR -------------------------------------------------------------------------------------------------38 - PROC REG----------------------------------------------------------------------------------------------------38 - PROC ANOVA-----------------------------------------------------------------------------------------------38 - T-TEST -------------------------------------------------------------------------------------------------------39 - Chi SQUARE -------------------------------------------------------------------------------------------------39 - HYPOTHESIS------------------------------------------------------------------------------------------------40 - OTHER STATISTICAL PROCEDURES ----------------------------------------------------------------41 REVIEW OF PROC STEP-------------------------------------------------------------------------------------42 EXAM & DISCUSSION ---------------------------------------------------------------------------------------43

SAS/ACCESS

BASIC THEORY OF SAS/ACCESS ------------------------------------------------------------------------44 WAYS TO ACCESS DATA FROM DIFFERENT SOURCES LIKE NOTEPADS, EXCEL SHEETS, MS-ACCESS, ORACLE, DB2, SYBASE AND TERRADATA etc - PROC IMPORT --------------------------------------------------------------------------------------------- () - INFILE STATEMENT-------------------------------------------------------------------------------------- () - SQL PASS THRU QUERY---------------------------------------------------------------------------------45 - LIBNAME STATEMEN -------------------------------------------------------------------------------------45 - PROC DBLOAD----------------------------------------------------------------------------------------------46 - PROC ACCESS-----------------------------------------------------------------------------------------------46 ---------------------------------------------------------------------------------------47, 48, 49, 50, 51, REVIEW, EXAM & DISCUSSION --------------------------------------------------------------------------52 SAS/MACROS---------------------------------------------------------------------------------------------53, 54, 55, 56, 57 REVIEW, EXAM & DISCUSSION---------------------------------------------------------------------------58
SQL (PROC SQL)

PERFORMANANCE TUNING---------------------------------------------------------------------------------59 TESTING/DEBUGGING TECHNIQUES------------------------------------------------------------------- 60


PROJECT (REAL TIME)

-----------------------------------------------------------------------------1MONTH

PRACTICE CODING ON REAL TIME DATA INTERVIEW QUESTIONS RESUME PREPARATION WRITTEN TESTS MOCK INTERVIEWS CONSULTIN

Fees: INR 16,500


The Advanced Analytics course introduces participants to fundamental concepts of statistics, and guides them all the way to building predictive models using multiple linear and logistic regressions. All the topics are explained with the help of hands-on practice with live case studies and data, enabling a deeper understanding of the underlying concepts of analytics. Introduction to analytics

o o o o

Need for analytics Analytics use in different industries Challenges in adoption of analytics Overview of Course Contents Data understanding

o o o o o o o o o o o o o o o o

Data types (Nominal, Ordinal, Interval and Ratio) Descriptive statistics Tabular & Graphical Method Summary statistics Introduction to some statistical terminologies and inferences Population, Sample and Random variables Point and Interval Estimations Probability Discrete/Continuous Probability Distributions Hypothesis Testing Importance of formulating and validating the hypothesis Formulation of hypothesis (Null and alternate) Testing association and differences Statistical significance and test statistic Level of significance Z-Test, T-Test, Chi-Square test, ANOVA Parametric & Non-Parametric test Correlation & Regression Linear Regression Case Study on Multiple Regression Logistic Regression Case Study on Logistic Regression Cluster Analysis Case Study on Cluster Analysis Factor Analysis Case Study on Factor Analysis

programming for SAS 9


Item 1 The following program is submitted.
data WORK.TEST; input Name $ Age; datalines; John +35 ; run;

Which values are stored in the output data set?

A. B. C. D. E. F. G. H. I.

Name

Age

--------------------John Name 35 Age

--------------------John Name (missing value) Age

--------------------(missing value) (missing value)

J. The DATA step fails execution due to data errors.


correct_answer = "A" Item 2 Given the SAS data set WORK.ONE:
Id --182 190 250 720 Char1 ----M N O P

and the SAS data set WORK.TWO:


Id --182 623 720 Char2 ----Q R S

The following program is submitted:


data WORK.BOTH; merge WORK.ONE WORK.TWO; by Id; run;

What is the first observation in the SAS data set WORK.BOTH?


A. B. C. Id --182 Char1 ----M Char2 -----

D.
E. F. G. Id --182 Char1 ----Char2 ----Q

H.
I. Id Char1 Char2

J. K.

--182

----M

----Q

L.
M. N. O. Id --720 Char1 ----P Char2 ----S

P. correct_answer = "C" Item 3 Given the text file COLORS.TXT:


----+----1----+----2----+---RED BLUE CYAN GRAY ORANGE INDIGO WHITE BROWN YELLOW PURPLE FUCSIA PINK GREEN VIOLET BLACK MAGENTA

The following SAS program is submitted:


data WORK.COLORS; infile 'COLORS.TXT'; input @1 Var1 $ @8 Var2 $ @; input @1 Var3 $ @8 Var4 $ @; run;

What will the data set WORK.COLORS contain?


A. B. C. D. E. F. Var1 -----RED BLUE CYAN GRAY Var2 -----ORANGE INDIGO WHITE BROWN Var3 -----RED BLUE CYAN GRAY Var4 -----ORANGE INDIGO WHITE BROWN

G.
H. I. J. K. Var1 -----RED CYAN Var2 -----ORANGE WHITE Var3 -----BLUE GRAY Var4 -----INDIGO BROWN

L.
M. N. O. P. Var1 -----RED BLUE Var2 -----ORANGE INDIGO Var3 -----YELLOW PURPLE Var4 -----GREEN VIOLET

Q.
R. S. Var1 -----Var2 -----Var3 -----Var4 ------

T. U. V. W.

RED BLUE CYAN GRAY

ORANGE INDIGO WHITE BROWN

YELLOW PURPLE FUCSIA PINK

GREEN VIOLET BLACK MAGENTA

X. correct_answer = "A" Item 4 Given the SAS data set WORK.INPUT:


Var1 -----A A B C A Var2 ------one two three four five

The following SAS program is submitted:


data WORK.ONE WORK.TWO; set WORK.INPUT; if Var1='A' then output WORK.ONE; output; run;

How many observations will be in data set WORK.ONE? Enter your numeric answer. Do not add leading or trailing spaces to your answer.

correct_answer = "8" Item 5 The following SAS program is submitted:


data WORK.LOOP; X = 0; do Index = 1 to 5 X = Index; end; run; by 2;

Upon completion of execution, what are the values of the variables X and Index in the SAS data set named WORK.LOOP?

A. B. C. D.

X = 3, Index = 5 X = 5, Index = 5 X = 5, Index = 6 X = 5, Index = 7 correct_answer = "D" Item 6 The following SAS program is submitted:

proc format; value score 1 - 50 = 'Fail' = 'Pass';

51 - 100 run;

Which one of the following PRINT procedure steps correctly applies the format?
A. B. C. D. proc print data = SASUSER.CLASS; var test; format test score; run;

E.
F. G. H. I. proc print data = SASUSER.CLASS; var test; format test score.; run;

J.
K. L. M. proc print data = SASUSER.CLASS format = score; var test; run;

N.
O. P. Q. proc print data = SASUSER.CLASS format = score.; var test; run;

R. correct_answer = "B" Item 7 This item will ask you to provide a line of missing code; The SAS data set WORK.INPUT contains 10 observations, and includes the numeric variable Cost. The following SAS program is submitted to accumulate the total value of Cost for the 10 observations:

data WORK.TOTAL; set WORK.INPUT; <insert code here> Total=Total+Cost; run;

Which statement correctly completes the program?


A. B. C. D. keep Total; retain Total 0; Total = 0; If _N_= 1 then Total = 0;

correct_answer = "B" Item 8 This question will ask you to provide a line of missing code. Given the following data set WORK.SALES:
SalesID ------W6790 W7693 W1387 SalesJan -------50 25 . FebSales -------400 100 300 MarchAmt -------350 125 250

The following SAS program is submitted:


data WORK.QTR1; set WORK.SALES; array month{3} SalesJan FebSales MarchAmt; <insert code here> run;

Which statement should be inserted to produce the following output?


SalesID ------W6790 W7693 W1387 SalesJan -------50 25 . FebSales -------400 100 300 MarchAmt -------350 125 250 Qtr1 ---800 250 550

A. B. C. D.

Qtr1 = sum(of month{_ALL_}); Qtr1 = month{1} + month{2} + month{3}; Qtr1 = sum(of month{*}); Qtr1 = sum(of month{3}); correct_answer = "C" Item 9 Given the following SAS error log

44 45 46 47

data WORK.OUTPUT; set SASHELP.CLASS; BMI=(Weight*703)/Height**2; where bmi ge 20;

ERROR: Variable bmi is not on file SASHELP.CLASS. 48 run;

What change to the program will correct the error? A. B. C. D. Replace the WHERE statement with an IF statement Change the ** in the BMI formula to a single * Change bmi to BMI in the WHERE statement Add a (Keep=BMI) option to the SET statement correct_answer = "A" Item 10 The following SAS program is submitted:
data WORK.TEMP; Char1='0123456789'; Char2=substr(Char1,3,4); run;

What is the value of Char2? A. B. C. D. 23 34 345 2345 correct_answer = "D"

SAS Advanced Programming for SAS 9


Item 1 Given the following SAS data sets ONE and TWO:

The following SAS program is submitted:


proc sql; select one.*, sales from one right join two on one.year = two.year; quit;

Which one of the following reports is generated?

A.

B.

C.

D. correct_answer = "D" Item 2 Given the following SAS data sets ONE and TWO:

The following SAS program is submitted creating the output table THREE:
data three; merge one (in = in1) two (in = in2); by num; run;

Which one of the following SQL programs creates an equivalent SAS data set THREE?
A. B. C. D. proc sql; create table three as select * from one full join two

E. F. quit;

where one.num = two.num;

G.
H. I. J. K. L. M. N. O. P. Q. R. S. T. U. V. W. X. Y. Z. AA. quit; quit; proc sql; create table three as select coalesce(one.num, two.num) as NUM, char1, char2 from one full join two on one.num = two.num; quit; proc sql; create table three as select one.num, char1, char2 from one full join two on one.num = two.num; proc sql; create table three as select coalesce(one.num, two.num) as NUM, char1, char2 from one full join two where one.num = two.num;

correct_answer = "D" Item 3 The following SAS program is submitted:


%let type = RANCH; proc sql; create view houses as select * from sasuser.houses where style = "&type"; quit;

%let type = CONDO;

proc print data = houses; run;

The report that is produced displays observations whose value of STYLE are all equal to RANCH.

Which one of the following functions on the WHERE clause resolves the current value of the macro variable TYPE? A. B. C. D. GET SYMGET %SYMGET &RETRIEVE correct_answer = "B" Item 4 The SAS data set SASDATA.SALES has a simple index on the variable DATE and a variable named REVENUE with no index. In which one of the following SAS programs is the DATE index considered for use?
A. B. C. D. E. F. G. H. I. J. K. L. M. N. proc print data = sasdata.sales; by date; run; proc print data = sasdata.sales; where month(date) = 3; run; data march; set sasdata.sales; if '01mar2002'd < date < '31mar2002'd; run; data march; set sasdata.sales; where date < '31mar2002'd or revenue > 50000; run;

correct_answer = "A"

Clinical Trials Programming Using SAS 9


Item 1 What is the main focus of Good Clinical Practices (GCP)? A. B. C. D. harmonized data collection standard analysis practices protection of subjects standard monitoring practices correct_answer = "C" Item 2 Vital Signs are a component of which SDTM class?

A. B. C. D.

Findings Interventions Events Special Purpose correct_answer = "A" Item 3 Which option in the PROC EXPORT procedure overwrites an existing file?

A. B. C. D.

NEW OVERWRITE REPLACE KEEP correct_answer = "C" Item 4 Given the following data set WORK.DEMO:
PTID 689574 423698 758964 653347 493847 500029 513842 515151 522396 534787 875642 879653 542369 698754 656423 785412 785698 763284 968743 457826 M F F F M M F F M M F F F F M M M M M M Sex Age 15 14 12 14 14 12 12 15 13 12 11 15 12 11 16 12 16 11 14 18 Height 80.0 65.5 60.3 62.8 63.5 57.3 59.8 62.5 62.5 59.0 51.3 75.3 56.3 50.5 72.0 67.8 72.0 57.5 60.5 74.0 Weight 115.5 90.0 87.0 98.5 102.5 83.0 84.5 112.5 84.0 99.5 50.5 105.0 77.0 70.0 150.0 121.0 110.0 85.0 85.0 165.0

The following SAS program is submitted:


proc print data=WORK.DEMO(firstobs=5 obs=10); where Sex='M'; run;

How many observations will be displayed?

A. B. C. D.

4 6 7 8 correct_answer = "B" Item 5 Given the following partial data set:


SUBJID 101 103 106 107 SAF 1 1 1 1 ITT . 1 1 . OTH 1 1 1 1

The following SAS program is submitted:


proc format; value stdypfmt 1="Safety" 2="Intent-to-Treat" 3="Other"; run;

data test; set temp (keep=SUBJID ITT SAF OTH ); by subjid; length STDYPOP $200; array pop{*} SAF ITT OTH ; do i=1 to 3; if STDYPOP="" and pop{i}=1 then STDYPOP=put(i, stdypfmt.); else if STDYPOP^="" and pop{i}=1 then STDYPOP = trim(STDYPOP)||"/"||put(i, stdypfmt.); end; run;

What is the value of STDYPOP for SUBJID=107?

correct_answer = "Safety/Other"
Item 6 This question will ask you to provide a line of missing code. Given the data set WORK.STUDYDATA with the following variable list:
# 2 3 Variable DAY DIABP Type Char Num Len 8 8 Label Study Day Diastolic Blood Pressure

TRT

Char

Treatment

The following SAS program is submitted:


proc means data=WORK.STUDYDATA noprint; <insert code here> class TRT DAY; var DIABP; output out=WORK.DIAOUT mean=meandp; run;

WORK.DIAOUT should contain:


A. B. C. D.

the mean diastolic blood pressure values for every day by treatment group the overall mean diastolic blood pressure for each treatment group Which statement correctly completes the program to meet these requirements? where trt or trt*day; types trt trt*day; by trt day; id trt day; correct_answer = "B" Item 7 The following SAS program is submitted:
%let member1=Demog; %let member2=Adverse; %let Root=member; %let Suffix=2; %put &&&Root&Suffix;

What is written to the SAS log? A. B. C. D. &member2 Adverse &&&Root&Suffix WARNING: Apparent symbolic reference ROOT2 not resolved. correct_answer = "B" Item 8 This question will ask you to provide a line of missing code. The following SAS program is submitted:
proc format ; value dayfmt 1='Sunday' 2='Monday' 3='Tuesday'

4='Wednesday' 5='Thursday' 6='Friday' 7='Saturday' ; run ;

proc report data=diary ; column subject day var1 var2 ; <insert code here> run ;

In the DIARY data set, the format DAYFMT is assigned to the variable DAY. Which statement will cause variable DAY to be printed in its unformatted order? A. B. C. D. define day / order define day / order define day / order define day / order correct_answer = "D" 'Day' ; order=data 'Day' ; noprint 'Day' ; order=internal 'Day' ;

Item 9 You are using SAS software to create reports that will be output in a Rich Text Format so that it may be read by Microsoft Word. The report will span multiple pages and you want to display a '(Continued)' text at the end of each page when a table spans multiple pages. Which statement can you add to the SAS program to ensure the inclusion of the '(Continued)' text? A. B. C. D. ods rtf file='report.rtf'; ods tagsets.rtf file='report.rtf'; ods tagsets.rtf file='report.rtf' break='Continued'; ods file open='report.rtf' type=rtf break='(Continued)'; correct_answer = "B" Item 10 What is the primary purpose of programming validation? A. Ensure that the output from both the original program and the validation program match. B. Efficiently ensure any logic errors are discovered early in the programming process. C. Justify the means used to accomplish the outcome of a program and ensure its accurate representation of the original data. D. Document all specifications pertaining to programmed output and ensure all were reviewed during the programming process. correct_answer = "C"

Predictive Modeling Using SAS Enterprise Miner 7


Item 1

Open the diagram labeled Practice A within the project labeled Practice A. Perform the following in SAS Enterprise Miner: 1. Set the Clustering method to Average. 2. Run the Cluster node. Use this project to answer the next two questions: What is the Importance statistic for MTGBal (Mortgage Balance)? 0.32959 0.42541 0.42667 1.000000 correct_answer = "C" You must change the clustering method to average and run the cluster node first. Select view results and look in the output window and view the Variable Importance results. What is the Cubic Clustering Criterion statistic for this clustering? 5.00 14.69 5862.76 67409.93 correct_answer = "B" Run the diagram flow and view the results. From the results window, select View -> Summary Statistics -> CCC Plot and mouse over where the data point and the line intersect. This will display the CCC statistic.

A. B. C. D.

A. B. C. D.

Item 2 1. Create a project named Insurance, with a diagram named Explore. 2. Create the data source, DEVELOP, in SAS Enterprise Miner. DEVELOP is in the directory c:\workshop\Practice. 3. Set the role of all variables to Input, with the exception of the Target variable, Ins (1= has insurance, 0= does not have insurance). 4. Set the measurement level for the Target variable, Ins, to Binary. 5. Ensure that Branch and Res are the only variables with the measurement level of Nominal. 6. All other variables should be set to Interval or Binary. 7. Make sure that the default sampling method is random and that the seed is 12345. Use this project to answer the next <b.seven< b="" style="color: rgb(51, 51, 51); font-family: Arial, Helvetica, Verdana, sans-serif; font-size: small; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: 16px; orphans: auto; text-align: left; text-indent: 0px; texttransform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-size-adjust: auto;

-webkit-text-stroke-width: 0px;">questions. (Note: only 2 of 7 questions are displayed for this example) The variable Branch has how many levels? 8 12 19 47 correct_answer = "C" This information can be obtained by viewing the PROC FREQ output. What is the mean credit card balance (CCBal) of the customers with a variable annuity? $0.00 $8,711.65 $9,586.55 $11,142.45 correct_answer = "D" You can use a Stat Explore Node and view the output for the Descriptive Statistics for CCBal by level of the target variable.

A. B. C. D.

A. B. C. D.

SAS Platform Administration for SAS 9


Item 1 The location of the repository manager physical files can be found in: A. B. C. D. SAS Management Console. the metadata server's omaconfig.xml file. the foundation repository. the metadata server's sasv9.cfg file. correct_answer = "B" Item 2 Every SAS platform implementation includes: A. B. C. D. a foundation repository and a repository manager. a foundation repository and a custom repository. a custom repository and a repository manager. multiple project repositories. correct_answer = "A" Item 3 Which procedure allows a platform administrator to update table metadata? A. METAUPDATE_RULE B. METASELECT C. METATABLE

D. METALIB correct_answer = "D" Item 4 Which statement regarding pre-assigned libraries is true? A. B. C. D. Pre-assigned libraries reduce the initialization time for a workspace server. Pre-assigned libraries always connect to an RDBMS at server initialization. Pre-assigned libraries always connect to a base SAS library at server initialization. Pre-assigned libraries do not have to be identical across all SAS client applications. correct_answer = "C" Item 5 A platform administrator needs to retrieve from the metadata a complete LIBNAME statement including the user ID and password. To complete this task, the platform administrator must be connected to SAS Management Console with what type of user access in the metadata? A. B. C. D. Access to the credentials associated with libraries created with the METALIB procedure. Access to credentials established by the LIBNAME engine. Access to credentials associated with users in the outbound login. Access to credentials for the authentication domain associated with the database server. correct_answer = "D" Item 6 By default, which groups have WriteMetadata on the Foundation repository? A. B. C. D. PUBLIC SASUSERS ADMINISTRATORS ONLY SAS SYSTEM SERVICES ONLY correct_answer = "B" Item 7 Given the following authorization settings for Library Sales2:

A. B. C. D.

Library Sales2's parent folder has an explicit grant of RM for Mary. Library Sales2 has an explicit denial of RM for PUBLIC. Which statement is true? Mary can see Library Sales2. Mary can see data flagged as PUBLIC in Library Sales2. Mary cannot see Library Sales2. Mary can see Library Sales2, but not any data flagged as PUBLIC.

correct_answer = "C" Item 8 Which statement is FALSE regarding the WriteMemberMetadata (WMM) permission? A. B. C. D. By default, it mirrors the WriteMetadata permission. It only applies to folders. If WriteMetadata is granted, then you should not deny WMM. WMM is inherited from one folder to another folder. correct_answer = "D" Item 9 Content has been exported from a SAS 9.1.3 environment into a SAS 9.2 development environment. After the export, the platform administrator attempts to promote an object from the SAS 9.2 development environment into a production environment. What will be the result of this promotion attempt? A. The promotion will not be allowed because objects from SAS 9.1.3 cannot be promoted to SAS 9.2. B. The promotion will not be allowed because objects in a development environment cannot be promoted to a production environment. C. The promotion will be allowed assuming the object type is allowed for promotion. D. The promotion will not be allowed because objects exported from a previous environment cannot be promoted. correct_answer = "C"

SAS Data Integration Developer for SAS 9


Item 1 Which of the following servers is NOT a part of the platform for SAS Business Analytics server tier? A. B. C. D. SAS Metadata Server SAS Workspace Server SAS/CONNECT Server SAS Content Server correct_answer = "D" Item 2 Which products are needed on the local host in order to access data from an MS Access Database using an ODBC Data Source name? A. SAS/ACCESS interface to DSN B. SAS/ACCESS interface to MDB C. SAS/ACCESS interface to PC Files

D. SAS/ACCESS interface to ODBC correct_answer = "D" Item 3 Which statement is true regarding external files? A. B. C. D. External file objects are accessed with SAS INFILE and FILE statements. External files contain only one record per line. External files can be used as input but not as outputs in SAS Data Integration Studio jobs. SAS can only work with Blank, Comma, Semicolon and Tab as delimiters in external files. correct_answer = "A" Item 4 Within SAS Data Integration Studio's SQL Join transformation, the option to turn on debug is located in which Properties pane? A. B. C. D. Select Properties Create Properties SQL Join Properties Job Properties correct_answer = "C" Item 5 Which SAS Data Integration Studio reports, generated as external files, can be stored as document objects within metadata? A. B. C. D. only job reports only table reports both job reports and table reports No reports can be stored as document objects. correct_answer = "C" Item 6 You want to create a job to extract only the rows that contain information about female employees from a table that contains information about both male and female employees. The new table should have observations in ascending order of age. Refer to the job flow diagram in the exhibit. Where would you set the options to filter and sort the data?

A. Where tab and Group By tab B. Where tab and Order By tab C. Where tab and Parameters tab

D. Group By tab and Parameters tab correct_answer = "B" Item 7 Within SAS Data Integration Studio's Table Loader transformation, which load style choice does NOT exist? A. B. C. D. Delete where Append to Existing Replace Update/Insert correct_answer = "A" Item 8 In SAS Data Integration Studio, a business key can be defined in the properties of which transformation? A. B. C. D. Data Validation SQL Join Lookup SCD Type 2 Loader correct_answer = "D"

SAS BI Content Developer for SAS 9


Item 1 When opening a registered SAS data file into a Microsoft Excel Worksheet, a user has the option to sort the data. Which application performs the sort and where does the sort occur? A. B. C. D. SAS performs the sort on the server. SAS performs the sort on the local machine. Excel performs the sort on the server. Excel performs the sort on the local machine. correct_answer = "A" Item 2 When can you add a stored process as a data source to an information map? A. anytime B. when at least one table is selected as a data source C. when at least one OLAP cube is selected as a data source

D. once an application server has been selected correct_answer = "B" Item 3 Refer to the exhibit.

A SAS.IdentityGroups filter has been created in SAS Information Map Studio. There is a data item called "Group" that contains different metadata groups. If the "Group" filter is applied to the map, how will it affect the data? A. B. C. D. All rows will be returned for any group that the user is a member of. Only rows that belong to the first group are returned. All rows will be returned for PUBLIC group only. All rows matching the group identity login are returned. correct_answer = "A" Item 4 A SAS data set is used as a data source for a SAS BI Dashboard data model. Which type of code do you write to query the data? A. B. C. D. DATA Step PROC SQL a SQL/JDBC query MDX correct_answer = "C" Item 5 Refer to the exhibit.

What causes this error message when executing a stored process? A. Stored process code cannot be a .TXT file. B. The stored process server is not running.

C. The file that contains the stored process code is not in the specified location. D. An administrator deleted the stored process from the metadata. correct_answer = "C" Item 6 In a stored process, when using a range prompt named DateRange, which macro variables would you use in your SAS code? A. B. C. D. DateRange_START and DateRange_FINISH DateRange_BEGIN and DateRange_END DateRange_MIN and DateRange_MAX DateRange0 and DateRange1 correct_answer = "C" Item 7 Upon initial install, all of the capabilities in the 'Web Report Studio: Report Creation' role are also included in which role? A. B. C. D. Web Report Studio: Report Viewing Web Report Studio: Advanced Web Report Studio: Content Management Web Report Studio: Administration correct_answer = "B" Item 8 A content developer would like to create a group of cascading prompts to use in multiple reports without recreating the prompts for each report. What features of the prompt framework must the developer use? A. B. C. D. Cannot create shared cascading prompts for use in multiple reports. Dynamic Prompts and Shared Prompts Cascading Prompts and Standard Groups Cascading Prompts, Standard Groups, and Shared Prompts correct_answer = "D" Item 9 A SAS Information Map with a SAS OLAP Cube as a data source can be built from which of the following? A. B. C. D. multiple SAS OLAP Cubes a SAS OLAP Cube and a stored process one table joined with one SAS OLAP Cube one SAS OLAP Cube only correct_answer = "D"

Item 10 Which statement is true regarding connection profiles used with the SAS platform applications? A. B. C. D. Each SAS platform application must have its own connection profile. Connection profiles are stored on the server machine. Connection profiles are stored on the machine where the SAS application is installed. All SAS platform applications share one connection profile. correct_answer = "C"

SAS Statistical Business Analyst Using SAS 9


Item 1 A financial analyst wants to know whether assets in portfolio A are more risky (have higher variance) than those in portfolio B. The analyst computes the annual returns (or percent changes) for assets within each of the two groups and obtains the following output from the GLM procedure:

Which conclusion is supported by the output? A. B. C. D. Assets in portfolio A are significantly more risky than assets in portfolio B. Assets in portfolio B are significantly more risky than assets in portfolio A. The portfolios differ significantly with respect to risk. The portfolios do not differ significantly with respect to risk. correct_answer = "C" Item 2 An analyst has determined that there exists a significant effect due to region. The analyst needs to make pairwise comparisons of all eight regions and wants to control the experimentwise error rate. Which GLM procedure statement would provide the correct output?

A. B. C. D.

lsmeans Region / pdiff=all adjust=dunnett; lsmeans Region / pdiff=all adjust=tukey; lsmeans Region / pdiff=all adjust=lsd; lsmeans Region / pdiff=all adjust=none;

correct_answer = "B" Item 3 A linear model has the following characteristics:


a dependent variable (y) one continuous predictor variables (x1) including a quadratic term (x12) one categorical predictor variable (c1 with 3 levels) one interaction term (c1 by x1) Which SAS program fits this model?
proc glm data=SASUSER.MLR; class c1; model y = c1 x1 x1sq c1byx1 run; proc reg data=SASUSER.MLR; model y = c1 x1 x1sq c1byx1 run; proc glm data=SASUSER.MLR; class c1; model y = c1 x1 x1*x1 c1*x1 run; proc reg data=SASUSER.MLR; model y = c1 x1 x1*x1 c1*x1; run; /solution; /solution; /solution;

A. B. C. D. E. F. G. H. I. J. K. L. M. N.

correct_answer = "C" Item 4 Refer to the REG procedure output:

What is the most important predictor of the response variable? A. B. C. D. intercept overhead scrap training correct_answer = "B" Item 5 Which statement is an assumption of logistic regression? A. B. C. D. The sample size is greater than 100. The logit is a linear function of the predictors. The predictor variables are not correlated. The errors are normally distributed. correct_answer = "B" Item 6 When selecting variables or effects using SELECTION=BACKWARD in the LOGISTIC procedure, the business analyst's model selection terminated at Step 3. What happened between Step 1 and Step 2? A. B. C. D. DF increased. AIC increased. Pr > Chisq increased. - 2 Log L increased. correct_answer = "D" Item 7 The LOGISTIC procedure will be used to perform a regression analysis on a data set with a total of 10,000 records. A single input variable contains 30% missing records. How many total records will be used by PROC LOGISTIC for the regression analysis? Enter your numeric answer in the space below. Do not add leading or trailing spaces to your answer. Click the calculator button to display a calculator if needed. correct_answer = "7000" Item 8 An analyst is screening for irrelevant variables by estimating strength of association between each input and the target variable. The analyst is using Spearman correlation and Hoeffding's D statistics in the

CORR procedure. What would likely cause some inputs to have a large Hoeffding and a near zero Spearman statistic? A. B. C. D. nonmonotonic association between the variables linear association between the variables monotonic association between the variables no association between the variables correct_answer = "A" Item 9 An analyst builds a logistic regression model which is 75% accurate at predicting the event of interest on the training data set. The analyst presents this accuracy rate to upper management as a measure of model assessment. What is the problem with presenting this measure of accuracy for model assessment? A. B. C. D. This accuracy rate is redundant with the misclassification rate. It is pessimistically biased since it is calculated from the data set used to train the model. This accuracy rate is redundant with the average squared error. It is optimistically biased since it is calculated from the data used to train the model. correct_answer = "D" Item 10 Refer to the exhibit:

For the ROC curve shown, what is the meaning of the area under the curve? A. B. C. D. percent concordant plus percent tied percent concordant plus (.5 * percent tied) percent concordant plus (.5 * percent discordant) percent discordant plus percent tied correct_answer = "B"</b.seven<>

Vous aimerez peut-être aussi