Académique Documents
Professionnel Documents
Culture Documents
SAS Analytics, Data analysis Duration:60 hours course fee :Rs 30,000/More detail visit www.iisastr.com phone no:9312506496 Course content 1. Introduction to the SAS Language 1.1 Introduction 1.2 Basic Language: Rules and Syntax 1.3 Creating SAS Data Sets 1.4 The INPUT Statement 1.5 SAS Data Step Programming Statements Their Uses 1.6 Data Step Processing 1.7 More on INPUT Statement 1.7.1 Use of Pointer controls 1.7.2 The trailing@ line-hold specifier 1.7.3 The trailing@ @ line-hold specifier 1.7.4 Uses of RETAIN statement 1.7.5 The use of line pointer controls 1.8 Using SAS Procedures using SAS
2. More on SAS Programming and some Applications 2.1 More on the DATA and PROC STEPS 2.1.1 Reading data from files 2.1.2 Combing SAS data sets 2.1.3 Saving and retrieving permanent SAS data Sets 2.1.4 User- defined in formats and formats 2.1.5 Creating SAS data sets in procedure steps 2.2 SAS Procedures for Computing Statistics 2.2.1 The UNIVARIATE procedures 2.2.2 The FREQ procedure 2.3 Some Useful Base SAS Procedures 2.3.1 The PLOT procedures 2.3.2 The CHART procedures 2.3.3 The TABULATE procedure 3. Statistical Graphics Using SAS / GRAPH 3.1 INTRODUCTINS
3.2 An INTRODUCTINS to SAS / GRAPH 3.2.1 Useful SAS / GRAPH procedures GPLOT procedure GCHART procedure 3.2.2 Writing SAS/Graph programs 3.3 Quantile Plots 3.4 Empirical Quantile- Quantile Plots 3.5 Theoretical Quantile- Quantile Plots or probability Plots 3.6 Profile Plots of Means or Interaction Plots 3.7 Two Dimensional Scatter Plots and Scatter Plot Matrices 3.7.1 Two Dimensional Scatter Plots 3.7.2 Scatter plots Matrices 3.8 Histograms Bar Charts and Pie Charts 3.9 Other SAS Procedures for High- resolutions Graphics 4. Statistical Analysis of Regression Models 1. An Introduction to Simple Linear Regression 4.1.1 Simple linear regression using PROC REG 4.1.2 Lack of fit test using PROC ANOVA 4.1.3 Diagnostics use of case statistics 4.1.4 Predictions of new y values using regressions 4.2 An Introduction to Multiple Regression Analysis 4.2.1 Multiple regression analysis using PROC REG 4.2.2 Case Statistics and residual analysis 4.2.3 Residual Plots 4.2.4 Examining relationships among regression variables 4.3 Types of Sums of Squares Computed in PROC REG and PROC GLM 4.3.1 Model comparison technique and extra sum of squares 4.3.2 Types of sums of squares in SAS 4.4 Subset selection using PROC REG for Model selection 4.4.1 Subset selection using PROC REG 4.4.2 Other options available in PROC REG for model selection 4.5 Inclusion of squared Terms Product terms in Regression Models 4.5.1 Including interaction terms in the model 4.5.2 Comparing slopes of regression lines using interaction 4.5.3 Analysis of models with higher-order terms with PROC REG 5. Analysis of Variance Model 5.1 Introduction 5.1.1 Treatment Structure
5.1.2 Experimental Designs 5.1.3 Linear Models 5.2 One-way Classification 5.2.1 Using PROC ANOVA to analyze one-way Classifications 5.2.2 Making preplanned (or a priori) comparisons using PROC GLM 5.2.3 Testing orthogonal polynomials using contrasts 5.3 One-Way Analysis of Covariance 5.3.1 Using PROC GLM to perform one-way covariance analysis 5.3.2 One-way covariance analysis: Testing for equal slopes 5.4 A two Factorial in a Completely Randomized Design 5.4.1 Analysis of a two-way factorial using PROC GLM 5.4.2 Residual Analysis of Interaction 5.5 Two-Way Factorial: Analysis of Interaction 5.6 Two-Way Factorial: Unequal Sample sizes 5.7 Two way Classification: Randomized Complete Block Design 5.7.1 Using PROC GLM to analyze a RCBD 5.7.2 Using PROC GLM to test for non additivity 6. Analysis of Variance: Random and Mixed Effects Models 6.1 Introduction 6.2 One-way Random Effects Model 6.2.1 Using PROC GLM to analyze one-way Random Effects Models 6.2.2 Using PROC MIXED to analyze one-way Random Effects Models 6.3 Two way Crossed Random Effects Model 6.3.1 Using PROC GLM and PROC MIXED to analyze two way Crossed Random Effects Model 6.3.2 Randomized complete block design: Blocking when treatment factors are random 6.4 Two-Way Nested Random Effects Model 6.4.1 Using PROC GLM to analyze two-way nested random effects models 6.4.2 Using PROC MIXED to analyze two-way Nested Random Effects Models 6.5 Two-way Mixed Effects Models 6.5.1 Two-way Mixed Effects Models: Randomized Complete Blocks Design 6.5.2 Two-way Mixed Effects Models: Crossed Classification 6.5.3 Two-way Mixed Effects Models: Nested Classification 6.6 Models with Random and Nested Effects for More Complex Experiments 6.6.1 Models for nested factorials 6.6.2 Models for split-plot experiments 6.6.3 Analysis of split-plot experiments using PROC GLM 6.6.4 Analysis of split-plot experiments using PROC MIXED Logistic Regression
o o o o o o o o o o o o
Introduction to Analytics Introduction to SAS, GUI Types of Libraries, Creating Variable Attributes Name, Type, Format, Informat, Label Introduction to Data steps and Proc steps DATA Understanding Reading, Importing, Exporting and Copying Data Conditional Statements (Where, If, If then Else) Appending, Merging and Sorting Datasets Proc steps like " Proc Means, Proc Freq, Proc Sort Output Delivery System (ODS) SAS Functions and Options List Input, Delimiters, Reading missing Values, and non standard values Do loops Generating Data Execution Output Statements Nesting Do loops Do While and Do Until Statement Arrays Dimensions Array elements and Range Proc report Introduction to Data base, Relational Data base concepts Proc SQL, Data integrity Constraints, Creating table and Inserting Values Proc SQL codes to Retrieve & Summarize data Group, Sort & Filter Using Joins
o o o o
Indexes Macros Defining and calling a macro Macro Parameters and Variables Global and Local Variables
Analyst
Progressive Digital Media Group
October 2011 February 2013 (1 year 5 months) Hyderabad Research: To identify and monitor competitors and research market conditions by constantly tracking the changes in the industry. To analyze and incorporate customer preferences, demographics and buying habits, that influences the market dynamics. To undertake secondary research where data is collected from trade magazines, trade associations, statistical offices, company statements and trade departments, amongst other sources. Interpret findings and draw conclusions from preliminary research. To analyze the key market metrics through data sanity checks and verifies if it is in trend with the market. To initiate and complete the permission process for sourcing and attribution of information whenever necessary. To conduct cross-country comparison and also check if the data is in line. Authoring reports address client centric requirements. Analytics: Running descriptive statistics and statistical analysis including T-Test, ANOVA, Time Series Forecasting, Regression and Cluster Analysis by using SAS and SPSS Import and Export of Data, Variable creation, String and Date function, keep and Drop variables. Conditional execution (if then, Else if, Else, do loop, where conditions), Macro, Array, Sorting, Merging by using SAS and SPSS Comprehensive knowledge of Advanced Excel including conditional Formatting, Advance filtering, Cell Referencing, Text functions, What if Analysis, Vlookup, If statements, Data validation, Pivot Table, Solver, Charts, Macro, VBA, etc. Comprehensive knowledge of MS PowerPoint including building blocks of a presentation, themes and styles, tables, charts, graphs and smart art, animation Gained comprehensive knowledge of SQL including Basic structures SQL, basic commands, Grouping, Ordering, Union, Intersection, Different types of Joining (natural join, outer join) etc Sound understanding of strategic tools like Porters 5 forces, SWOT, PEST, BCG, GE, Value chain analysis, etc
Adeptness in Analytics using Traditional and Advanced Statistical applications in Banking, Insurance, Market Research, CPG/Retail, Hi-Technology. International Exposure: Interacting with Top Executive Management Worked with popular Marketing scientists around the Globe. Trained in R programming & Analytics in Minneapolis, MN, USA. Presenter/Co-Author in multiple International Conferences Traditional/Advanced Analytic Methods: Linear, Non Linear, Logistic, Probit, Multinomial Logit, Log Linear Regression, Pricing Analysis (with/out competitive context), Ranking Analysis/Preference scores, Text Perception, Image Perception/Advertising Research, Shapley Value/Game Theory, different Product Optimization Techniques which suits to real marketing decisions, Cannibalisation of Products, Random Forest, Decision Trees, CART, Key Driver Analysis, Derived Importance , Brand Association & Mapping techniques. Analytics services and Product Development Technical Expertise: R, hadoop, mapreduce, SSRS, SPSS, SAS/SQL, SawTooth, Excel, Quantum, Quanvert. Extensive knowledge in Programming, Data Analytics, Data Mining & Weighting using different programming tools Client Interaction: Interaction with Top Executive Management Global client interaction, Analysis findings, recommendations and preparing Executive summaries, explaining complex findings in layman terms Management & Training: Key driver in motivating team towards the success of Organisation People Management Basic and Advanced Analytic Methods Trainer
SAS/BASE
HISTORY OF SAS --------------------------------------------------------------------------------------------1 BASIC THEORY & RULES ETC...--------------------------------------------------------------------------1 SAS WINDOWING ENVIRONMENT-----------------------------------------------------------------------2 CREATION OF LIBRARIES-----------------------------------------------------------------------------------2 SAS PROGRAMMING (BRIEFLY-----------------------------------------------------------------------------2 - DATASTEP - PROC STEP WAYS TO READ DATA (BRIEFLY) ----------------------------------------------------------------------2 BACK END PROCESS------------------------------------------------------------------------------------------2 INSTALLATION PROCESS ----------------------------------------------------------------------------------3
DATASTEP
INFORMATS & FORMATS------------------------------------------------------------------------------------4 DATA & OPTIONS---------------------------------------------------------------------------------------------4 INFILE & OPTIONS--------------------------------------------------------------------------------------------5 INPUT & OPTIONS--------------------------------------------------------------------------------------------6, 7 DATALINES & OPTIONS-------------------------------------------------------------------------------------8 DATES-------------------------------------------------------------------------------------------------------------8 SUM STATEMENT----------------------------------------------------------------------------------------------9 ATTRIB STATEMENT & INDIVIDUAL ATTRIBUTES--------------------------------------------------9 RETAIN STATEMENT----------------------------------------------------------------------------------------9 IF, IF/THEN/ELSE STATEMENT--------------------------------------------------------------------------10 WHERE STATEMENT----------------------------------------------------------------------------------------10 DO, DO ITERATIVE, DO UNTIL & DO WHILE -------------------------------------------------------11 CONCATENATION & INTEREAVING---------------------------------------------------------------------12 MERGE & UPDATE--------------------------------------------------------------------------------------------12
FUNCTIONS
- DATE FUNCTIONS-----------------------------------------------------------------------------------------13, 14 - CHARACTER FUNCTIONS -------------------------------------------------------------------------------15, 16 - NUMERIC FUNCTIONS------------------------------------------------------------------------------------17, 18 ARRAYS---------------------------------------------------------------------------------------------------------19, 20 OTHERS MISSING CONCEPTS----------------------------------------------------------------------------21 REVIEW OF DATASTEP------------------------------------------------------------------------------------22 EXAM & DISCUSSION---------------------------------------------------------------------------------------23
PROC STEP UTILITY PROCEDURES
- PROC IMPORT-----------------------------------------------------------------------------------------------24
- PROC EXPORT-----------------------------------------------------------------------------------------------24 - PROC CONTENTS-------------------------------------------------------------------------------------------24 - PROC APPEND-----------------------------------------------------------------------------------------------25 - PROC SORT--------------------------------------------------------------------------------------------------25 - PROC COPY--------------------------------------------------------------------------------------------------26 - PROC DATASETS-------------------------------------------------------------------------------------------26 - PROC COMPARE---------------------------------------------------------------------------------------------26 - PROC TRANSPOSE------------------------------------------------------------------------------------------27 - PROC FORMAT-----------------------------------------------------------------------------------------------27 - PROC SETINIT-----------------------------------------------------------------------------------------------27 - PROC PWENCODE -----------------------------------------------------------------------------------------28 - PROC OPTIONS --------------------------------------------------------------------------------------------28 - PROC RANK -------------------------------------------------------------------------------------------------28 - PROC CPORT------------------------------------------------------------------------------------------------29 - PROC CIMPORT---------------------------------------------------------------------------------------------29 - PROC ----------------------------------------------------------------------------------------REPORTING PROCEDURES
- BASICS OF STATISTICS---------------------------------------------------------------------------------35
STATISTICAL PROCEDURES
- PROC MEANS -----------------------------------------------------------------------------------------------36 - PROC SUMMARY--------------------------------------------------------------------------------------------36 - PROC UNIVARIATE ---------------------------------------------------------------------------------------37 - PROC FREQ -------------------------------------------------------------------------------------------------37 - PROC CORR -------------------------------------------------------------------------------------------------38 - PROC REG----------------------------------------------------------------------------------------------------38 - PROC ANOVA-----------------------------------------------------------------------------------------------38 - T-TEST -------------------------------------------------------------------------------------------------------39 - Chi SQUARE -------------------------------------------------------------------------------------------------39 - HYPOTHESIS------------------------------------------------------------------------------------------------40 - OTHER STATISTICAL PROCEDURES ----------------------------------------------------------------41 REVIEW OF PROC STEP-------------------------------------------------------------------------------------42 EXAM & DISCUSSION ---------------------------------------------------------------------------------------43
SAS/ACCESS
BASIC THEORY OF SAS/ACCESS ------------------------------------------------------------------------44 WAYS TO ACCESS DATA FROM DIFFERENT SOURCES LIKE NOTEPADS, EXCEL SHEETS, MS-ACCESS, ORACLE, DB2, SYBASE AND TERRADATA etc - PROC IMPORT --------------------------------------------------------------------------------------------- () - INFILE STATEMENT-------------------------------------------------------------------------------------- () - SQL PASS THRU QUERY---------------------------------------------------------------------------------45 - LIBNAME STATEMEN -------------------------------------------------------------------------------------45 - PROC DBLOAD----------------------------------------------------------------------------------------------46 - PROC ACCESS-----------------------------------------------------------------------------------------------46 ---------------------------------------------------------------------------------------47, 48, 49, 50, 51, REVIEW, EXAM & DISCUSSION --------------------------------------------------------------------------52 SAS/MACROS---------------------------------------------------------------------------------------------53, 54, 55, 56, 57 REVIEW, EXAM & DISCUSSION---------------------------------------------------------------------------58
SQL (PROC SQL)
-----------------------------------------------------------------------------1MONTH
PRACTICE CODING ON REAL TIME DATA INTERVIEW QUESTIONS RESUME PREPARATION WRITTEN TESTS MOCK INTERVIEWS CONSULTIN
o o o o
Need for analytics Analytics use in different industries Challenges in adoption of analytics Overview of Course Contents Data understanding
o o o o o o o o o o o o o o o o
Data types (Nominal, Ordinal, Interval and Ratio) Descriptive statistics Tabular & Graphical Method Summary statistics Introduction to some statistical terminologies and inferences Population, Sample and Random variables Point and Interval Estimations Probability Discrete/Continuous Probability Distributions Hypothesis Testing Importance of formulating and validating the hypothesis Formulation of hypothesis (Null and alternate) Testing association and differences Statistical significance and test statistic Level of significance Z-Test, T-Test, Chi-Square test, ANOVA Parametric & Non-Parametric test Correlation & Regression Linear Regression Case Study on Multiple Regression Logistic Regression Case Study on Logistic Regression Cluster Analysis Case Study on Cluster Analysis Factor Analysis Case Study on Factor Analysis
A. B. C. D. E. F. G. H. I.
Name
Age
D.
E. F. G. Id --182 Char1 ----Char2 ----Q
H.
I. Id Char1 Char2
J. K.
--182
----M
----Q
L.
M. N. O. Id --720 Char1 ----P Char2 ----S
G.
H. I. J. K. Var1 -----RED CYAN Var2 -----ORANGE WHITE Var3 -----BLUE GRAY Var4 -----INDIGO BROWN
L.
M. N. O. P. Var1 -----RED BLUE Var2 -----ORANGE INDIGO Var3 -----YELLOW PURPLE Var4 -----GREEN VIOLET
Q.
R. S. Var1 -----Var2 -----Var3 -----Var4 ------
T. U. V. W.
How many observations will be in data set WORK.ONE? Enter your numeric answer. Do not add leading or trailing spaces to your answer.
Upon completion of execution, what are the values of the variables X and Index in the SAS data set named WORK.LOOP?
A. B. C. D.
X = 3, Index = 5 X = 5, Index = 5 X = 5, Index = 6 X = 5, Index = 7 correct_answer = "D" Item 6 The following SAS program is submitted:
51 - 100 run;
Which one of the following PRINT procedure steps correctly applies the format?
A. B. C. D. proc print data = SASUSER.CLASS; var test; format test score; run;
E.
F. G. H. I. proc print data = SASUSER.CLASS; var test; format test score.; run;
J.
K. L. M. proc print data = SASUSER.CLASS format = score; var test; run;
N.
O. P. Q. proc print data = SASUSER.CLASS format = score.; var test; run;
R. correct_answer = "B" Item 7 This item will ask you to provide a line of missing code; The SAS data set WORK.INPUT contains 10 observations, and includes the numeric variable Cost. The following SAS program is submitted to accumulate the total value of Cost for the 10 observations:
correct_answer = "B" Item 8 This question will ask you to provide a line of missing code. Given the following data set WORK.SALES:
SalesID ------W6790 W7693 W1387 SalesJan -------50 25 . FebSales -------400 100 300 MarchAmt -------350 125 250
A. B. C. D.
Qtr1 = sum(of month{_ALL_}); Qtr1 = month{1} + month{2} + month{3}; Qtr1 = sum(of month{*}); Qtr1 = sum(of month{3}); correct_answer = "C" Item 9 Given the following SAS error log
44 45 46 47
What change to the program will correct the error? A. B. C. D. Replace the WHERE statement with an IF statement Change the ** in the BMI formula to a single * Change bmi to BMI in the WHERE statement Add a (Keep=BMI) option to the SET statement correct_answer = "A" Item 10 The following SAS program is submitted:
data WORK.TEMP; Char1='0123456789'; Char2=substr(Char1,3,4); run;
A.
B.
C.
D. correct_answer = "D" Item 2 Given the following SAS data sets ONE and TWO:
The following SAS program is submitted creating the output table THREE:
data three; merge one (in = in1) two (in = in2); by num; run;
Which one of the following SQL programs creates an equivalent SAS data set THREE?
A. B. C. D. proc sql; create table three as select * from one full join two
E. F. quit;
G.
H. I. J. K. L. M. N. O. P. Q. R. S. T. U. V. W. X. Y. Z. AA. quit; quit; proc sql; create table three as select coalesce(one.num, two.num) as NUM, char1, char2 from one full join two on one.num = two.num; quit; proc sql; create table three as select one.num, char1, char2 from one full join two on one.num = two.num; proc sql; create table three as select coalesce(one.num, two.num) as NUM, char1, char2 from one full join two where one.num = two.num;
The report that is produced displays observations whose value of STYLE are all equal to RANCH.
Which one of the following functions on the WHERE clause resolves the current value of the macro variable TYPE? A. B. C. D. GET SYMGET %SYMGET &RETRIEVE correct_answer = "B" Item 4 The SAS data set SASDATA.SALES has a simple index on the variable DATE and a variable named REVENUE with no index. In which one of the following SAS programs is the DATE index considered for use?
A. B. C. D. E. F. G. H. I. J. K. L. M. N. proc print data = sasdata.sales; by date; run; proc print data = sasdata.sales; where month(date) = 3; run; data march; set sasdata.sales; if '01mar2002'd < date < '31mar2002'd; run; data march; set sasdata.sales; where date < '31mar2002'd or revenue > 50000; run;
correct_answer = "A"
A. B. C. D.
Findings Interventions Events Special Purpose correct_answer = "A" Item 3 Which option in the PROC EXPORT procedure overwrites an existing file?
A. B. C. D.
NEW OVERWRITE REPLACE KEEP correct_answer = "C" Item 4 Given the following data set WORK.DEMO:
PTID 689574 423698 758964 653347 493847 500029 513842 515151 522396 534787 875642 879653 542369 698754 656423 785412 785698 763284 968743 457826 M F F F M M F F M M F F F F M M M M M M Sex Age 15 14 12 14 14 12 12 15 13 12 11 15 12 11 16 12 16 11 14 18 Height 80.0 65.5 60.3 62.8 63.5 57.3 59.8 62.5 62.5 59.0 51.3 75.3 56.3 50.5 72.0 67.8 72.0 57.5 60.5 74.0 Weight 115.5 90.0 87.0 98.5 102.5 83.0 84.5 112.5 84.0 99.5 50.5 105.0 77.0 70.0 150.0 121.0 110.0 85.0 85.0 165.0
A. B. C. D.
data test; set temp (keep=SUBJID ITT SAF OTH ); by subjid; length STDYPOP $200; array pop{*} SAF ITT OTH ; do i=1 to 3; if STDYPOP="" and pop{i}=1 then STDYPOP=put(i, stdypfmt.); else if STDYPOP^="" and pop{i}=1 then STDYPOP = trim(STDYPOP)||"/"||put(i, stdypfmt.); end; run;
correct_answer = "Safety/Other"
Item 6 This question will ask you to provide a line of missing code. Given the data set WORK.STUDYDATA with the following variable list:
# 2 3 Variable DAY DIABP Type Char Num Len 8 8 Label Study Day Diastolic Blood Pressure
TRT
Char
Treatment
A. B. C. D.
the mean diastolic blood pressure values for every day by treatment group the overall mean diastolic blood pressure for each treatment group Which statement correctly completes the program to meet these requirements? where trt or trt*day; types trt trt*day; by trt day; id trt day; correct_answer = "B" Item 7 The following SAS program is submitted:
%let member1=Demog; %let member2=Adverse; %let Root=member; %let Suffix=2; %put &&&Root&Suffix;
What is written to the SAS log? A. B. C. D. &member2 Adverse &&&Root&Suffix WARNING: Apparent symbolic reference ROOT2 not resolved. correct_answer = "B" Item 8 This question will ask you to provide a line of missing code. The following SAS program is submitted:
proc format ; value dayfmt 1='Sunday' 2='Monday' 3='Tuesday'
proc report data=diary ; column subject day var1 var2 ; <insert code here> run ;
In the DIARY data set, the format DAYFMT is assigned to the variable DAY. Which statement will cause variable DAY to be printed in its unformatted order? A. B. C. D. define day / order define day / order define day / order define day / order correct_answer = "D" 'Day' ; order=data 'Day' ; noprint 'Day' ; order=internal 'Day' ;
Item 9 You are using SAS software to create reports that will be output in a Rich Text Format so that it may be read by Microsoft Word. The report will span multiple pages and you want to display a '(Continued)' text at the end of each page when a table spans multiple pages. Which statement can you add to the SAS program to ensure the inclusion of the '(Continued)' text? A. B. C. D. ods rtf file='report.rtf'; ods tagsets.rtf file='report.rtf'; ods tagsets.rtf file='report.rtf' break='Continued'; ods file open='report.rtf' type=rtf break='(Continued)'; correct_answer = "B" Item 10 What is the primary purpose of programming validation? A. Ensure that the output from both the original program and the validation program match. B. Efficiently ensure any logic errors are discovered early in the programming process. C. Justify the means used to accomplish the outcome of a program and ensure its accurate representation of the original data. D. Document all specifications pertaining to programmed output and ensure all were reviewed during the programming process. correct_answer = "C"
Open the diagram labeled Practice A within the project labeled Practice A. Perform the following in SAS Enterprise Miner: 1. Set the Clustering method to Average. 2. Run the Cluster node. Use this project to answer the next two questions: What is the Importance statistic for MTGBal (Mortgage Balance)? 0.32959 0.42541 0.42667 1.000000 correct_answer = "C" You must change the clustering method to average and run the cluster node first. Select view results and look in the output window and view the Variable Importance results. What is the Cubic Clustering Criterion statistic for this clustering? 5.00 14.69 5862.76 67409.93 correct_answer = "B" Run the diagram flow and view the results. From the results window, select View -> Summary Statistics -> CCC Plot and mouse over where the data point and the line intersect. This will display the CCC statistic.
A. B. C. D.
A. B. C. D.
Item 2 1. Create a project named Insurance, with a diagram named Explore. 2. Create the data source, DEVELOP, in SAS Enterprise Miner. DEVELOP is in the directory c:\workshop\Practice. 3. Set the role of all variables to Input, with the exception of the Target variable, Ins (1= has insurance, 0= does not have insurance). 4. Set the measurement level for the Target variable, Ins, to Binary. 5. Ensure that Branch and Res are the only variables with the measurement level of Nominal. 6. All other variables should be set to Interval or Binary. 7. Make sure that the default sampling method is random and that the seed is 12345. Use this project to answer the next <b.seven< b="" style="color: rgb(51, 51, 51); font-family: Arial, Helvetica, Verdana, sans-serif; font-size: small; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: 16px; orphans: auto; text-align: left; text-indent: 0px; texttransform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-size-adjust: auto;
-webkit-text-stroke-width: 0px;">questions. (Note: only 2 of 7 questions are displayed for this example) The variable Branch has how many levels? 8 12 19 47 correct_answer = "C" This information can be obtained by viewing the PROC FREQ output. What is the mean credit card balance (CCBal) of the customers with a variable annuity? $0.00 $8,711.65 $9,586.55 $11,142.45 correct_answer = "D" You can use a Stat Explore Node and view the output for the Descriptive Statistics for CCBal by level of the target variable.
A. B. C. D.
A. B. C. D.
D. METALIB correct_answer = "D" Item 4 Which statement regarding pre-assigned libraries is true? A. B. C. D. Pre-assigned libraries reduce the initialization time for a workspace server. Pre-assigned libraries always connect to an RDBMS at server initialization. Pre-assigned libraries always connect to a base SAS library at server initialization. Pre-assigned libraries do not have to be identical across all SAS client applications. correct_answer = "C" Item 5 A platform administrator needs to retrieve from the metadata a complete LIBNAME statement including the user ID and password. To complete this task, the platform administrator must be connected to SAS Management Console with what type of user access in the metadata? A. B. C. D. Access to the credentials associated with libraries created with the METALIB procedure. Access to credentials established by the LIBNAME engine. Access to credentials associated with users in the outbound login. Access to credentials for the authentication domain associated with the database server. correct_answer = "D" Item 6 By default, which groups have WriteMetadata on the Foundation repository? A. B. C. D. PUBLIC SASUSERS ADMINISTRATORS ONLY SAS SYSTEM SERVICES ONLY correct_answer = "B" Item 7 Given the following authorization settings for Library Sales2:
A. B. C. D.
Library Sales2's parent folder has an explicit grant of RM for Mary. Library Sales2 has an explicit denial of RM for PUBLIC. Which statement is true? Mary can see Library Sales2. Mary can see data flagged as PUBLIC in Library Sales2. Mary cannot see Library Sales2. Mary can see Library Sales2, but not any data flagged as PUBLIC.
correct_answer = "C" Item 8 Which statement is FALSE regarding the WriteMemberMetadata (WMM) permission? A. B. C. D. By default, it mirrors the WriteMetadata permission. It only applies to folders. If WriteMetadata is granted, then you should not deny WMM. WMM is inherited from one folder to another folder. correct_answer = "D" Item 9 Content has been exported from a SAS 9.1.3 environment into a SAS 9.2 development environment. After the export, the platform administrator attempts to promote an object from the SAS 9.2 development environment into a production environment. What will be the result of this promotion attempt? A. The promotion will not be allowed because objects from SAS 9.1.3 cannot be promoted to SAS 9.2. B. The promotion will not be allowed because objects in a development environment cannot be promoted to a production environment. C. The promotion will be allowed assuming the object type is allowed for promotion. D. The promotion will not be allowed because objects exported from a previous environment cannot be promoted. correct_answer = "C"
D. SAS/ACCESS interface to ODBC correct_answer = "D" Item 3 Which statement is true regarding external files? A. B. C. D. External file objects are accessed with SAS INFILE and FILE statements. External files contain only one record per line. External files can be used as input but not as outputs in SAS Data Integration Studio jobs. SAS can only work with Blank, Comma, Semicolon and Tab as delimiters in external files. correct_answer = "A" Item 4 Within SAS Data Integration Studio's SQL Join transformation, the option to turn on debug is located in which Properties pane? A. B. C. D. Select Properties Create Properties SQL Join Properties Job Properties correct_answer = "C" Item 5 Which SAS Data Integration Studio reports, generated as external files, can be stored as document objects within metadata? A. B. C. D. only job reports only table reports both job reports and table reports No reports can be stored as document objects. correct_answer = "C" Item 6 You want to create a job to extract only the rows that contain information about female employees from a table that contains information about both male and female employees. The new table should have observations in ascending order of age. Refer to the job flow diagram in the exhibit. Where would you set the options to filter and sort the data?
A. Where tab and Group By tab B. Where tab and Order By tab C. Where tab and Parameters tab
D. Group By tab and Parameters tab correct_answer = "B" Item 7 Within SAS Data Integration Studio's Table Loader transformation, which load style choice does NOT exist? A. B. C. D. Delete where Append to Existing Replace Update/Insert correct_answer = "A" Item 8 In SAS Data Integration Studio, a business key can be defined in the properties of which transformation? A. B. C. D. Data Validation SQL Join Lookup SCD Type 2 Loader correct_answer = "D"
D. once an application server has been selected correct_answer = "B" Item 3 Refer to the exhibit.
A SAS.IdentityGroups filter has been created in SAS Information Map Studio. There is a data item called "Group" that contains different metadata groups. If the "Group" filter is applied to the map, how will it affect the data? A. B. C. D. All rows will be returned for any group that the user is a member of. Only rows that belong to the first group are returned. All rows will be returned for PUBLIC group only. All rows matching the group identity login are returned. correct_answer = "A" Item 4 A SAS data set is used as a data source for a SAS BI Dashboard data model. Which type of code do you write to query the data? A. B. C. D. DATA Step PROC SQL a SQL/JDBC query MDX correct_answer = "C" Item 5 Refer to the exhibit.
What causes this error message when executing a stored process? A. Stored process code cannot be a .TXT file. B. The stored process server is not running.
C. The file that contains the stored process code is not in the specified location. D. An administrator deleted the stored process from the metadata. correct_answer = "C" Item 6 In a stored process, when using a range prompt named DateRange, which macro variables would you use in your SAS code? A. B. C. D. DateRange_START and DateRange_FINISH DateRange_BEGIN and DateRange_END DateRange_MIN and DateRange_MAX DateRange0 and DateRange1 correct_answer = "C" Item 7 Upon initial install, all of the capabilities in the 'Web Report Studio: Report Creation' role are also included in which role? A. B. C. D. Web Report Studio: Report Viewing Web Report Studio: Advanced Web Report Studio: Content Management Web Report Studio: Administration correct_answer = "B" Item 8 A content developer would like to create a group of cascading prompts to use in multiple reports without recreating the prompts for each report. What features of the prompt framework must the developer use? A. B. C. D. Cannot create shared cascading prompts for use in multiple reports. Dynamic Prompts and Shared Prompts Cascading Prompts and Standard Groups Cascading Prompts, Standard Groups, and Shared Prompts correct_answer = "D" Item 9 A SAS Information Map with a SAS OLAP Cube as a data source can be built from which of the following? A. B. C. D. multiple SAS OLAP Cubes a SAS OLAP Cube and a stored process one table joined with one SAS OLAP Cube one SAS OLAP Cube only correct_answer = "D"
Item 10 Which statement is true regarding connection profiles used with the SAS platform applications? A. B. C. D. Each SAS platform application must have its own connection profile. Connection profiles are stored on the server machine. Connection profiles are stored on the machine where the SAS application is installed. All SAS platform applications share one connection profile. correct_answer = "C"
Which conclusion is supported by the output? A. B. C. D. Assets in portfolio A are significantly more risky than assets in portfolio B. Assets in portfolio B are significantly more risky than assets in portfolio A. The portfolios differ significantly with respect to risk. The portfolios do not differ significantly with respect to risk. correct_answer = "C" Item 2 An analyst has determined that there exists a significant effect due to region. The analyst needs to make pairwise comparisons of all eight regions and wants to control the experimentwise error rate. Which GLM procedure statement would provide the correct output?
A. B. C. D.
lsmeans Region / pdiff=all adjust=dunnett; lsmeans Region / pdiff=all adjust=tukey; lsmeans Region / pdiff=all adjust=lsd; lsmeans Region / pdiff=all adjust=none;
a dependent variable (y) one continuous predictor variables (x1) including a quadratic term (x12) one categorical predictor variable (c1 with 3 levels) one interaction term (c1 by x1) Which SAS program fits this model?
proc glm data=SASUSER.MLR; class c1; model y = c1 x1 x1sq c1byx1 run; proc reg data=SASUSER.MLR; model y = c1 x1 x1sq c1byx1 run; proc glm data=SASUSER.MLR; class c1; model y = c1 x1 x1*x1 c1*x1 run; proc reg data=SASUSER.MLR; model y = c1 x1 x1*x1 c1*x1; run; /solution; /solution; /solution;
A. B. C. D. E. F. G. H. I. J. K. L. M. N.
What is the most important predictor of the response variable? A. B. C. D. intercept overhead scrap training correct_answer = "B" Item 5 Which statement is an assumption of logistic regression? A. B. C. D. The sample size is greater than 100. The logit is a linear function of the predictors. The predictor variables are not correlated. The errors are normally distributed. correct_answer = "B" Item 6 When selecting variables or effects using SELECTION=BACKWARD in the LOGISTIC procedure, the business analyst's model selection terminated at Step 3. What happened between Step 1 and Step 2? A. B. C. D. DF increased. AIC increased. Pr > Chisq increased. - 2 Log L increased. correct_answer = "D" Item 7 The LOGISTIC procedure will be used to perform a regression analysis on a data set with a total of 10,000 records. A single input variable contains 30% missing records. How many total records will be used by PROC LOGISTIC for the regression analysis? Enter your numeric answer in the space below. Do not add leading or trailing spaces to your answer. Click the calculator button to display a calculator if needed. correct_answer = "7000" Item 8 An analyst is screening for irrelevant variables by estimating strength of association between each input and the target variable. The analyst is using Spearman correlation and Hoeffding's D statistics in the
CORR procedure. What would likely cause some inputs to have a large Hoeffding and a near zero Spearman statistic? A. B. C. D. nonmonotonic association between the variables linear association between the variables monotonic association between the variables no association between the variables correct_answer = "A" Item 9 An analyst builds a logistic regression model which is 75% accurate at predicting the event of interest on the training data set. The analyst presents this accuracy rate to upper management as a measure of model assessment. What is the problem with presenting this measure of accuracy for model assessment? A. B. C. D. This accuracy rate is redundant with the misclassification rate. It is pessimistically biased since it is calculated from the data set used to train the model. This accuracy rate is redundant with the average squared error. It is optimistically biased since it is calculated from the data used to train the model. correct_answer = "D" Item 10 Refer to the exhibit:
For the ROC curve shown, what is the meaning of the area under the curve? A. B. C. D. percent concordant plus percent tied percent concordant plus (.5 * percent tied) percent concordant plus (.5 * percent discordant) percent discordant plus percent tied correct_answer = "B"</b.seven<>