Vous êtes sur la page 1sur 8

1. What are the Environmental variables in Datastage? 2. Check for Job Errors in datastage 3.

What are Stage Variables, Derivations and Constants? 4. What is Pipeline Parallelism? 5. Debug stages in PX 6. How do you remove duplicates in dataset 7. What is the difference between Job Control and Job Sequence 8. What is the max size of Data set stage? 9. performance in sort stage 10. How to develop the SCD using LOOKUP stage? 12. What are the errors you expereiced with data stage 13. what are the main diff between server job and parallel job in datastage 14. Why you need Modify Stage? 15. What is the difference between Squential Stage & Dataset Stage. When do u us e them. 16. memory allocation while using lookup stage 17. What is Phantom error in the datastage. How to overcome this error. 18. Parameter file usage in Datastage 19. Explain the best approch to do a SCD type2 mapping in parallel job? 20. how can we improve the performance of the job while handling huge amount of data 21. HI How can we create read only jobs in Datastage. 22. how to implement routines in data stage,have any one has any material for da ta stage 23. How will you determine the sequence of jobs to load into data warehouse? 24. How can we Test jobs in Datastage?? 25. DataStage - delete header and footer on the source sequential 26. How can we implement Slowly Changing Dimensions in DataStage?. 27. Differentiate Database data and Data warehouse data? 28. How to run a Shell Script within the scope of a Data stage job? 29. what is the difference between datastage and informatica

30. Explain about job control language such as (DS_JOBS) 32. What is Invocation ID? 33. How to connect two stages which do not have any common columns between them? 34. In SAP/R3, How do you declare and pass parameters in parallel job . 35. Difference between Hashfile and Sequential File? 36. How do you fix the error "OCI has fetched truncated data" in DataStage 37. A batch is running and it is scheduled to run in 5 minutes. But after 10 day s the time changes to 10 minutes. What type of error is this and how to fix it? 38. Which partition we have to use for Aggregate Stage in parallel jobs ? 39. What is the baseline to implement parition or parallel execution method in d atastage job.e.g. more than 2 millions records only advised ? 40. how do we create index in data satge? 41. What is the flow of loading data into fact & dimensional tables? 42. What is a sequential file that has single input link?? 43. Aggregators What does the warning Hash table has grown to xyz . mean?

44. what is hashing algorithm? 45. How do you load partial data after job failed source has 10000 records, Job failed after 5000 records are loaded. This status of the job is abort , Instead of removing 5000 records from target , How can i r esume the load 46. What is Orchestrate options in generic stage, what are the option names. val ue ? Name of an Orchestrate operator to call. what are the orchestrate operators available in datastage for AIX environment. 47. Type 30D hash file is GENERIC or SPECIFIC? 48. Is Hashed file an Active or Passive Stage? When will be it useful? 49. How do you extract job parameters from a file? 50. 1.What about System variables? 2.How can we create Containers? 3.How can we improve the performance of DataStage? 4.what are the Job parameters? 5.what is the difference between routine and transform and function? 6.What are all the third party tools used in DataStage? 7.How can we implement Lookup in DataStage Server jobs? 8.How can we implement Slowly Changing Dimensions in DataStage?. 9.How can we join one Oracle source and Sequential file?. 10.What is iconv and oconv functions?[/quote] Read more: http://www.placementpapers.us/datastage/300-datastage_interview_quest ions_part_i.html#ixzz1W2i7WzPo

Under Creative Commons License: Attribution ******************************************************************************** ************************** 1 1. Difference between Hashfile and Sequential File?. What is modulus? 2 2. What is iconv and oconv functions?. 3 3. How can we join one Oracle source and Sequential file?. 4 4. How can we implement Slowly Changing Dimensions in DataStage?. 5 5. How can we implement Lookup in DataStage Server jobs?. 6 6. What are all the third party tools used in DataStage?. 7 7. what is the difference between routine and transform and function?. 8 8. what are the Job parameters?. 9 9. Plug-in?. 10 10.How can we improve the performance of DataStage jobs?. 11 11.How can we create Containers?. 12 12.What about System variables?. 13 13 What is the use of Usage analysis ? 14 14 Different ways a project can be moved to production ?...example .export-imp ort and Version control. 15 15 What database is the Datastage repository use ? Answer: Universe Database 16 16 How is the scheduling done in the project ? 17 17 Which version of DataStage is used in the project? 18 18 What are the performance tuning required while dealing with large data ? 19 19 What do the reject option in transformer do? 20 What is the architecture of datastage? 21 How do you define and use the job parameters? 22 What is stage variables,system variables,environment varaibles? 23 How to use routines in datastage? 24 What is difference between shared-container and local-container? 25 How do you connect to Oracle? 26 Please explain any ETL process that you have developed? 27 What is hash file? Types of hash files. 28 If you are doing any changes in shared-container will it reflect in all the j obs wherever you used this shared- container? 29 Have u written any custom routines in your project? If so explain? 30 How do you get log info into a file? 31 What is before job subroutine/After job subroutine? When do you use them? 32 How do you backup and restore the project? 33 What is Clear Status File and when do you use it? 34 What is Cleanup Resources and when do you use it? 35 Can I join a flat file and oracle and load into oracle? Is this possible? 36 While loading some data into target suddenly thier is a problem loading proce ss stopped how can u start loading from the records that were left? 37 What are the general problems that u face in Datastage? 38 What are the various reports that could be generated using this Datastage? 39 How to remove blank spaces from data 40 What is Active and Passive stage? 41 What all are the stages you have used in your project? 42 Could DataStage generate test cases? 43 What is difference between hash file and sequential file 44 What is the difference between Transform and routine 45 What is sequencer? 46 How to take backup of project? These are some of DataStage PX Jobs questions which can be asked in interviews. 1) Types of parallel processing 2) What is the SMP(Symmetric MultiProcessing) and MPP (Masiively parallel proces sing)?

3) What is CPU limited , Memory limited and Disk I/O limited jobs ? 4) Can one combine pipeline & partition parallelism? 5) Advantages of PX over server job 6) Is it possible to create user-defined stage in PX? 7) Can I use hash file in PX? 8) What is surrogate key stage? 1) What is the use of APT_DUMP_SCORE ? Ans: To get messages in logs such as no o f processes, no of nodes used. 2) What are the fours types of joins possible in Joiner stage? Ans: Inner, Left Outer, Right Outer, Full outer 3) What are the components of APT_CONFIG_FILE Ans: Nodes, Fastnode, Pools, Resou rce What are the points that needs to be considred while creating the config file? Ans: Available nodes, CPU time, available memory, what other process to be execu ted on same nodes, are there any configurations restrictions ? Eg DB only runs o n certain nodes and ETL cannot run on them, Get breakdown of the resource usage? Is the hardware config SMP, Cluster or MPP? 5) When are wrappers created? Ans:only for executable commands for unix, dos 6) When are buildups created? Ans: More functionality, complex logic needed. 7) When are custom stage created ? .. Ans :new operators need which are not in E E 8) What are different job sequencer stages ? 9) What is iconv and oconv functions?. 10) can we implement Slowly Changing Dimensions in DataStage?. what are the Job parameters?. 9. Plug-in?. 10.How can we improve the performance of DataStage jobs?. 11.How can we create Containers?. 12.What about System variables?. 13 What is the use of Usage analysis ? 14 Different ways a project can be moved to production ?...example .export-import and Version control. 15 What database is the Datastage repository use ? Answer: Universe Database 16 How is the scheduling done in the project ? 17 Which version of DataStage is used in the project? 18 What are the performance tuning required while dealing with large data ? 19 What do the reject option in transformer do? 20) How is parallelism executed? 21) What is RCP? 22) What is orchestrate 23) Difference between join, merge, and lookup stage? 24) What is dataset ? 25) Diffrenece between dataset, fileset and lookup file set? Questions on Data Warehousing concept 1. What is Data Warehouse? 2. What is difference between Data Warehouse and Data Mart ? 3. What is Star schema? 4. What is Snow-flake schema? 5. What is fact and dimension? 6. What is surrogate key? 7. What Normlisation ?Explain 3rd Normlised form? 8. What is the difference between OLTP and OLAP? 9. Are you involved in data modeling ?If yes which tool/tech you are using? 10. Which schema modeling techniques you ever used? 11. What do you mean by summary table? 12. What Degenerated Dimensions 13. What is fact less fact?

Oracle question based on data warehouse? 1 2 3 4 5 6 What is parallel execution What is Bitmap and B-Tree indexes ? Explain Local Vs Global variables What is materialised view What is page size/array size in oracle? What are integrity constraints ? How can one tune SQL s in Oracle?

******************************************************************************** ******************************************************************************** **** 301. How do we do the automation of dsjobs? 302. what is trouble shhoting in server jobs ? what are the diff kinds of errors encountered while running any job? 303. what is Data stage Multi-byte, Single-byte file conversions?how we use that conversions in data stage? 304. What are other Performance tunings you have done in your last project to in crease the performance of slowly running jobs? Staged the data coming from ODBC/OCI/DB2UDB stages or any database on the server using Hash/Sequential files for optimum performance also for data recovery in c ase job aborts.Tuned the OCI stage for ' 305. what is DataStage Multi-byte, Single-byte file conversions in Mainframe job s? what is UTF 8 ? whats use of UTF 8 ? 306. What Happens if RCP is disable ? 307. What are Routines and where/how are they written and have you written any r outines before? Routines are stored in the Routines branch of the DataStage Repository, where yo u can create, view or edit. The following are different types of routines: 1) Tr ansform functions 308. What is version Control? 309. Hi, What are the Repository Tables in DataStage and What are they? 310. I want to process 3 files in sequentially one by one , how can i do that. w hile processing the files it should fetch files automatically . 311. where does unix script of datastage executes weather in clinet machine or i n server.suppose if it eexcutes on server then it will execute ? 312. please list out the versions of datastage Parallel , server editions and in which year they are realised. 313. what are the Job parameters? 314. defaults nodes for datastage parallel Edition

315. Orchestrate Vs Datastage Parallel Extender? 316. Dimensional modelling is again sub divided into 2 types. a)Star Schema - Simple & Much Faster. Denormalized form. b)Snowflake Schema - Co mplex with more Granularity. More normalized form. 317. Tell me the environment in your last projects Give the OS of the Server and the OS of the Client of your recent most project 318. How can we join one Oracle source and Sequential file?. 319. What is Modulus and Splitting in Dynamic Hashed File? In a Hashed File, the size of the file keeps changing randomly. If the size of t he file increases it is called as "Modulus". If the size of the file decreases i t is called as "Splitting 320. Scenario based Question ........... Suppose that 4 job control by the seque ncer like (job 1, job 2, job 3, job 4 )if job 1 have 10,000 row ,after run the j ob only 5000 data has been loaded in target table remaining are not loaded and y our job going to be aborted then.. How can short out the problem. Suppose job sequencer synchronies or control 4 job but job 1 have problem, in th is condition should go director and check it what type of problem showing either data type problem, warning massage, job Asked by: Mukesh Kumar Madhav 321. What is the Batch Program and how can generate ? Batch programe is the programe it's generate run time to maintain by the datasta ge it self but u can easy to change own the basis of your requirement (Extractio n, Transformation,Loading) .Batch progr 322. How many places u can call Routines? Four Places u can call (i) Transform of routine (A) Date Transformation (B) Upst ring Transformation (ii) Transform of the Before & After Subroutines(iii) XML tr ansformation(iv)Web base t Asked by: Mukesh Kumar Madhav 323. How many jobs have you created in your last project? 100+ jobs for every 6 months if you are in Development, if you are in testing 40 jobs for every 6 months although it need not be the same number for everybody 324. what's the difference between Datastage Developers and Datastage Designers. What are the skill's required for this. 325. Could you please help me with a set of questions on Parallel Extender? 326. what is difference between data stage and informatica 327. Suppose if there are million records did you use OCI? if not then what stag e do you prefer? 328. What are types of Hashed File? 329. How do you eliminate duplicate rows? 330. What is DS Designer used for - did u use it? 331. Compare and Contrast ODBC and Plug-In stages? ODBC : a) Poor Performance. b) Can be used for Variety of Databases. c) Can hand le Stored Procedures. Plug-In: a) Good Performance. b) Database specific.(Only o ne database) c) Cannot handle Stored Pr 332. What is project life cycle and how do you implement it?

333. Explain your last project and your role in it.?

334. What are the often used Stages or stages you worked with in your last proje ct? A) Transformer, ORAOCI8/9, ODBC, Link-Partitioner, Link-Collector, Hash, ODBC, A ggregator, Sort. 335. Have you ever involved in updating the DS versions like DS 5.X, if so tell us some the steps you have taken in doing so? Yes. The following are some of the steps; I have taken in doing so:1) Definitely take a back up of the whole project(s) by exporting the project as a .dsx file2 ) See that you are using the same parent 336. What versions of DS you worked with? DS 7.0.2/6.0/5.2 337. If worked with DS6.0 and latest versions what are Link-Partitioner and Link -Collector used for? Link Partitioner - Used for partitioning the data.Link Collector - Used for coll ecting the partitioned data. 338. How did you handle an 'Aborted' sequencer? In almost all cases we have to delete the data inserted by this from DB manually and fix the job and then run the job again. 339. How did u connect with DB2 in your last project? Most of the times the data was sent to us in the form of flat files. The data is dumped and sent to us. In some cases were we need to connect to DB2 for look-up s as an instance then we used ODBC drive 340. Read the String functions in DS Functions like [] -> sub-string function and ':' 341. How would call an external Java function which are not supported by DataSta ge? Starting from DS 6.0 we have the ability to call external Java functions using a Java package from Ascential. In this case we can even use the command line to i nvoke the Java function and write the re 342. The above might rise another question: Why do we have to load the dimension al tables first, then fact tables: As we load the dimensional tables the keys (primary) are generated and these key s (primary) are Foreign keys in Fact tables. 343. Tell me one situation from your last project, where you had faced problem a nd How did u solve it? A. The jobs in which data is read directly from OCI stages are running extremely slow. I had to stage the data before sending to the transformer to make the job s run faster.B. The job aborts 344. Does the selection of 'Clear the table and Insert rows' in the ODBC stage s end a Truncate statement to the DB or does it do some kind of Delete logic. There is no TRUNCATE on ODBC stages. It is Clear table blah blah and that is a d elete from statement. On an OCI stage such as Oracle, you do have both Clear and Truncate options. They are radically di

345. How do you rename all of the jobs to support your new File-naming conventio ns? Create a Excel spreadsheet with new and old names. Export the whole project as a dsx. Write a Perl program, which can do a simple rename of the strings look ing up the Excel file. 346. When should we use ODS? DWH's are typically read only, batch updated on a scheduleODS's are maintained i n more real time, trickle fed constantly 347. how to create batches in Datastage from command prompt Read more: http://www.placementpapers.us/datastage/306-data_stage_interview_ques tions_part_7_a.html#ixzz1iHgYpMsi Under Creative Commons License: Attribution

Vous aimerez peut-être aussi