
Re: how to obtain performance data for individual transformations.

Re: Suppose I have this source:
101 a
101 b
101 c
101 d
I want a target like 101 abcd. How will you achieve this? Please give me the answer.

Re: If 3 tables have different columns, e.g. the first table has 4 columns, the second table has 3 columns and the third table has 2 columns, how do we capture the data using the Funnel stage in parallel jobs? ...srinu.thadi

Re: I have a source like this:
balance,drawtime
20000,8.30
50000,10.20
3000,4.00
I want a target like this:
balance,drawtime
20000,20.30
50000,22.20
3000,16.00

Answer # 1
if drawtime < 13 then drawtime = drawtime + 12 else drawtime
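
As a rough illustration of the rule in this answer, here is a minimal Python sketch. It assumes drawtime arrives as a text value in "H.MM" form; the function name shift_drawtime is made up for this example and is not a DataStage function. It applies the rule exactly as the answer states it (add 12 when the hour part is below 13).

```python
def shift_drawtime(drawtime: str) -> str:
    """Add 12 to the hour part when it is below 13, as the answer describes."""
    # Assumption: the value looks like "8.30" (hour, dot, two-digit minutes).
    hours, minutes = drawtime.strip().split(".")
    hour = int(hours)
    if hour < 13:
        hour += 12
    # Keep the minutes text untouched so "4.00" does not lose its zeros.
    return f"{hour}.{minutes}"


if __name__ == "__main__":
    for value in ["8.30", "10.20", "4.00"]:
        print(value, "->", shift_drawtime(value))
```

Working on the text form avoids floating-point surprises such as 4.00 turning into 16.0.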



Re: I have to extract data from a flat file. The flat file has 10 records. I have to extract the 1st and 5th records every time. How is it done?

Re: How to delete the data in the target table after it is loaded.

Re: What is data mapping?

Answer # 1
Copying the data from input links to output links.

Re: Where do you use UNIX commands as an ETL developer?

Answer # 2
You can call them in the following places in ETL (DataStage):
1. Sequential File stage (Stage -> Stage uses filter commands).
2. Before and after stage subroutines in the Job Parameters tab.
3. Before and after stage subroutines in the Transformer stage.
4. Job sequences, using the Execute Command activity and the Routine activity.
5. Using DataStage routines.
6. Using the routines called ExecSH and ExecSHSilent.

Re: Types of errors in DataStage?

Answer # 1
- source file not found
- null value populated into a non-nullable column
- died with signal 11 (due to server down)
- fatal errors
- NLS warning
- metadata mismatch
- datatype mismatch
- parallel loading not enabled
- field size different
- data type size differs between source and target
- column mismatch
- file opening error
- mutex error
- process time out...

Re: How to do performance tuning in DataStage?

Answer # 1 (Prams [WIPRO])
1. Staged the data coming from ODBC/OCI/DB2UDB stages or any database on the server using Hash/Sequential files, for optimum performance and also for data recovery in case the job aborts.
2. Tuned the OCI stage's 'Array Size' and 'Rows per Transaction' numerical values for faster inserts, updates and selects.
3. Tuned the 'Project Tunables' in Administrator for better performance.
4. Used sorted data for the Aggregator.
5. Sorted the data as much as possible in the DB and reduced the use of DS-Sort for better performance of jobs.
6. Removed unused data from the source as early as possible in the job.
7. Worked with the DB admin to create appropriate indexes on tables for better performance of DS queries.
8. Converted some of the complex joins/business logic in DS to stored procedures on DS for faster execution of the jobs.
9. If an input file has an excessive number of rows and can be split up, then use standard logic to run jobs in parallel.
10. Before writing a routine or a transform, make sure that the required functionality is not already available in one of the standard routines supplied in the sdk or ds utilities categories. Constraints are generally CPU intensive and take a significant amount of time to process. This may be the case if the constraint calls routines or external macros, but if it is inline code then the overhead will be minimal.
11. Try to have the constraints in the 'Selection' criteria of the jobs themselves. This keeps unnecessary records from even getting in before joins are made.
12. Tuning should occur on a job-by-job basis.
13. Use the power of the DBMS.
14. Try not to use a Sort stage when you can use an ORDER BY clause in the database.
15. Using a constraint to filter a record set is much slower than performing a SELECT … WHERE….
16. Make every attempt to use the bulk loader for your particular database. Bulk loaders are generally faster than using ODBC or OLE.
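
Several of the points above (the 'Selection' criteria, ORDER BY instead of a sort stage, SELECT … WHERE instead of a constraint) come down to letting the database filter and sort before rows reach the job. The sketch below only illustrates that idea in Python with an in-memory SQLite database; the orders table, its columns and the function names are hypothetical and have nothing to do with any DataStage API.

```python
import sqlite3


def load_filtered_sorted(conn, min_amount):
    """Let the database filter (WHERE) and sort (ORDER BY) the rows."""
    cursor = conn.execute(
        "SELECT id, amount FROM orders WHERE amount >= ? ORDER BY amount",
        (min_amount,),
    )
    return cursor.fetchall()


def demo():
    # Hypothetical sample table, built in memory so the sketch is self-contained.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount INTEGER)")
    conn.executemany(
        "INSERT INTO orders (id, amount) VALUES (?, ?)",
        [(1, 500), (2, 50), (3, 300), (4, 10)],
    )
    rows = load_filtered_sorted(conn, 100)
    conn.close()
    return rows


if __name__ == "__main__":
    print(demo())
```

Only the rows that pass the WHERE clause leave the database, already in the order the downstream step needs, so no separate filter or sort step is required afterwards.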

Re: How to do performance tuning in DataStage?

Answer # 2 (Raji)
1. Avoid using a Transformer stage just for renaming columns, because it will slow down the performance of the jobs. Try to use a Copy stage for renaming the columns.
2. Take only the required columns during table-level lookups. Remove all the unnecessary columns.
3. Use partitioning techniques. The choice depends on the requirement, and it will increase performance as well.

Re: How to do performance tuning in DataStage?

Answer # 3
1. By using the hashfile stage we can improve performance. For a hashfile stage we can define the read cache size and write cache size; the default size is 128 MB.
2. We can also improve performance on active-to-active links by enabling the row buffer; the default row buffer size is 128 KB.
3. By removing unwanted columns.
4. By selecting appropriate update actions.
5. In parallel jobs, replacing a Transformer with a Copy or Filter stage can improve performance. If you are using more than 5 Transformers in a job, performance will degrade, so to avoid Transformers you can use Copy or Filter.
6. In server jobs, we can also improve performance by using the Link Partitioner, Link Collector and IPC stages.

Re: How can we move a DataStage job from the Development to the Testing environment with the help of a DataStage job using UNIX commands?

Answer # 2
dsjob -import dsxfile ....

Re: How can you execute a SQL query through UNIX? What is the primary key for a dimension table? What is the primary key for a fact table?

Answer # 3
Connect to sqlplus from UNIX by typing sqlplus.
user name: user_name@database
password:
Then type @ followed by the path name of the file containing the SQL query and the query file name (.sql),
e.g. @path/file.sql
and press Enter.

Re: A sequential file has 2 columns available and I want to load only one column into the target. How do I do it?

Answer # 6
We can do it using Copy or Modify. Using Copy is better because it gives better performance.

You can also use a UNIX command in the Sequential File stage, or do this in a Transformer stage. In the Sequential File stage you get a Filter option, where you can use the cut command to cut out particular columns.
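
To make the column-cutting idea concrete, here is a small Python sketch of what a cut-style filter does to delimited text. It is only an illustration and is not what DataStage runs internally; the function name keep_columns and the comma delimiter are assumptions made for the example.

```python
import csv
import io


def keep_columns(text, keep, delimiter=","):
    """Return the delimited text with only the columns whose 0-based index is in keep."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    out = io.StringIO()
    writer = csv.writer(out, delimiter=delimiter, lineterminator="\n")
    for row in reader:
        if not row:
            # Skip blank lines instead of failing on a missing column.
            continue
        writer.writerow([row[i] for i in keep])
    return out.getvalue()


if __name__ == "__main__":
    source = "101,a\n102,b\n103,c\n"
    print(keep_columns(source, [0]), end="")
```

Passing [0] keeps the first column only, which corresponds to selecting field 1 with cut.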

Re: Hi, this is Vijay. How can you read the data from a sequential file in parallel?

Answer # 1
Hi Vijay,

You can do it by setting the 'No. of readers per node' property to a value >1. When you do this, n readers are created on the node and each reader reads a block of data from the sequential file.

If you have multiple input files (pattern), you can set the property 'Read from multiple nodes' to 'yes'.

These 2 properties of sequential files are mutually exclusive.

Hi Amar,

What you sent is right, but "read from multi-nodes" is used for fixed-width files only; "no. of readers per node" can be used for variable-length files.

Re: I want to send SYSDATE as a parameter, so what command should I use to call SYSDATE?

Answer # 2
Among the DS macros there is "DSJobStartDate"; by using it we can get the system date.

Re: col1 contains 123 abc 234 def jkl 768 opq 567 789, but I want two targets: target1 contains only numeric values and target2 contains only alphabetic values, like this:
trg1: 123 234 768 567 789
trg2: abc def jkl opq

Answer # 5
source -> transformer -> filter1 -> transformer1
                      -> filter2 -> transformer2
join stage -> target

source: take the input.
transformer: append a new column and apply the character function "is alpha".
filter1: separate the rows where the new column is 0.
filter2: separate the rows where the new column is 1.
transformer1: add another column, @outrownum.
transformer2: add another column, @outrownum.
We join on the @rownum column and we will get the required answer.
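
The core of this answer is the "is alpha" flag that routes each value to one of two outputs. A minimal Python sketch of that routing follows; it covers only the flag-and-split step, not the row numbering or the join, and the function name split_numeric_alpha is made up for the example.

```python
def split_numeric_alpha(values):
    """Route all-digit values to one list and all-letter values to another."""
    numeric, alpha = [], []
    for value in values:
        if value.isalpha():
            alpha.append(value)
        elif value.isdigit():
            numeric.append(value)
        # Mixed values such as "12ab" match neither test and are dropped here;
        # the original question does not say how they should be handled.
    return numeric, alpha


if __name__ == "__main__":
    col1 = ["123", "abc", "234", "def", "jkl", "768", "opq", "567", "789"]
    trg1, trg2 = split_numeric_alpha(col1)
    print("trg1:", " ".join(trg1))
    print("trg2:", " ".join(trg2))
```

Both outputs keep the source order, which matches the trg1 and trg2 lists given in the question.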
