Académique Documents
Professionnel Documents
Culture Documents
Main Source
\\nicsrv10\TTS\E\ETL\Best Practices\DataStageTechDoc\DataStageTech.doc
Job Naming
Job Naming
Stage Names
Link Names
shortdesc_InTo_stagedesc shortdesc_OutTo_stagedesc
In_linkdesc
Out_linkdesc_action
Lkup_linkdesc
Example
Images copyright claimed by Ascential Software Corporation
Reusable Components
Annotations
Annotations are to be used to explain processing Description annotation shows purpose of job
Annotations
Description Annotation
Images copyright claimed by Ascential Software Corporation
Job Descriptions
Images copyright claimed by Ascential Software Corporation
Become text of description annotation Short description visible in Detail view (Manager)
Stage/Link Naming
the data they access (passive stages) the function they perform (active) for the data they carry
such as Sequential_File_0
Developing Jobs
1.
Keep it simple
jobs with many stages are hard to debug and maintain documentation
2.
4.
Iterative Design
Use Copy or Peek stage as stub Test job in phases small first, then increasing in complexity Use Peek stage to examine records
Example Phase 1
Images copyright claimed by Ascential Software Corporation
Example Phase 2
Images copyright claimed by Ascential Software Corporation
Example Phase 3
Images copyright claimed by Ascential Software Corporation
Transformer Stage
Transformer stage generates code Always include reject link Always test for null value before using a column in a function Be aware of column and stage variable data types
often developer does not pay attention to Stage Variable data type try to maintain the data type as imported
Job Parameters
things that change over time (for example passwords, filter conditions) things that different in different environments (for example DSNs, pathnames, passwords)
Job Parameters
name prompt text (mandatory) type default value (design time) help text
not delimited
APT_DUMP_SCORE
APT_CONFIG_FILE
Two DataSets
Configuration Files
Make a set for 1X, 2X,. Use different ones for test versus production Include as a parameter in each job Automatic scaling