Académique Documents
Professionnel Documents
Culture Documents
Center of Excellence
Notices
This information was developed for products and services offered in the U.S.A.
IBM may not offer the products, services, or features discussed in this document in other countries. Consult your local
IBM representative for information on the products and services currently available in your area. Any reference to an
IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service
may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual
property right may be used instead. However, it is the user’s responsibility to evaluate and verify the operation of any
non-IBM product, program, or service.
IBM may have patents or pending patent applications covering subject matter described in this document. The
furnishing of this document does not grant you any license to these patents. You can send license inquiries, in writing,
to:
For license inquiries regarding double-byte (DBCS) information, contact the IBM Intellectual Property Department in
your country or send inquiries, in writing, to:
The following paragraph does not apply to the United Kingdom or any other country where such provisions are
inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS
PUBLICATION ″AS IS″ WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED,
INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT,
MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of
express or implied warranties in certain transactions, therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically made to the
information herein; these changes will be incorporated in new editions of the publication. IBM may make
improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without
notice.
Any references in this information to non-IBM Web sites are provided for convenience only and do not in any manner
serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this IBM
product and use of those Web sites is at your own risk.
IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any
obligation to you.
Licensees of this program who wish to have information about it for the purpose of enabling: (i) the exchange of
information between independently created programs and other programs (including this one) and (ii) the mutual use
of the information which has been exchanged, should contact:
IBM Corporation
J46A/G4
555 Bailey Avenue San Jose, CA 95141-1003 U.S.A.
Such information may be available, subject to appropriate terms and conditions, including in some cases, payment of
a fee.
The licensed program described in this document and all licensed material available for it are provided by IBM under
terms of the IBM Customer Agreement, IBM International Program License Agreement or any equivalent agreement
between us.
Any performance data contained herein was determined in a controlled environment. Therefore, the results obtained
in other operating environments may vary significantly. Some measurements may have been made on development-
level systems and there is no guarantee that these measurements will be the same on generally available systems.
Furthermore, some measurements may have been estimated through extrapolation. Actual results may vary. Users of
this document should verify the applicable data for their specific environment.
Information concerning non-IBM products was obtained from the suppliers of those products, their published
announcements or other publicly available sources. IBM has not tested those products and cannot confirm the
accuracy of performance, compatibility or any other claims related to non-IBM products. Questions on the capabilities
of non-IBM products should be addressed to the suppliers of those products.
All statements regarding IBM’s future direction or intent are subject to change or withdrawal without notice, and
represent goals and objectives only.
This information is for planning purposes only. The information herein is subject to change before the products
described become available.
This information contains examples of data and reports used in daily business operations. To illustrate them as
completely as possible, the examples include the names of individuals, companies, brands, and products. All of these
names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely
coincidental.
COPYRIGHT LICENSE:
This information contains sample application programs in source language, which illustrate programming techniques
on various operating platforms. You may copy, modify, and distribute these sample programs in any form without
payment to IBM, for the purposes of developing, using, marketing or distributing application programs conforming to
the application programming interface for the operating platform for which the sample programs are written. These
examples have not been thoroughly tested under all conditions. IBM, therefore, cannot guarantee or imply reliability,
serviceability, or function of these programs.
Each copy or any portion of these sample programs or any derivative work, must include a copyright notice as
follows: (C) (your company name) (year). Portions of this code are derived from IBM Corp. Sample Programs. (C)
Copyright IBM Corp. _enter the year or years_. All rights reserved. If you are viewing this information softcopy, the
photographs and color illustrations may not appear.
Trademarks
IBM trademarks and certain non-IBM trademarks are marked at their first occurrence in this document.
Adobe, Acrobat, Portable Document Format (PDF), and PostScript are either registered trademarks or trademarks of
Adobe Systems Incorporated in the United States, other countries, or both.
Cell Broadband Engine and Cell/B.E. are trademarks of Sony Computer Entertainment, Inc. in the United States,
other countries, or both and is used under license therefrom.
Intel®, Intel logo, Intel Inside® logo, Intel Centrino™, Intel Centrino logo, Celeron®, Intel Xeon™, Intel
SpeedStep®, Itanium® and Pentium® are trademarks of Intel Corporation in the United States, other countries, or
both.
Java and all Java-based trademarks are trademarks of Sun Microsystems, Inc. in the United States, other countries, or
both.
Microsoft, Windows, Windows NT® and the Windows logo are trademarks of Microsoft Corporation in the United
States, other countries, or both.
UNIX is a registered trademark of The Open Group in the United States and other countries.
Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.
ITIL® is a registered trademark and a registered community trademark of the Office of Government Commerce, and
is registered in the U.S. Patent and Trademark Office.
IT Infrastructure Library® is a registered trademark of the Central Computer and Telecommunications Agency which
is now part of the Office of Government Commerce.
Other company, product or service names may be trademarks or service marks of others.
Document Goals
Intended Use This document provides a checklist of prerequisites that must be verified before installing
IBM Information Server. It is intended to minimize the time required to successfully install
and configure IBM Information Server and to document the requirements in one area.
The information in this document is based on field experiences. In some cases, the
recommendations documented here may differ from the product documentation. Except
where noted, this document is intended to supplement, not replace the product
documentation.
Target Audience The primary audience for this document is administrators of IBM Information Server.
Information in certain sections may also be relevant for technical architects, system
administrators, and developers.
Product Version This document is intended for the following product releases:
IBM Information Server 8.0.1 and 8.0.1 Fix Pack 1a.
Document Conventions
This document uses the following conventions:
Convention Usage
Bold In syntax, bold indicates commands, function names, keywords, and options that must be
input exactly as shown. In text, bold indicates keys to press, function names, and menu
selections.
Italic In syntax, italic indicates information that you supply. In text, italic also indicates UNIX
commands and options, file names, and pathnames.
Plain In text, plain indicates Microsoft Windows commands and options, file names, and
pathnames.
Bold Italic Indicates: important information.
Lucida Indicates examples of source code and system output.
Console
Lucida Bold In examples, indicates characters that you type or keys that you press (for example,
<Return>).
Lucida Blue In examples, illustrates the operating system command prompt.
Î A right arrow between menu commands indicates that you should choose each command in
sequence. For example, “Choose File ÎExit” means that you should choose File from the
menu bar, and then choose Exit from the File pull-down menu.
This line The continuation character ¬ is used in source code examples to indicate a line that is too
¬ continues long to fit on the page but that must be entered as a single line on the screen.
Interaction with our example system will usually include the system prompt (in blue) and the
command, most often on two or more lines.
If appropriate, the system prompt will include the user name and directory for context. For example:
Feedback
We value your input and suggestions for continuous improvement to this content. Direct any questions,
comments, corrections, or suggested additions to cedifeed@us.ibm.com.
Table of Contents
1 IBM INFORMATION SERVER PREINSTALLATION OVERVIEW ...................................................................8
2 PREINSTALLATION CHECKLIST ........................................................................................................................10
2.1 REVIEW RELEASE NOTES ......................................................................................................................................11
2.2 REVIEW PLANNING, INSTALLATION, AND CONFIGURATION GUIDE .......................................................................12
2.3 (IF APPLICABLE) REVIEW MIGRATING TO IBM INFORMATION SERVER .................................................................12
2.4 CHOOSE AND VALIDATE AN INSTALLATION ARCHITECTURE / TOPOLOGY ............................................................13
2.5 VALIDATE IBM INFORMATION SERVER SYSTEM REQUIREMENTS FOR ALL TIERS .................................................17
2.6 VERIFY DOMAIN AND WEBSPHERE APPLICATION SERVER REQUIREMENTS .........................................................17
2.7 VERIFY DATABASE REQUIREMENTS FOR THE METADATA REPOSITORY................................................................18
2.8 (IF APPLICABLE) VERIFY THE DATABASE REQUIREMENTS FOR THE WEBSPHERE INFORMATION ANALYZER
ANALYSIS DATABASE..........................................................................................................................................................19
2.9 VERIFY AND CONFIGURE DISKS, VOLUME GROUPS, AND FILE SYSTEMS ..............................................................20
2.10 VERIFY AND CONFIGURE OPERATING SYSTEM AND RESOURCE LIMITS ................................................................26
2.11 VERIFY CONNECTIVITY AND NETWORK CONFIGURATION.....................................................................................34
2.12 CONFIGURE OPERATING SYSTEM USERS, GROUPS, AND PERMISSIONS .................................................................35
2.13 VERIFY AND INSTALL C++ COMPILER AND/OR RUNTIME LIBRARIES ....................................................................39
2.14 VERIFY IBM INFORMATION SERVER CONNECTOR REQUIREMENTS ......................................................................39
2.15 DOWNLOAD IBM INFORMATION SERVER INSTALLATION AND FIX PACK PACKAGES ............................................39
2.16 PERFORM COMPLETE SYSTEM BACKUP .................................................................................................................40
APPENDIX A: PREINSTALLATION CHECKLIST .......................................................................................................41
APPENDIX B: INSTALLATION AND CONFIGURATION CHECKLIST ..................................................................42
APPENDIX C: IBM INFORMATION SERVER INSTALLATION SETTINGS ..........................................................43
APPENDIX D: ONLINE DOCUMENTATION RESOURCES........................................................................................45
APPENDIX E: NETWORK PORTS USED BY IBM INFORMATION SERVER ........................................................46
APPENDIX F: GLOSSARY.................................................................................................................................................47
APPENDIX G: UTILIZING HP-UX 11 MEMORY WINDOWS.....................................................................................48
APPENDIX H: EXAMPLE USER SETUP FOR UNIX ENVIRONMENTS ..................................................................50
IBM Information Server is installed in four logical tiers (also called layers in the installation wizard
and documentation):
Client
Domain (platform services)
Metadata repository
Engine
Product components are installed in each tier depending on the installation selections that are made.
The IBM Information Server documentation (PDF books) can be installed on any machine. The IBM
Information Server information center (HTML) is always installed with the platform services and
available from the IBM Information Server console and Web console. The information center contains
information that is common to all IBM Information Server product modules (such as installation
information), as well as complete information for the following product modules: IBM WebSphere
Business Glossary, IBM WebSphere Information Analyzer, and IBM WebSphere Information Services
Director. Other product modules are documented in HTMLHelp and PDF books.
To ensure a successful installation that meets functional and performance requirements, it is critical
that overall planning and prerequisites be met for all tiers. This document provides a detailed
methodology for planning an IBM Information Server installation.
Although minimum system requirements are referenced in this checklist, capacity planning is
outside the scope of this document.
Where possible, links are provided for additional details and product documentation. The information
in this document is based on field experiences. In some cases, the recommendations documented here
may differ from the product documentation. Except where noted, this document is intended to
supplement, not replace the product documentation.
To install the product documentation in PDF format, choose “Documentation” during the product
installation. Additional and updated documentation for select product modules is available through the
IBM Information Server Information Center at the following URL:
http://publib.boulder.ibm.com/infocenter/iisinfsv/v8r0/index.jsp
2 Preinstallation Checklist
The following checklist outlines the areas that must be reviewed and the steps that must be completed
prior to installing IBM Information Server. A copy of this checklist is also included in Appendix A:
Complete Task
1) Review the release notes for IBM Information Server, IBM WebSphere
Application Server, and (if applicable) IBM DB2.
2) Review
IBM Information Server Planning, Installation and Configuration Guide.
3) If migrating from previous versions of WebSphere DataStage or WebSphere
QualityStage, review Migrating to IBM Information Server Version 8.
4) Choose and validate the installation architecture and topology.
5) Validate system requirements for all tiers: engine, domain, repository, and
client.
6) Verify domain (WebSphere Application Server) requirements.
7) Verify database requirements for the metadata repository.
8) (if applicable) Verify database requirements
for the WebSphere Information Analyzer analysis database.
9) Verify and configure disks, volume groups, and file systems.
10) Verify and configure operating system and resource limits.
11) Verify connectivity and network configuration.
12) Configure operating system users, groups, and associated permissions.
13) Verify and install a C++ compiler and/or runtime libraries.
14) Verify IBM Information Server connector requirements.
15) Download installation and fix pack packages for IBM Information Server,
WebSphere Application Server, and DB2.
16) Perform a complete system backup.
Specific details on each step may be found in subsequent sections of this document, the release notes,
and IBM Information Server Planning, Installation, and Configuration Guide.
When reviewing release notes, be sure to choose the version that matches the base installed version of
IBM Information Server, for example 8.0.1.
Any IBM Information Server fix packs and patches that are applied after a base installation have
separate, corresponding release notes that should also be reviewed. For example, 8.0.1 Fix Pack 1a
would be applied to an existing 8.0.1 base installation.
Release notes for WebSphere Application Server Version 6.0.2 are available at:
http://publib.boulder.ibm.com/infocenter/wasinfo/v6r0/topic/com.ibm.websphere.base.doc/info/aes/ae/
v6rn.html
Complete documentation for WebSphere Application Server Version 6.0.2 is available through the
WebSphere Application Server Information Center at:
http://publib.boulder.ibm.com/infocenter/wasinfo/v6r0/topic/com.ibm.websphere.base.doc/info/welco
me_base.html
When installing IBM Information Server, if you choose to install the “Repository Layer” option, a new
copy of DB2 V9.1 will be installed. Release notes for DB2 V9.1 can be found at:
http://publib.boulder.ibm.com/infocenter/db2luw/v9/topic/com.ibm.db2.udb.doc/doc/c0023859.htm
IBM Information Server Planning, Installation, and Configuration Guide is part of the IBM
Information Server documentation, which is included with the product installation package and on the
IBM Web site:
Note that the PDF documentation (the “Documentation layer” option) can be installed separately,
without installing the rest of the IBM Information Server.
Note that the PDF documentation (the “Documentation layer” option) can be installed separately,
without installing the rest of the IBM Information Server.
In any IBM Information Server installation, the release level (version + fix pack + patches) must match
on all layers.
IMPORTANT: Although the architecture of IBM Information Server offers many theoretical
deployment architectures, only the following subset of configurations is recommended and
supported.
Any deviation from these deployments must be reviewed by IBM Support and Engineering.
Although the IBM Information Server System Requirements state a minimum of 4GB of RAM, in this
configuration the single server housing the metadata repository, domain, and engine should have 8GB
of RAM, in addition to meeting the combined disk requirements outlined in the IBM Information
Server System Requirements.
Because the complexity of tracking the state of different layers is simplified, the two-tier deployment is
optimal for high availability (failover) scenarios.
In the three-tier configuration, both back-end servers must be located in the same physical data center
and should be connected to the same network subnet.
In the four-tier configuration, all servers must be located in the same physical data center and should be
connected to the same network subnet.
In a cluster or grid deployment, all engine servers must be the same operating system, must be located
in the same physical data center, and should be connected by a dedicated, private high-speed network
connection.
When the clients connect to IBM Information Server via a Wide Area Network (WAN), a remote
desktop solution like Citrix or Windows Terminal Server can be used to minimize the amount of data
that is normally transmitted between the IBM Information Server clients and server. It is important to
note that the system that is hosting Citrix or Windows Terminal Server be installed in the same subnet
in LAN that is used for the IBM Information Server systems.
2.5 Validate IBM Information Server System Requirements for all Tiers
After selecting the installation topology, verify the IBM Information Server system requirements for
the client, metadata repository, domain, and engine.
On most platforms, when you install IBM Information Server, you can install a new copy of
WebSphere Application Server or use an existing installation.
On some 64-bit platforms (64-bit Red Hat Linux, 64-bit SUSE-Intel Linux, HP-UX Itanium), you must
download the WebSphere Application Server 6.0.2 installation program and fix packs from IBM
Passport Advantage . Specific instructions are included in the IBM Information Server System
Requirements:
http://www.ibm.com/support/docview.wss?uid=swg27008923
To install the domain layer in an existing installation of WebSphere Application Server, the
configuration must meet the following requirements:
WebSphere Application Server Version 6.0.2 Fix Pack 11 or later fix pack
Standalone (not network) deployment
New, empty profile
IMPORTANT: To minimize risk and to ensure that the domain requirements are satisfied, use
the version of WebSphere Application Server that is bundled with the IBM Information Server
installation program, except in specific configurations (64-bit installations of Red Hat Linux,
SUSE-Intel Linux, or HP-UX Itanium).
When you install IBM Information Server, you can install a new copy of DB2 V9.1 or use an existing
installation. If you want to use Microsoft SQL Server or Oracle, you must install and configure them
before you install IBM Information Server.
Specific hardware and software requirements for DB2 can be found online:
To install the metadata repository on an existing installation of DB2, it must meet the following
requirements:
DB2 Enterprise Server Edition Version 9.1
Database cannot be partitioned
On some 64-bit platforms (64-bit Red Hat Linux, 64-bit SUSE-Intel Linux, HP-UX Itanium) you must
download the DB2 V9.1 installation program and fix packs from IBM Passport Advantage. Specific
instructions are included in the IBM Information Server System Requirements:
http://www.ibm.com/support/docview.wss?uid=swg27008923
During the installation of IBM Information Server, if you choose to install the metadata repository on a
new copy of DB2, the metadata repository database will be created as part of the installation.
If you choose to install the metadata repository on an existing installation of DB2, Oracle Database
10g, or Microsoft SQL Server 2005 the metadata repository database must be created before the IBM
Information Server installation.
Database creation scripts are included in the DatabaseSupport subdirectory of the IBM Information
Server installation. Full details on creating the metadata repository database can be found in the IBM
Information Server Planning, Installation, and Configuration Guide.
The metadata repository database should be closely monitored as the database grows to ensure
that sufficient space is available in the tablespaces and underlying file systems.
2.8 (if applicable) Verify the Database Requirements for the WebSphere
Information Analyzer Analysis Database
If you are deploying WebSphere Information Analyzer, the analysis database can be installed on DB2
V9.1, Oracle Database 10g R2, or Microsoft SQL Server 2005. By default, the analysis database is
named “iadb”.
When you install IBM Information Server, you can install a new copy of DB2 V9.1 or use an existing
installation. If you want to use Microsoft SQL Server or Oracle, you must install and configure them
before you install IBM Information Server.
If you choose to install the WebSphere Information Analyzer analysis database on an existing
installation of DB2, Oracle Database 10g, or SQL Server 2005:
The analysis database must be created before the IBM Information Server installation
Database connectivity must be configured from the domain tier to the analysis database.
Database creation scripts are included in the DatabaseSupport subdirectory of the IBM Information
Server installation. Full details on creating the analysis database can be found in the IBM Information
Server Planning, Installation, and Configuration Guide.
As a rough guideline for analysis database sizing, start with a minimum of 300GB and
plan to grow to 4x the total size of all analyzed source data.
Actual analysis database sizing depends on the source analysis requirements, total size of all
analyzed source data, number of keys and attributes, and the retention policy for existing
analysis results and baselines. A detailed sizing guideline for the WebSphere Information
Analyzer analysis database is available from IBM Information Server Support.
2.9 Verify and Configure Disks, Volume Groups, and File Systems
2.9.1 RAID / SAN Configuration
IBM Information Server uses file system mount points for its installation, libraries, temporary storage,
and Data Set storage.
The following guidelines can assist in the configuration of RAID or SAN technologies:
a) Minimize contention between temporary (scratch, buffer, and sort) and data file systems.
b) Minimize contention between the disks and controllers associated with IBM Information Server
file systems and other applications or servers.
c) Consider isolating multiple mount points to separate high-speed disk interconnects and
controllers.
d) Consider the tradeoff between the granularity of file system(s) and underlying configuration vs.
available, unused storage.
e) Don’t create overly-complex device configurations because these can be difficult to administer
and may not offer noticeable performance improvement.
f) If possible, test your configuration with expertise from hardware, storage, operating system, and
application (IBM Information Server) resources.
g) Remember that previous experience in designing I/O systems for non-parallel engines might
lead to non-optimal configurations for IBM Information Server.
The optimal disk configuration will strike a balance between cost, complexity, ease of administration,
and ultimate performance.
Database storage
- Metadata repository (“xmeta”)
- (if applicable) WebSphere Information Analyzer analysis database (“iadb”)
IMPORTANT: Each storage class should be isolated in separate file systems to accommodate
their different performance and capacity characteristics and backup requirements.
The default installation directories are best suited for small prototype environments.
On UNIX platforms, using the supplied DB2 database engine, databases are created in the home
directory of the DB2 instance owner, by default:
/home/db2inst1/
If you pre-create the DB2 user accounts, you can choose the target file system for this home directory.
When using an existing installation of DB2 V9.1, Oracle Database 10g, or Microsoft SQL Server 2005,
tablespaces are created by the database administrator using existing file systems or raw disk devices.
IMPORTANT: The metadata repository and (if applicable) WebSphere Information Analyzer
analysis databases should be closely monitored to ensure that sufficient space is available in the
tablespaces and underlying file systems as the databases grow. Regular, scheduled backups
should be performed on these databases.
During the installation process, the Projects subdirectory is created in the IBM Information Server
installation directory. By default, the WebSphere DataStage and QualityStage Administrator client
creates projects in this Projects subdirectory:
/opt/IBM/InformationServer/Server/Projects/
During installation, WebSphere Information Analyzer also creates its own WebSphere DataStage
project within this directory, which is used to run all analysis jobs.
b) On most operating systems, it is possible to create file systems at non-root levels. Create a
separate file system for the Projects directory within the IBM Information Server installation.
Be sure to backup any existing projects (retaining ownership and permissions) before mounting
a separate file system over the existing /opt/IBM/InformationServer/Projects directory.
c) For cluster or grid implementations, it is generally best to share the Projects file system across
servers (at the same mount point).
Project names cannot use the following reserved words: ds, DS, uv, or UV.
Projects names should be maintained in unison with source code control. As projects are promoted
through source control, the name of the phase and the project name should reflect the version, in the
form:
<Phase>_<ProjectName>_<version>
IMPORTANT: The project file system should be monitored to ensure adequate free space
remains. If the Project file system runs out of free space during runtime activity, the repository
may become corrupted, requiring a restore from backup.
The IBM Information Server installation program creates a default parallel configuration file that
references the default Datasets and Scratch directories:
/opt/IBM/InformationServer/Configurations/default.apt
Parallel configuration files are used to assign resources (such as processing nodes, disk and scratch file
systems) at runtime when a job is run by the IBM Information Server engine. Parallel configuration
files are discussed in detail in the IBM WebSphere DataStage Parallel Job Developer Guide (provided
as part of the installed PDF documentation).
If you are using an existing Oracle Database 10g R2 or Microsoft SQL Server 2005 database server for
the metadata repository or WebSphere Information Analyzer analysis database, reference the database
vendor’s system requirements.
(For readability, the table values in this section include commas; omit them when setting parameters.)
(For readability, the table values in this section include commas; omit them when setting parameters.)
On AIX platforms, tuning the following additional kernel parameters may improve performance for the
IBM Information Server parallel engine. Work with your AIX system administrator to determine
optimal values:
After installing DB2 on HP-UX, run the db2osconf command to verify recommended kernel settings.
(For readability, the table values in this section include commas; omit them when setting parameters.)
On 32-bit platforms, HP-UX limits the maximum size of a shared memory segment to 1.75GB. When
processing extremely large in-memory reference tables in a WebSphere DataStage Lookup stage, it
may be necessary to configure HP-UX memory windowing. This process is detailed in Appendix G:
Utilizing HP-UX 11 Memory Windows.
(For readability, the table values in this section include commas; omit them when setting parameters.)
After installing DB2 on Solaris 9, run the db2osconf command to verify recommended kernel settings.
(For readability, the table values in this section include commas; omit them when setting parameters.)
After installing DB2 on Solaris 10, run the db2osconf command a to verify recommended kernel
settings.
(For readability, the table values in this section include commas; omit them when setting parameters.)
IBM Information Server requires access to a number of TCP ports, which must be open if necessary,
through firewall configuration. Note that related connectivity (for example, databases, and enterprise
applications) may require additional port assignments. Consult vendor-supplied documentation for
their requirements.
On Windows environments, the installation program must be run from a local Administrator
account. This user must have read/write access to the target installation directories. The
installation cannot be run from a Windows Domain administrator or by a Domain administrator
that is part of the local Administrators group.
IMPORTANT: These users can be created by the installation program, but this is not
recommended.
The security configuration of many operating systems (for example, AIX) requires new users to
login before an account is activated.
Instructions on creating users can be found in the IBM Information Server Planning, Installation, and
Configuration Guide and in the IBM Information Server Administration Guide. A simple UNIX user
setup is provided in Appendix H: Example User Setup for UNIX Environments.
a) Because of the way the IBM Information Server installation program parses its parameters,
passwords should not include embedded dollar signs (“$”).
When using the internal user registry, IBM Information Server users must be mapped to at least
one operating system user.
When using the WebSphere Application Server OS or LDAP configuration, IBM Information
Server can be configured to share the user registry with the engine (WebSphere DataStage and
QualityStage) registry. This eliminates the need to individually map each IBM Information
Server user to an engine operating system user.
WebSphere DataStage supports three basic categories of users: Managers, Developers, and Operators.
These are implemented as the secondary group for each category of user. That is, the primary group ID
(GID) of every WebSphere DataStage user must be the same as the WebSphere DataStage
Administrator (for example, dstage), and the secondary GID of a user in one of these categories must
be one of these groups. The WebSphere DataStage and QualityStage Administrator client is used to
assign roles to those users who are members of each group. Minimally, the only group that must be
created is the primary group of the WebSphere DataStage super-user. This group and the super-user
(dsadm) are used to secure files and perform software maintenance.
An effective method of specifying the groups and users is a table such as the one below. On our
example system, users and groups in bold already exist. The remaining groups and users are to be
created. Note that we describe our requirements in terms of business functions wherever possible and
identify employees, contractors and “virtual users” such as dsadm.
1
Autosys is a popular job scheduler for UNIX and is used as an example.
IBM Information Server Preinstallation Checklist April 25, 2008 37 of 50
© Copyright IBM Corp. 2008. All rights reserved.
Information Platform and Solutions
Center of Excellence
If you want to set up the system so that it distinguishes between product managers, developers, and
operators, you should set up secondary groups for each class of user. Each user is then allocated to the
product manager, developer, or operator secondary group (but not to more than one secondary group).
You can then use the WebSphere DataStage and QualityStage Administrator client to assign the
appropriate WebSphere DataStage user role to the secondary groups. Operators cannot use the
WebSphere DataStage and QualityStage Designer client, and they only see released jobs in the
WebSphere DataStage and QualityStage Director client. Neither Operators nor Developers can create
protected projects or add anything to them. IBM Information Server adds the Super Operator role,
which removes the restrictions on the Operator role.
For more information about setting up users, see “Setting up security” in IBM Information Server
Administration Guide and “Assigning WebSphere DataStage roles” in IBM WebSphere DataStage and
QualityStage Administrator Client Guide, both of which are installed with the PDF documentation. The
IBM Information Server Administration Guide is also available on the IBM Web site:
PDF: http://www.ibm.com/support/docview.wss?rs=14&uid=swg27008803
IBM Information Server Information Center:
http://publib.boulder.ibm.com/infocenter/iisinfsv/v8r0/index.jsp.
If you are logged into a domain account, it must be part of the local administrators group on the server
you are installing, and you must have network access to the Windows domain controller for
authentication.
The administrative user that you are logged in as during installation of WebSphere DataStage will
become the “owner” or administrator of the WebSphere DataStage installation.
After installation of WebSphere DataStage, you can use the Windows user management tools to define
groups of associated users to restrict access to individual WebSphere DataStage projects. By default,
every user (the “Everyone” group) is given access to newly-created projects. However, you can use the
Windows user management tools to create multiple Windows groups with assigned users, and then use
the WebSphere DataStage and QualityStage Administrator client to restrict access on a per-project
basis.
IBM Information Server Preinstallation Checklist April 25, 2008 38 of 50
© Copyright IBM Corp. 2008. All rights reserved.
Information Platform and Solutions
Center of Excellence
For more information about setting up users, see “Setting up security” in IBM Information Server
Administration Guide and “Assigning WebSphere DataStage roles” in IBM WebSphere DataStage and
QualityStage Administrator Client Guide, both of which are installed with the PDF documentation. The
IBM Information Server Administration Guide is also available on the IBM Web site:
PDF: http://www.ibm.com/support/docview.wss?rs=14&uid=swg27008803
IBM Information Server Information Center:
http://publib.boulder.ibm.com/infocenter/iisinfsv/v8r0/index.jsp.
Many compilers are licensed on a per-user basis. WebSphere DataStage only invokes the C++ compiler
when the developer compiles a parallel job with Transformer stages or compiles a BuildOp component.
In other words, the maximum number of simultaneous WebSphere DataStage parallel job developers
would determine the maximum number of concurrent C++ compiler licenses.
On each platform, only one C++ compiler is supported. Check the IBM Information Server System
Requirements for information on supported compilers, and compiler installation requirements:
IBM Information Server System Requirements:
http://www.ibm.com/support/docview.wss?uid=swg27008923
For deployment (production) systems, a C++ compiler is not required; some platforms, however, may
require C++ runtime libraries to be installed. This information is also contained in the IBM Information
Server System Requirements.
2.15 Download IBM Information Server Installation and Fix Pack Packages
The installation packages for IBM Information Server are available through IBM Passport Advantage.
If you don’t already have an account (login) for IBM Passport Advantage, you will need to create one
and make sure it is associated with your customer ID to access all downloads that you have licensed.
On some platforms (specifically, 64-bit installations of Red Hat Linux, SUSE Linux, and HP-UX), you
may also need to download WebSphere Application Server and DB2 packages. Details of specific
packages required by platform are found in the IBM Information Server System Requirements.
In addition, product-specific patches may be required for issues discovered after the latest fix pack
release. Contact IBM Support for details on obtaining product patches.
This backup should be performed by the system administrator on all target installation server
computers, and it should include both operating system and user directories.
Specific details on each step may be found in this document, the release notes, and IBM Information
Server Planning, Installation, and Configuration Guide.
Complete Task
1) Review the release notes for IBM Information Server, IBM WebSphere
Application Server, and (if applicable) IBM DB2.
2) Review
IBM Information Server Planning, Installation and Configuration Guide.
3) If migrating from previous versions of WebSphere DataStage or
WebSphere QualityStage, review Migrating to IBM Information Server
Version 8.
4) Choose and validate the installation architecture and topology.
5) Validate system requirements for all tiers: engine, domain, repository, and
client.
6) Verify domain (WebSphere Application Server) requirements.
7) Verify database requirements for the metadata repository.
8) (if applicable) Verify database requirements
for the WebSphere Information Analyzer analysis database.
9) Verify and configure disks, volume groups, and file systems.
10) Verify and configure operating system and resource limits.
11) Verify connectivity and network configuration.
12) Configure operating system users, groups, and associated permissions.
13) Verify and install a C++ compiler and/or runtime libraries.
14) Verify IBM Information Server connector requirements.
15) Download installation and fix pack packages for IBM Information Server,
WebSphere Application Server, and DB2.
16) Perform a complete system backup.
Complete Task
1) Complete all items on the Preinstallation Checklist (Appendix A).
2) If you are migrating from an earlier version of WebSphere DataStage or
WebSphere QualityStage, complete the preinstallation migration tasks.
3) If you are not using the installation of DB2 that is provided with IBM
Information Server, create and configure the metadata repository (“xmeta”).
4) If you are using WebSphere Information Analyzer, create and configure the
analysis database (“iadb”).
5) If you are not using the installation of WebSphere Application Server that is
provided with IBM Information Server, install and configure WebSphere
Application Server to be the domain server for IBM Information Server.
6) Install IBM Information Server.
7) Review all installation logs.
8) Check for an install any applicable IBM Information Server fix packs and
patches.
9) Review the fix pack and patch installation program log files.
10) Install and configure optional IBM Information Server components.
11) (if applicable) Configure any applicable post-installation operating system
settings.
12) (if applicable) Verify cluster/grid configuration.
13) Configure and verify IBM Information Server environment and defaults.
14) Verify IBM Information Server administration (shell) environment.
15) Configure IBM Information Server users, roles, and permissions.
16) Configure and verify IBM Information Server connectors.
17) Configure and verify client installation(s).
18) (where applicable) Install client fix packs, patches, and optional components.
19) Review client installation, client fix pack, and client patch installation log
files.
20) Create and verify parallel configuration files.
21) Create and verify IBM Information Server projects.
22) Verify project permissions and security settings/roles.
It is important to note that not all options in this table will appear, depending on the options chosen in
the installation program.
Description URL
IBM Passport Advantage http://www.ibm.com/software/howtobuy/passportadvantage/index.html
Software Downloads
IBM Information Server 8.0.1 http://www.ibm.com/support/docview.wss?uid=swg24017220
Fix Pack 1a Download
IBM Software Support Handbook http://www14.software.ibm.com/webapp/set2/sas/f/handbook/home.html
IBM Information Server
IBM Information Server Information http://publib.boulder.ibm.com/infocenter/iisinfsv/v8r0/index.jsp
Center
IBM Information Server Publications http://www.ibm.com/support/docview.wss?rs=14&uid=swg27008803
Library (PDF and hardcopy books)
IBM Information Server Release http://publib.boulder.ibm.com/infocenter/iisinfsv/v8r0/topic/com.ibm.swg.
Notes im.iis.productization.iisinfsv.nav.doc/containers/cont_iisinfsrv_rnote.html
IBM Information Server System http://www.ibm.com/support/docview.wss?uid=swg27008923
Requirements
IBM Information Server Product http://www.ibm.com/software/data/integration/support/info_server/
Support
IBM developer Works – Information http://www.ibm.com/developerworks/db2/products/integration/index.html
Integration ?S_TACT=105AGX11&S_CMP=FP
WebSphere Application Server 6.0
WebSphere Application Server 6.0 http://publib.boulder.ibm.com/infocenter/wasinfo/v6r0/topic/com.ibm.web
Information Center sphere.base.doc/info/welcome_base.html
WebSphere Application Server 6.0.2 http://publib.boulder.ibm.com/infocenter/wasinfo/v6r0/topic/com.ibm.web
Release Notes sphere.base.doc/info/aes/ae/v6rn.html
WebSphere Application Server 6.0.2 http://www.ibm.com/support/docview.wss?rs=180&uid=swg27007250
Hardware Requirements
WebSphere Application Server 6.0.2 http://www.ibm.com/support/docview.wss?rs=180&uid=swg27007256
Software Requirements
DB2 V9.1
DB2 Database for Linux, UNIX, and http://publib.boulder.ibm.com/infocenter/db2luw/v9/index.jsp
Windows V9.1 Information Center
DB2 V9.1 Release Notes http://publib.boulder.ibm.com/infocenter/db2luw/v9/topic/com.ibm.db2.ud
b.doc/doc/c0023859.htm
DB2 v9.1 System Requirements http://publib.boulder.ibm.com/infocenter/db2luw/v9/topic/com.ibm.db2.ud
b.uprun.doc/doc/r0025127.htm
DB2 V9.1 Fix Pack Downloads http://www.ibm.com/support/docview.wss?rs=71&uid=swg21255572
Ports listed with an asterisk (*) need to be configured for firewall access when clients are on the other
side of the firewall from the engine, domain, and metadata repository components.
Appendix F: Glossary
Term Definition
domain tier IBM Information Server platform services (common and
product-specific services) and the application server in which
they run (WebSphere Application Server).
engine tier IBM Information Server engine and associated components.
The parallel engine is used to run WebSphere DataStage and
QualityStage parallel jobs. It’s also used by WebSphere
Information Analyzer to perform analysis. The engine tier
also includes the components required to manage multiple
instances of the engine and the components to run server jobs.
layer or tier A set of IBM Information Server components that must be
installed on the same computer. The four logical tiers are
engine, domain, metadata repository, and client.
metadata repository tier The database used to store IBM Information Server design,
configuration, and runtime metadata.
Metadata Server A product module that is comprised of the metadata
repository and the IBM Information Server common services.
Let’s take the example of a 32 processor machine with 32 GB of memory, called hptest. The default
configuration file for this machine would contain 16 nodes, each with the fast name entry “hptest”. If
the jobs that are going to be run on this will utilize a maximum of 8 GB, we would want to use at least
8 memory windows, because each window can hold up to 1 GB of memory. Below are the instructions
on how you would go about setting up the system and IBM Information Server’s parallel engine to
utilize 8 memory windows.
1. Change the kernel tunable max_mem_window from 0 to 40 and reboot the machine.
2. Create seven new entries in the /etc/hosts file, all pointing to the current machine. For example,
if the machine is called hptest, the original /etc/hosts file probably looks like:
3. If your machine is set up to trust a limited number of machines, then add each of the new
hostnames (hptest1, … , hptest7) to your ~/.rhosts or /ext/hosts.equiv file.
4. Create an entry in the /etc/services.window file for each hostname alias being used. For
example:
#
# /etc/services.window
#
# The format of this file is name followed by a space/tab followed
# by a unique user key. A memory window application uses the
IBM Information Server Preinstallation Checklist April 25, 2008 48 of 50
© Copyright IBM Corp. 2008. All rights reserved.
Information Integration Solutions
Center of Excellence
5. Modify your 16 node $APT_CONFIG_FILE, so the fastname entries are divided between
evenly across hptest through hptest7.
6. Ensure that the user can run remsh to the current node by trying the following command:
remsh hptest ls
7. Add the following file (named startup.apt) in the $APT_ORCHHOME/etc directory and make
sure it is executable. If you want to disable the use of memory windows in the future, you can
do so by setting the $APT_NO_STARTUP_SCRIPT environment variable.
#!/bin/sh
shift 2
winid=$(getmemwindow $8)
echo "hostname="$8 "windowid="$winid
setmemwindow -i $winid -p $$
exec $*
8. Test this new configuration with a sample parallel job and the new $APT_CONFIG_FILE. (A
simple Column Generator to Peek will work as a test.)
NOTE: When using memory windowing technique for large lookup tables, you must hash
partition the incoming data and all reference tables using the same key columns. The default
“Entire” partitioning will not use memory windowing.
groupadd db2iadm1
groupadd db2fadm1
groupadd dasadm1
groupadd dstage