Oracle® Fusion Middleware Installation Guide for Oracle Data Profiling and Oracle Data Quality for ODI 11g Release 11.1.1 Part Number E16587-01 |
|
|
View PDF |
This chapter describes how to install and configure Oracle Data Integrator. Post-installation configuration parameters are also provided.
The following topics are covered:
Review the information in this section before you begin:
Before installing any Oracle Data Quality products, you should read the system requirements and certification documentation to ensure that your environment meets the minimum installation requirements. Both of these documents are available on Oracle Technology Network (OTN).
The system requirements document covers information such as hardware and software requirements, minimum disk space and memory requirements, and required system libraries, packages, or patches:
http://www.oracle.com/technology/software/products/ias/files/fusion_requirements.htm
The certification document covers supported installation types, platforms, operating systems, databases, and third-party products:
http://www.oracle.com/technology/software/products/ias/files/fusion_certification.html
Oracle Data Quality products rely on multiple users to perform certain procedures (as defined in Table 2-1). You create the Oracle Data Quality users during and after the installation procedure. You define the UNIX operating system users before you install Oracle Data Profiling and Oracle Data Quality for Oracle Data Integrator.
Table 2-1 Required User Accounts
User | Description |
---|---|
Or:
|
Some procedures must be performed by an operating system user with root or super user access. When you enter a root password during the software installation, See your operating system documentation for more information on these user types. |
Oracle Data Quality Application Administrator |
An operating system user who installs the Oracle Data Quality Server application and administers the Oracle Data Quality Scheduler and License Manager. (This user is required when installing on either Windows operating systems or UNIX operating systems.) For information on creating this user, see Section 2.1.2.1, "Define an Oracle Data Quality Application Administrator". |
Oracle Data Quality Loader User |
User who will access data import directories (located on the Oracle Data Quality server) through a login screen in the Oracle Data Quality User Interface. This user is not required if you plan to directly access data from relational sources (Oracle, IBM DB2, ODBC). You will, however, need a user id that gives you access to each specific database. NOTE: This user is only required if you plan to load data from flat file sources (delimited, COBOL, Oracle Data Quality sources). For more information on creating this user, see Section 2.1.2.2, "Define Oracle Data Quality Loader Users". |
Metabase Administrator |
Oracle Data Quality user account that creates and maintains Oracle Data Quality repositories, and defines metabases, Oracle Data Quality users, and data connections. This user is also known as the Oracle Data Quality Repository User. NOTE: The metabase administrator is created during Oracle Data Quality installation. |
Oracle Data Quality User |
Oracle Data Quality user account that accesses Oracle Data Quality metabases through the Oracle Data Quality User Interface NOTE: Oracle Data Quality users are created by the metabase administrator after installation. |
You must create an Oracle Data Quality Application Administrator and, if you are using sudo
, you must also grant sudo
rights before installing Oracle Date Quality products. This administrator installs the server application and administers the Scheduler and License Manager on the Oracle Data Quality server.
To create an Application Administrator on UNIX Operating Systems:
As the UNIX root user, create an Oracle Data Quality administrator user account to perform Oracle Data Quality administrative activities.
There are no naming restrictions for the Oracle Data Quality administrator name, but the recommendation is that the user account be named odqadmin.
Do one of the following:
If you are not leveraging the security provided through sudo
, make sure that the Oracle Data Quality administrator has read access to any data import directories that you define. Proceed to "Define Oracle Data Quality Loader Users".
If you are leveraging the security features of sudo
, proceed to the next step.
As the UNIX root user, type the command: visudo
. This brings up the file named sudoers
for editing.
NOTE: You must always edit this file through the visudo
command.
In the Defaults specification section, add the following entries:
Defaults:<user_id> targetpw Defaults:<user_id> passwd_tries=1 Defaults:<user_id> timestamp_timeout=0
In the User Privilege specification section, add:
<user_id> ALL=(ALL) ALL
Save the file and exit.
Verify that sudo
is correctly configured.
Log in as the newly created Oracle Data Quality administrator. For example, type:
sudo -u odqadmin id
When prompted, enter the password for the user.
Type the command: id
The operating system should return the UNIX user id. If not, contact your system administrator.
To create an Application Administrator on Windows Operating Systems:
As the Windows Server Administrator, create a Windows user account for the Oracle Data Quality Administrator.
There are no naming restrictions for the Oracle Data Quality administrator name, but the recommendation is that the user account be named odqadmin.
Add the Oracle Data Quality Administrator user odqadmin to the group Administrators.
From the Windows Control Panel, open Administrative Tools, Local Security Policy and expand the Security Settings > Local Policies folder.
Add the Oracle Data Quality Application Administrator user odqadmin to the following options:
Log on as a Service
Act as part of the operating system
Define an Oracle Data Quality Loader User account on the UNIX server if you plan to import data from flat files. When you import flat file data into Oracle Data Quality, you access the file location on the server through a UNIX user id.
For example, if you are importing flat files described by COBOL copybooks that are residing on the Oracle Data Quality server in the directory /data
, then you require an Oracle Data Quality Loader User (UNIX user id) who can log on to the Oracle Data Quality server and read the files from /data
.
Note:
If you intend to import data directly from a relational source, you do not need to perform these steps since a UNIX userid is not required. The only user required is for RDBMS login accessDefine a new user account or select an existing user account to act as an Oracle Data Quality Loader User.
NOTE: For Windows operating systems, determine whether you will create a single user account that the team will share or if each user will have their own user account.
Give the account read access to the data import directory that you plan to use when you create a Loader Connection.
NOTE: For Windows operating systems, add each user account (that will access the flat file data) to the appropriate Windows user group for each secured location. User accounts not contained in the Windows group will not be able to import flat file data for that loader connection.
During installation you must provide at least 2 port numbers that are accessible from the client to the server and are not blocked. These port numbers are used for the following services:
The Oracle Data Quality Scheduler requires a port called Scheduler Port.
The Oracle Data Profiling and Quality Metabase Repository requires a port called Repository Port.
Note:
If you are planning to use ODBC datasources from a Windows-based server, you will need to reserve a third port for the Oracle Data Quality ODBC Adapter.To identify ports that have applications listening on them, type the netstat -an
command. Select two available ports and make note of them for the setup procedure.
Note:
The port numbers should be greater than 1000 and not exceed 65535, and for easy reference, should be consecutive numbers. (For example,7600
for the Repository Port and 7601
for the Scheduler Port.)This section contains information and instructions for installing Oracle Data Integrator:
To start the installer, insert the Oracle Data Quality installation disk or navigate to /Disk1
in the ODQ installation directory (where you saved the ODQ .zip or .jar file) and run the following command:
On UNIX operating systems:
./runInstaller
On Windows operating systems:
setup.exe
Refer to the Oracle Fusion Middleware Certification document to see the JDKs supported for your system: http://www.oracle.com/technology/software/products/ias/files/fusion_certification.html
The installer writes logs files to the Oracle_Inventory_Location
/log
(on UNIX operating systems) or Oracle_Inventory_Location
\logs
(on Windows operating systems) directory. On UNIX systems, if you do not know the location of your Oracle Inventory directory, you can find it in the oraInst.loc
file in the following directories (default locations):
Linux operating systems: /etc/oraInst.loc
HP-UX and Solaris operating systems: /var/opt/oracle/oraInst.loc
Windows operating systems: \Program Files\Oracle\Inventory\logs
The Oracle Data Profiling and Quality installation for UNIX operating systems includes the following components:
Oracle Data Quality for Data Integrator
Metabase Server
Metabase Definitions
Table 2-2 describes the screens included in a UNIX operating system installation. For more information, see Appendix B, "Oracle Data Profiling and Data Quality Installation Screens".
Table 2-2 UNIX Operating System Installation Flow
No. | Screen | When Does This Screen Appear? | Description and Action Required |
---|---|---|---|
1 |
Always |
Click Next to continue. |
|
2 |
Always |
This screen analyzes the host computer to ensure that specific operating system prerequisites have been met. If any of the prerequisite checks fail, then a short error message appears in the bottom portion of the screen. Fix the error and click Retry to try again. If you want to ignore the error or warning messages and continue with the installation, click Continue. Click Abort to stop prerequisite checking for all components. |
|
3 |
Always |
In the Location field, enter the Oracle home (referred to in this guide as Click Next to continue. |
|
4 |
Always |
This screen configures the Metabase Server. Provide the required information and click Next to continue. |
|
5 |
Always |
Review the summary and click Install to continue. |
|
6 |
Always |
The installer automatically executes each configuration assistant in sequence, displaying the progress in the Status column. No action is required on this screen. |
|
7 |
Always |
If you want to save this configuration to a text file, click Save. This file can be used later if you choose to perform the same installation from the command line. Click Finish to close the installer. |
Table 2-3 describes the screens included in a Windows operating system installation. For more information, see Appendix B, "Oracle Data Profiling and Data Quality Installation Screens".
Table 2-3 Windows Operating System Installation Flow
No. | Screen | When Does This Screen Appear? | Description and Action Required |
---|---|---|---|
1 |
Always |
Click Next to continue. |
|
2 |
Always |
Select the components you want to install. The options are:
Click Next to continue. |
|
3 |
Always |
Click Next to continue. |
|
4 |
Always |
Specify the absolute path to your Oracle home (referred to in this guide as Click Next to continue. |
|
5 |
Always |
This screen configures the Metabase Server. Provide the required information and click Next to continue. |
|
6 |
Only if you selected Metabase Client on the Select Components Screen (Windows Operating Systems Only) screen. |
This screen configures the client to connect to the Metabase and ODBC Servers. Provide the required information and click Next to continue. |
|
7 |
Always |
Review the summary and click Install to continue. |
|
8 |
Always |
The installer automatically executes each configuration assistant in sequence, displaying the progress in the Status column. No action is required on this screen. |
|
9 |
If you want to save this configuration to a text file, click Save. This file can be used later if you choose to perform the same installation from the command line. Click Finish to close the installer. |
The postal tables are a critical part of the data quality process because they provide the postal information that is used to validate and improve the name and address data in your records. This section describes how to install postal and census directories for for an Oracle Data Quality Server.
The postal tables and census/DPV directories are delivered in a compressed format. The file extensions are .zip (for Windows operating systems) and .tar (for UNIX operating systems).
Table 2-4 describe the file naming conventions that are used:
Table 2-4 Naming Conventions
Table or Directory Name | Naming Convention Used |
---|---|
For example:
|
|
NOTE: Global Postal Tables are a subset of international postal tables that are invoked from within Oracle Data Quality. Global Postal Tables are distinct because they require an additional service. The postal tables in this subset are: Austria, Brazil, Czech Republic, Denmark, Finland, Greece, Hungary, Ireland, Mexico, New Zealand, Norway, Poland, and Sweden. |
For example:
|
NOTE: Census data is available only for the United States. |
|
|
|
USLMMMq.ext. This is the name of the United States LACSLink directory, and ext is either zip or tar. |
|
SuiteLinkDirectory |
NOTE:
|
By default, postal-related files and directories are installed to the following locations:
General Postal Tables (including DPV, LACSLink and SuiteLink tables)
On UNIX operating systems:ODQ_HOME/oracledq/12/tables/postal_tables
On Windows operating systems: \ODQ_HOME\oracledq\tables\*
Global Postal Tables and Latitude/Longitude Tables
On UNIX operating systems:ODQ_HOME/oracledq/12/tables/postal_tables
On Windows operating systems: \ODQ_HOME\oracledq\tables\*
Asian Postal Tables (China, Japan, Korea, Taiwan)
On UNIX operating systems:ODQ_HOME/oracledq/12/tables/postal_tables
On Windows operating systems: \ODQ_HOME\oracledq\tables\*
Census
On UNIX operating systems:ODQ_HOME/oracledq/12/tables/postal_tables
On Windows operating systems: \ODQ_HOME\oracledq\tables\*
If you want to use a different location for the postal tables, follow the procedures Section 2.3.3, "Alternative Postal Code Installation Locations".
Oracle recommends that you install the postal tables in the default directories (as defined in Section 2.3.2). If you prefer to use an alternative directory, you must define the alternative locations before installing the postal tables.
To specify an alternative location for the postal tables:
Shut down Oracle Data Quality products, including all Oracle Data Quality clients.
Close the Oracle Data Quality Scheduler service.
Start the Metabase Server application.
On Windows operating systems:
Programs > Oracle Data Quality Software > Metabase
On UNIX operating systems:
./mtb_admin
When prompted, log on as the metabase administrator (madmin
).
Use the _control
metabase name to apply the alternative location to all metabases.
Ensure that changes have not already been made to the postal directory installation locations by typing the following commands, individually.
define postal_directory define apac_postal_directory define census_directory define ga_directory
The commands should return the following failure message: “No setting with name nnn_directory found in table default_settings.” For example, “No setting with name 'census_directory
' found in table 'default settings
'.”
NOTE: If a value is returned, you need to remove the current setting before defining a new one. See Removing a Postal Directory Definition for more information.
Create the alternative installation directory for non-Asian postal tables, census tables, global postal tables, and/or Asian postal tables.
Move any installed xxCITY
files from the default installation directory to the new, alternative directory.
These files were copied to the default postal directories when TS Quality project templates were installed. They must be in the same location as the postal tables.
Remove the default directories (or leave them empty).
At the mtb_admin
prompt, type:
expert
Define the alternative locations by issuing one or more of the following commands:
For non-Asian postal tables:
define postal_directory [file join {d:\newpostal}]
For Asian postal tables:
define apac_postal_directory [file join {d:\newpostal}]
For census tables:
define census_directory [file join {d:\newpostal}]
For global postal tables:
define ga_directory [file join {d:\newpostal}]
where d:\newpostal
is the path of the alternative location.
Type exit
to close the command prompt window.
For global postal tables and latitude/longitude tables, there is an additional step.
With a text editor, open the global postal table gaserver.ini file, which is located in the ODQ_HOME/oracledq/12/Software/bin
directory).
Set the CountryDataDirectory entry to point to the alternative location of the global postal table datastore \rdata
directory.
Set the KnowledgeBaseDirectory entry to point to the location of the global postal table datastore \kbase
directory.
Locate the LicenseDirectory entry and point it to the global postal table license directory. Figure 2-1 shows an example of a modified gaserver.ini
file.
Save and close the file.
Copy the file you modified and paste it in the ODQ_HOME/oracledq/12/Software/bin/latlong
directory, overwriting the existing file.
NOTE: Both versions of the gaserver.ini
file must point to the same locations.
Restart Oracle Data Quality and the Scheduler.
To remove a postal directory definition:
Shut down Oracle Data Quality products, including all Oracle Data Quality clients.
Close the Oracle Data Quality Scheduler service.
Start the Metabase Server application.
On UNIX operating systems:
./mtb_admin
On Windows operating systems:
Programs > Oracle Data Quality Software > Metabase Server > Administrator Command Prompt from the Start menu.
When prompted, log on as the metabase administrator (madmin) to the _control
metabase.
Issue the appropriate command:
undefine postal_directory undefine apac_postal_directory undefine census_directory undefine ga_directory
The basic installation procedure is the same for all postal tables categories. However, there is an extra step involved in setting up a Global Postal Table.
To install postal directory files:
Download the Postal Table file and copy it to the postal install directory. This file is located on the installation CD/DVD or was installed on your machine from an FTP session. See Postal Table Naming Conventions for a description of the file name formats.
Decompress the .tar or .zip file using the appropriate command (WinZip or tar). The expanded files are placed into the current directory.
NOTE: Oracle Data Quality Software compresses some postal tables because of their size. Review the list of files for any that have a .Z
extension, which indicates compression. If necessary, uncompress files by entering the following command:
uncompress *.Z
If you purchased one of the Global Postal Tables, complete the procedure “To set up the Global postal service in Windows” or “To set up a Global postal service in UNIX.” (The following countries use a Global Postal Table: Austria, Brazil, Czech Republic, Denmark, Finland, Greece, Hungary, Ireland, Mexico, New Zealand, Norway, Poland, and Sweden.)
Use the following steps to set up a global postal service on UNIX operating systems.
Complete the installation of the Global Postal Tables as described in Section 2.3.5.
Change to the directory that contains the file gactl.
For AIX operating systems, this file can be found in the directory:
ODQ_HOME/oracledq/12/Software/GA_server
For all other UNIX operating systems, this file can be found in the directory:
ODQ_HOME/oracledq/12/Software/bin
Enter the following command:
gactl start
NOTE: If you need to install a new Global Postal Table, you must stop the service, install the new table, then restart the service.
To stop the service, issue the following command:
gactl stop
To set up the Global postal service in Windows operating systems:
Complete the installation of the Global Postal Tables.
From the Start menu, select Programs > Oracle Data Quality products > Global Postal Matcher > Create Service.
(Optional) If you have defined a network drive as the alternate location for the GA postal tables, as described in Section 2.3.3, you must modify the Global Address Service properties.
Go to the Windows Services page (Start > Settings > Control Panel > Administrative Tools > Services).
Locate the Global Address Server entry and right-click.
Select Properties and then click the Log On tab.
Select the This Account option and enter your domain name and user name in the first field (for example, domain_name\jsmith)
.
In the Password and Confirm Password fields, enter the password associated with your user id and click OK.
From the Start menu, select Programs > Oracle Data Quality products > Global Postal Matcher > Start Service.
From the Start menu, select Settings > Control Panel > Administrative Tools > Services and confirm that the Global service is running.
NOTE: If you need to install a new Global Postal Table, you must stop the service, install the new table, then restart the service.
Depending on your installation type, you may need to manually configure some of the Oracle Data Quality components. Review the following sections to determine if additional configuration tasks are required:
If you installed the Oracle Data Profiling and Quality server, you must start the Oracle Data Quality Scheduler before you begin. The Oracle Universal Installer will start the Scheduler as part of the installation process, but you may need to start it again.
To start the Oracle Data Quality Scheduler:
Make sure that you are logged on to the Oracle Data Profiling and Quality server as the Oracle Data Quality server administrator as defined on the Metabase Server Details screen during the installation.
Navigate to the following directory:
<ODQ_HOME/oracledq/metabase_server/metabase/bin
Type the command:
./scheduler -start
For more information on using the Scheduler, see the online help for Metabase Administrators.
On UNIX platforms add the following environment variables for the user who has installed Oracle Data Integrator:
Oracle_QUALITY=ODQ_HOME/oracledq/12/Software
LD_LIBRARY_PATH=ODQ_HOME/oracledq/12/Software/bin
Oracle Data Quality products work with inetd
, a daemon process that handles network services operating on a UNIX operating system. Upon execution, inetd
reads its configuration information from a configuration file which, by default, is /etc/inetd.conf
.
If you installed the Metabase Server, the Oracle Universal Installer automatically updated the inetd.conf
file. For Linux, HP, and AIX systems, no further action is required.
If you do update the inetd.conf
file for any reason, be sure to recycle it.
On Solaris 10 and later systems, inetd
reads configuration information from a different location. If you have installed Oracle Data Profiling and Quality components on a Solaris 10 system, log on as the root user and issue the following command at the command prompt:
inetconv -f -o /var /tmp
This command converts the data quality entry in the inetd.conf
file to the format required by Solaris 10.