Convert Single Row From Source To Three Rows in Target: Solution
Convert Single Row From Source To Three Rows in Target: Solution
Convert Single Row From Source To Three Rows in Target: Solution
There is target table containg only 1 column Col. Design a mapping so that the target table contains 3 rows as
follows:
Col
a
b
c
Without using normaliser transformation.
Solution:
Create 3 expression transformations exp_1,exp_2 and exp_3 with 1 port each. Connect col1 from Source Qualifier to
port in exp_1.Connect col2 from Source Qualifier to port in exp_2.Connect col3 from source qualifier to port in exp_3.
Make 3 instances of the target. Connect port from exp_1 to target_1. Connect port from exp_2 to target_2 and
connect port from exp_3 to target_3.
2.Split the non-key columns to separate tables with key column in both:
Scenario 2:
Split the non-key columns to separate tables with key column in both / How to split the data of source table column-
wise with respect to primary key. See the source and target tables below.
source table: ID is the key column, Name and Phone No are non-key columns
ID Name Phone No
10 AAA 123
20 BBB 234
30 CCC 434
40 DDD 343
442
50 EEE
Target Table 1
ID Name
10 AAA
20 BBB
30 CCC
40 DDD
50 EEE
Target Table 2
ID Phone No
10 123
20 234
30 434
40 343
50 442
Solution:
Step 1: Source qualifier: get the source table to the mapping area. See image below.
Step 2: Drag all the port from (from the previous step) to the Aggregator transformation and group by the key
column. Since we have to split the columns to two different tables with the key column in each, so we are going use
two expression transformation, each will take the key column and one non-key column. Connect aggregator
transformation with each of the expression transformation as follows.
Step 3: We need another set of aggregator to be associated with each of the expression tranformation from the
previous step.
Step 4: In the final step connect the aggregators with the two target tables as follows.
a b c
x y z
a b c
r f u
a b c
v f r
v f r
Target Table 1: Table containing all the unique rows
COL1 COL2 COL3
a b c
x y z
r f u
v f r
Step 2: In aggregator transformation, group by the key column and add a new port call it count_rec to count the key
column.
Step 3: connect a router to the aggregator from the previous step.In router make two groups one named "original"
and another as "duplicate"
In original write count_rec=1 and in duplicate write count_rec>1.
The picture below depicting group name and the filter conditions
Scenario 4:
How to get first and last record from a table/file?
Solution:
Step 1: Drag and drop ports from source qualifier to two rank transformations.
Step 2: Create a reusable sequence generator having start value 1 and connect the next value
to both rank transformations.
Step 3: Set rank properties as follows
In Rank1
In Rank2
Solution
Step4:Chose the advance option. Set number of initial rows skip: 1 ( it can be more as per requirement )
7.Sending first half record to target:
Scenario 6: How to send first half record to target?
Solution:
1. Drag and drop the source to mapping.
3. Then connect to target.Now you are ready to run the mapping to see it in action.
8.Sending second half record to target :
Scenario 8: How to send second half record to target?
Solution
Step 1: Drag and drop the source to mapping.
Step:3 Then connect to target, and run mapping to see the results.
9.Sending alternate record to target:
Scenario 9: How to send alternate record to target?
Or
Sending Odd numbered records to one target and even numbered records to another target.
Solution:
Step 3: In expression transformation make two port, one is "odd" and another "even".
And Write the expression like below
EXPRESSION PROPERTY
a b c
x y z
a b c
r f u
a b c
v f r
v f r
Target Table
Col1 Col2 Col3
a b c
x y z
r f u
v f r
Solution:
Step 1: Bring the source to mapping.
RANK PROPERTY
11.Separate rows on group basis:
Scenario 11: In Dept table there are four departments (dept no 40,30,20,10). Separate the record to different target
department wise.
Solution:
Step 1: Drag the source to mapping.
Step 2: Connect the router transformation to source and in router make 4 groups and give condition like below.
ROUTER TRANSFORMATION
ROUTER TO TARGET
12.Get top 5 records to target without using rank :
Scenario 12: How to get top 5 records to target without using rank ?
Solution:
1. Drag the source to mapping and connect it to sorter transformation.
2. Arrange the salary in descending order in sorter as follows and send the record to expression.
SORTER PROPERTIES
3.Add the next value of sequence generator to expression.(start the value from 1 in sequence generator).
4. Connect the expression transformation to a filter or router. In the property set the condition as follows-
2. Make 4 output ports in aggregator as in the picture above : count_d10, count_d20, count_d30, count_d40.
For each port write expression like in the picture below.
5. Then connect to router transformation. And create a group and fill condition like below.
6. Finally connect to target table having one column that is dept no.
14.Extracting every nth row :
Scenario: How to load every nth row from a Flat file/ relational DB to the target? Suppose n=3, then in above
condition the row numbered 3,6,9,12,....so on, This example takes every 3 row to target table.
Solution:
1. Connect an expression transformation after source qualifier.
Add the next value port of sequence generator to expression transformation.
2. In expression create a new port (validate) and write the expression like in the picture below.
3. Connect a filter transformation to expression and write the condition in property like in the picture below.
2. Send the all ports to a router and make three groups as bellow
Group1
mod(NEXTVAL,30) >= 1 and mod(NEXTVAL,30) <= 10
Group2
mod(NEXTVAL,30) >= 11 and mod(NEXTVAL,30) <= 20
Group3
mod(NEXTVAL,30) >= 21and mod(NEXTVAL,30) <= 29 or mod(NEXTVAL,30) = 0
a x
b y
c z
a m
Target Table: T2
b n
col1 col2
a x,m
b y,n
c z
Solution:
1. We have to use the following transformation as below.
First connect a sorter transformation to source and make col1 as key and its order is ascending. After that
connect it to an expression transformation.
2. In Expression make four new port and give them name as in picture below.
3. In concat_val write expression like as describe bellow and send it to an aggregator
1 200
2 300
3 500
4 560
TARGET TABLE
Id Sal
1 200
2 500
3 1000
4 1560
1. Pull the source to mapping and then connect it to expression.
2. In expression add one column and make it output(sal1) and sal port as input only.
We will make use of a function named cume() to solve our problem, rather using any complex mapping. Write
the expression in sal1 as cume(sal) and send the output rows to target.
18.Produce files as target with dynamic names:
Scenario:How to generate file name dynamically with name of sys date ?
Solution:
1. Drag your target file to target designer and add a column as show on the picture. It’s not a normal column .click
on the ‘add file name to the table’ property. (I have given a red mark there)
2. Then drag your source to mapping area and connect it to an expression transformation.
3. In expression transformation add a new port as string data type and make it output port.
4. In that output port write the condition like describe as bellow and then map it in to filename port of target. Also
send other ports to target. Finally run the session. You will find two file one with sys date and other one is ‘.out’
file which one you can delete.
2. Create a mapping as shown in the figure( I have considered a simple scenario where a particular department id
will be filtered to the target).
3. In filter set deptno=$$v1 (that means only dept no 20 record will go to the target.)
4. Mapping parameter value can’t change throughout the session but variable can be changed. We can change
variable value by using text file. I’ll show it in next scenario.
21. Removing '$' symbol from salary column:
Q21: Reading a source file with salary prefix $, in the target the Sal column must store in number.
Source
EMPNO ENAME JOB MGR HIREDATE SAL DEPTNO
7369 SMITH CLERK 7902 17-DEC-80 $800 20
7499 ALLEN SALESMAN 7698 20-FEB-81 $1600 30
Target
EMPNO ENAME JOB MGR HIREDATE SAL DEPTNO
7369 SMITH CLERK 7902 17-DEC-80 800 20
7499 ALLEN SALESMAN 7698 20-FEB-81 1600 30
1. Drag the source to mapping area and connect each port to an expression transformation.
2. In expression transformation add a new col sal1 and make it as output and Sal as in put only as shown in
picture.
2. Then drag the source to mapping area and connect to an expression transformation.
3. In expression create an output port as sal1 and make Sal as input only as bellow.
4. In sal1 port write the condition as below
iif(instr(SAL,'$')!=0,TO_integer(SUBSTR(SAL,INSTR(SAL,'$')+1,LENGTH(SAL)-1))*$$DOLAR,
iif(instr(SAL,'£')!=0,TO_integer(SUBSTR(SAL,INSTR(SAL,'£')+1,LENGTH(SAL)-1))*$$POUND,
iif(instr(SAL,'¥')!=0,TO_integer(SUBSTR(SAL,INSTR(SAL,'¥')+1,LENGTH(SAL)-1))*$$YEN
)
)
)
$$DOLAR,$$POUND,$$YEN these are mapping parameter . You can multiply price in rupee directly for
example dollar price in rupees i.e. 48.
5. Connect required output port from expression to target directly. And run the session.
23. Sending data one after another to three tables in cyclic order:
Q23 In source there are some record. Suppose I want to send three targets. First record will go to first target,
second one will go to second target and third record will go to third target and then 4th to 1st, 5th to 2nd, 6th
to 3rd and so on.
1. Put the source to mapping and connect it to an expression transformation.
2. Drag a sequence generator transformation and set properties like this and connect the next value port to
expression.
3. Drag all output port of expression to router. In router make three groups and give the conditions Like this
2. In expression make an output port sal1 and make Sal as input port only.
2. In Expression add 6 columns like in the picture as bellow. But you can make it two columns (One for all the
vowels and one for the vowel counts). For better understanding I have added 6 columns, 5 for each of the
vowels and one for the vowel count.
The way I achieved is for each of the vowels in ename , I replaced it with null and in port total vowel count ,
I subtract the vowel port from the ename length which gives me the individual count of vowels, after adding up
for all vowels I found all the vowels present. Here are all the variable ports.
For A write REPLACECHR(0,ENAME,'a',NULL)
For E write REPLACECHR(0,ENAME,'e',NULL)
For I write REPLACECHR(0,ENAME,'i',NULL)
For O write REPLACECHR(0,ENAME,'o',NULL)
For U write REPLACECHR(0,ENAME,'u',NULL)
And for o/p column total_vowels_count write expression like this
(length(ENAME)-length(A))
+
(length(ENAME)-length(E))
+
(length(ENAME)-length(I))
+
(length(ENAME)-length(O))
+
(Length (ENAME)-length (U))
3. Finally send to target.
Source
EMPNO HIRE_DATE (numeric)
------- -----------
1 20101111
2 20090909
target
EMPNO HIRE_DATE (date)
------ -----------
1 11/11/2010
2 09/09/2009
1. Connect SQF to an expression.
2. In expression make hire_date as input only and make another port hire_date1 as o/p port with date data type.
2. In expression create another oupput port hire_date1 and make it to date data-type, shown in picture.
We have to calculate difference between order_date and delivery date in hours and send it to target.
o/p will be
2. In expression create one out/put port “diff” and make it integer type.
3. In that port write the condition like this and sent to target.
31.Sending to target with days difference more than 2 days:
Scenario: From the order_delivery table insert the records to target where , day difference between order_date
and delivery_date is greater than 2 days. ( Note: see last article , where we discussed finding the time in hour
between two dates)
Source
ORDER_NO ORDER_DATE DELIVERY_DATE
--------- --------- ---------
2 11-JAN-83 13-JAN-83
3 04-FEB-83 07-FEB-83
1 08-DEC-81 09-DEC-81
Target
ORDER_NO ORDER_DATE DELIVERY_ DATE
--------- -------- ------ --- ----------
2 11-JAN-83 13-JAN-83
3 04-FEB-83 07-FEB-83
These are the steps for achieving this scenario
1. Connect all the rows from SQF to update strategy transformation.
2. In expression create one o/p port c_year_mm_dd, make it to date type and in that port write the condition like
this.
2. In expression transformation create two output port one is f_name and other is l_name.
2. In Expression create two ports one is name1(as variable port) and Middle_Name (o/p port)
4. Connect lookup to source. In Lookup fetch the data from target table and send only CUSTOMER_ID port from
source to lookup
5. Give the lookup condition like this
6. Then rest of the columns from source send to one router transformation
8. For new records we have to generate new customer_id. For that take a sequence generator and connect the
next column to expression .New_rec group from router connect to target1(Bring two instances of target to
mapping, one for new rec and other for old rec) .Then connect next_val from expression to customer_id column
of target
9. Change_rec group of router bring to one update strategy. and give the condition like this
10. Instead of 1 you can give dd_update in update-stratgy. Then connect to target.
37.SCD Type2:
In Type 2 Slowly Changing Dimension, if one new record is added to the existing table with a new
information then both the original and the new record will be presented having new records with its own
primary key.
1. To identifying new_rec we should and one new_pm and one vesion_no.
2. This is the source.
4. All the procedure same as described in SCD TYPE1 mapping. The Only difference is , From router new_rec will
come to one update_strategy and condition will be given dd_insert and one new_pm and version_no will be
added before sending to target.
5. Old_rec also will come to update_strategy condition will given dd_insert then will send to target.
38.SCD Type3:
In SCD Type3 ,there should be added two column to identifying a single attribute. It stores one time
historical data with current data
1. This is the source
3. Up to rouer transformation ,all the procedure is same as described in Scenario_36 SCD type1.
4. The only difference is after router bring the new_rec to router and give condition dd_insert send to target.
Create one new primary key send to target.
5. For old_rec send to update_strategy and set condition dd_insert and send to target.
6. You can create one effective_date column in old_rec table
39.Unit Testing:
Unit Testing
In unit testing what we need do is something like below
1. Validate source and target
- Analyze & validate your transformation business rules.
- We need review field by field from source to target and ensure that the required
transformation logic is applied.
- We generally check the source and target counts for each mapping.
2. Analyze the success and reject rows
- In this stage we generally customized sql queries to check source and target.
- Analyze the rejected rows and build the process to handle this rejection.
3. Calculate the load time
- Run the session and view the statistics
- We observe how much time is taken by reader and writer .
- We should look at lesion log and workflow log to view the load statistics
4. Testing performance
- Source performance
- Target performance
- Session performance
- Network performance
- Database performance
After unit testing we generally prepare one document as described below
5. UNIT TEST CASE FOR LOAN_MASRER
ACTUA
FUNCTIONALITY_ VALUE EXPECTED PASS/FAIL
FIELD_NAME DETAIL L REMARK
ID PASSED RESULT RESULT
RESULT
_TYPE_ID SHOULD BE
NOT NULL ,FIRST
CHARACHER
RECOR
ALPHABET(INSCH)
STG_SCHM_DTLS LOAN INSCH0000000 ACCEPT D
AND LAST 10 PASS
_001 _ID 0002 RECORD ACCEP
CHARACTER
TED
NUMERIC VALUES
AND ALSO ITS
LENGTH IS 16
RECORD
REJECT WHEN , NOT INSERTED
NULL ,FIRST 5 INTO
CHARACHER NOT REJECT REJECTED
STG_SCHM_DTLS (INSCH) OR LAST 10 INSCP0010000 RECORDREC FILE WITH AN
LOAN_TYPE_ID PASS
_002 CHARACTER NON 00002 ORD ERROR_ID
NUMERIC VALUES REJECTED &ERROR_DE
AND ALSO ITS TAILS INTO
LENGTH <>16 ERROR_TABL
E
LOAN_COMPANY_ID
MUST BE NOT
NULL,FIRST 4
RECOR
CHRACTER
STG_SCHM_DTLS LOAN_COMPANY INCO00000000 ACCEPT D
ALPHABET(INCO) AND PASS
_003 _ID 003 RECORD ACCEP
LAST 11 CHRACTER
TED
NUMERIC VALUES
AND ALSO LENGTH IS
15
RECORD
REJECT WHEN , NOT INSERTED
NULL ,FIRST 4 INTO
CHARACHER NOT RECOR REJECTED
STG_SCHM_DTLS LOAN_COMPANY (INCO) OR LAST 11 INSO00000060 REJECT D FILE WITH AN
PASS
_004 _ID CHARACTER NON 003 RECORD REJECT ERROR_ID
NUMERIC VALUES ED &ERROR_DE
AND ALSO ITS TAILS INTO
LENGTH <>15 ERROR_TABL
E
RECORD
INSERTED
INTO
START DATE SHOULD RECOR REJECTED
STG_SCHM_DTLS NOT BE LOADED REJECT D FILE WITH AN
START_DATE 33FeB/88 PASS
_006 WHEN IT IS NOT A RECORD REJECT ERROR_ID
VALID DATE ED &ERROR_DE
TAILS INTO
ERROR_TABL
E
RECOR
SCHEME-DESC
STG_SCHM_DTLS ACCEPT D
SCHEME_DESC SHOULD BE AUTOMOBILE PASS
_007 RECORD ACCEP
ALPHABETIC TYPE
TED
RECORD
INSERTED
INTO
REJECT WHEN RECOR REJECTED
STG_SCHM_DT SCHEME_DESC SCHEME DISCOUNT REJECT D FILE WITH AN
MOTO124 PASS
LS_008 IS NOT ALPHABETIC RECORD REJECT ERROR_ID
TYPE ED &ERROR_DE
TAILS INTO
ERROR_TABL
E
RECOR
PREMIUM_PER_LACS
STG_SCHM_DTLS PREMIUM_PER_L ACCEPT D
SHOULD BE 5000 PASS
_009 ACS RECORD ACCEP
NUMERIC
TED