
- Forum
- ERP
- Data Warehousing
- Eliminating Duplicate Rows from Flat File
Eliminating Duplicate Rows from Flat File
This is a discussion on Eliminating Duplicate Rows from Flat File within the Data Warehousing forums, part of the ERP category; In Informatica, how is it done to eliminate duplicated rows from a flat file?...
-
Eliminating Duplicate Rows from Flat File
In Informatica, how is it done to eliminate duplicated rows from a flat file?
-
One way is to use aggregator. Group by all the columns and the output will not have any duplicates. If the input file size is big, use sorter before aggregator and check the [sorted input] box in the aggregator (this is for performace improvement)
Zafar
-
08-24-2007, 09:16 AM #3
- Join Date
- Aug 2007
- Location
- I live in Madagascar
- Answers
- 1
Inside UNIX you can use this commande to eliminate duplicated rows from a flat file"sed '$!N; /^\(.*\)\n\1$/!P; D'"
Ex: cat file sed '$!N; /^\(.*\)\n\1$/!P; D' > /home/tmp.dat
cat /home/tmp.dat > file
rm /home/tmp.dat
-
Sponsored Ads

Reply With Quote





