This is a discussion on Eliminating Duplicate Rows from Flat File within the Data Warehousing forums, part of the ERP category; In Informatica, how is it done to eliminate duplicated rows from a flat file?...
|
|||||||
| Register | FAQ | Members List | Calendar | Search | Today's Posts | Mark Forums Read |
|
|||
|
One way is to use aggregator. Group by all the columns and the output will not have any duplicates. If the input file size is big, use sorter before aggregator and check the [sorted input] box in the aggregator (this is for performace improvement)
Zafar |
|
|||
|
Inside UNIX you can use this commande to eliminate duplicated rows from a flat file"sed '$!N; /^\(.*\)\n\1$/!P; D'"
Ex: cat file sed '$!N; /^\(.*\)\n\1$/!P; D' > /home/tmp.dat cat /home/tmp.dat > file rm /home/tmp.dat |
![]() |
| Thread Tools | |
|
|
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| How to delete duplicate rows from a table | athermaliq | Oracle Tutorials | 5 | 01-17-2006 01:03 AM |