Exforsys
+ Reply to Thread
Results 1 to 3 of 3

Eliminating Duplicate Rows from Flat File

This is a discussion on Eliminating Duplicate Rows from Flat File within the Data Warehousing forums, part of the ERP category; In Informatica, how is it done to eliminate duplicated rows from a flat file?...

  1. #1
    cyrus is offline Senior Member Array
    Join Date
    Apr 2006
    Answers
    128

    Eliminating Duplicate Rows from Flat File

    In Informatica, how is it done to eliminate duplicated rows from a flat file?


  2. #2
    zsk_00 is offline Member Array
    Join Date
    May 2006
    Answers
    38
    One way is to use aggregator. Group by all the columns and the output will not have any duplicates. If the input file size is big, use sorter before aggregator and check the [sorted input] box in the aggregator (this is for performace improvement)

    Zafar


  3. #3
    a.joycy is offline Junior Member Array
    Join Date
    Aug 2007
    Location
    I live in Madagascar
    Answers
    1
    Inside UNIX you can use this commande to eliminate duplicated rows from a flat file"sed '$!N; /^\(.*\)\n\1$/!P; D'"

    Ex: cat file sed '$!N; /^\(.*\)\n\1$/!P; D' > /home/tmp.dat
    cat /home/tmp.dat > file
    rm /home/tmp.dat


    •    Sponsored Ads



Latest Article

Network Security Risk Assessment and Measurement

Read More...