We have raw data it is in 1st dataset
NF1 contains9 lakhs records like this structure which has shown below:
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506962 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506962 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506962 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506962 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506961 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506961 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506961 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
NF2 contains 7 lakh records in this also am containing same records but some 2 lakhs records are excluded need to find those two lakhs records we dont have any unique key also:
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506963 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506962 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506962 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506962 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506962 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506961 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506961 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
APLU027951387 ,161524310 ,1803565,179 ,04/07/2012,8 ,00992501102000506961 ,0400646871093,0400646871093,4,147020074 ,03/29/2012,C,ICM11300,2012-03-29-20.04.54.439028
But 7 th field is changing couple of sets in both datasets
E.G) 00992501102000506963,00992501102000506962,00992501102000506961 it has repeated 5,4,3 times respectively.
Do you have any idea to find those 2 lakh records,
For my understanding we need to compare that fields in both datasets:
E.g)NF1 contains
2222
2222
2222
NF2 contains
2222
2222
O/P
2222
It should be stored in resultant dataset ie) both dataset contains two times repetedly 2222 is there need to exclude only '2222' in first dataset(NF1) . If we can able to find this. So we will try this logic in my scenario whether we can able to exclude those 2 lakh records
That's why gone for join keys i dono the concept but now i understood if it is matching one time in both dataset it doesn't showing in unpaired dataset
SORTJNF1 : RCD IN= 982245,OMITTED= 0,PAIRED= 982245,UNPAIRED= 0
SORTJNF2 : RCD IN= 742926,OMITTED= 0,PAIRED= 742926,UNPAIRED= 0
OR if u have any better idea suggest me.
Thanks in advance.