Hello,
I have 2 tables/DataSets and it has different fields like Name, Age, Gender,Address. I want to run a match on the address column. I want a program which fetches only the matched addresses between the two tables. The problem here is that the same address can be entered in multiple ways
For Example
Table 1 Contains
114 Mary Street
Table 2 Contains
114 Mary St
114 Mary St.
The above sample records are same but they will be considered different when matched through a query. It requires some algo as a same address can be written in 1000 different ways and a same address can also contain typos.
I have searched a lot regarding the possible solution, many have recommended fuzzy search algorithm but i am not sure where and how to start.
I am looking for ideas for an effective algorithm. Any idea can be pseudo code or in your preferred language.
Any help would be highly appreciated.
Thanks
What I have tried:
I have searched regarding this at different places but still no luck many have recommended Fuzzy search algorithm i am not sure where and how to start.
The data is available in 2 tables and it has many records so it will be quite helpful if i can get a program that brings me similar or approximate same records.
Thanks.