Match data, exact & fuzzy,
eliminate duplicates
and much more
There are many ways to obtain better data. An intelligent search for duplicates and duplicate addresses is one of them. Because different spellings often make it difficult to match two data records that actually belong together.

Intelligent search for duplicates (dupes)
Exact hits, where every character is matched, are not the only results found: so too are near-duplicates (fuzzy matching) and duplicated addresses. In this, the following are taken into account, in particular:
- Typos
- Spelling variations
- Omissions and additions
- Misplaced words
- Abbreviations
- Pet names / nicknames
Everything you need for data clean-up:
- Search for duplicates / dupes inside a table.
- Search for duplicates / dupes between two tables, for example, to consider blacklists, to synchronize address lists or to enrich data.
- Search for duplicates by postal address (fuzzy matching), phone number, email address or any other criteria.
- Fuzzy / error-tolerant matching can deal with company names as well as addresses of private persons.
Other functions for quality improvement:
- Functions for selecting and enriching data.
- Detect gender based on first name.
- Determine the salutation of a letter.
- Correct the postal code format.
- Merging tables.
- Merging and splitting data fields.
- and much more ...
Numerous ways to use the result:
- The found duplicates can be deleted in the source table. Alternatively, the cleansed data can also be written in a new file.
- The found duplicates can be marked in the original table.
- The result can be used to enrich data. For example, a telephone number from a second table could be transferred to the first table using the matching result.
- further information ...
User-friendly and cost-effective:
- No technical knowledge required. Our products are designed so that hopefully you will never need our free support.
- See for yourself. Test our products for one week free of charge and without any restrictions.
- Local processing of data, no need to transfer data to an external service provider. This simplifies compliance with the General Data Protection Regulation (GDPR).
- For service providers you pay for each project individually, but you only pay once for our software. And all this with an excellent price-performance ratio. (Prices)
Fast, flexible and safe:
- Can also be used for large databases. Parallel, and therefore particularly fast, processing on systems with several processor cores.
- Data source (address lists and databases): Excel, Access, MS SQL Server, Azure SQL, ORACLE, MySQL, MariaDB, PostgreSQL, OpenOffice Calc, LibreOffice, dBase, CSV files and text files.
- All the program files have a digital signature. This ensures that the files are unchanged and actually originate from us. You can easily verify this digital signature: Properties for the program file (accessible via the right mouse button) -> Digital signatures -> Details -> Show certificate -> Details -> Applicant
AI or no AI?
The algorithm that our products use to find duplicates is not based on artificial intelligence in the sense of machine learning. Instead, it uses a complex rule-based algorithm. Compared to machine learning, this has the advantage that it requires less computing power. In addition, the results of such an algorithm are reproducible. The quality of the result always remains the same. For a well-defined problem such as finding duplicate addresses in address lists, such an algorithm is usually the better choice..
Our software products:
- DataQualityTools 8: our all-in-one solution for finding duplicates / dupes and quality improvement.
- DedupeWizard 8: our basic product for finding duplicate addresses in Excel.
- BatchDeduplicator 8: our solution for regular data clean-up.
DataQualityTools 8
DataQualityTools help you to improve your data and therefore your marketing. A key component are the functions for finding duplicate records. These functions can also be used to consider blacklists, to synchronise tables or to enrich data. In addition, there are a whole range of other functions for preparing data, such as a function to merge multiple tables or a function to determine the gender based on the first name from the address. Data from (almost) any data source can be processed. Further information ...
DedupeWizard 8
DedupeWizard is a program for the deduplication of Excel files that can be used without much technical knowledge. The telephone number, email address or postal address can be used to search for duplicates / dupes. In addition to the search within a single table, a comparison between two tables is also possible, as needed, for example, for the consideration of advertising black lists. Spelling variations are no problem. Note: If the functions contained in DedupeWizard are not sufficient for you, please take a look at DataQualityTools. Further information ...
BatchDeduplicator 8
BatchDeduplicator is a program for the regular deduplication of databases on a fixed schedule, in order to ensure the long-term data quality. In addition, projects can also be executed by calling BatchDeduplicator from the command line. This allows it to be integrated into batch files, for example. Or it can be called from an SQL server via a stored procedure. Data from (almost) any data source can be processed. Further information ...